모델 유형별 학습 해상도

학습 해상도는 모델 정확도, 추론 속도 및 학습 시간에 영향을 줍니다. 각 모델 아키텍처는 이러한 요소들을 균형 있게 맞추는 기본 해상도를 가지고 있습니다. 기본적으로 Roboflow는 선택한 모델 아키텍처에 대한 기본 학습 해상도를 제안합니다.

아래 표는 각 모델 아키텍처 및 크기에 대한 기본 학습 해상도를 보여줍니다. 새 Dataset Version을 만들 때 resize 전처리 단계를 구성하여 이러한 기본값을 재정의할 수 있습니다. Dataset Version.

Object Detection

Model Type
Family & Size
Default Training Resolution

Object Detection

RF-DETR Nano

384×384

Object Detection

RF-DETR Small

512×512

Object Detection

RF-DETR Medium

576×576

Object Detection

RF-DETR Large

704×704

Object Detection

RF-DETR X Large

700x700

Object Detection

RF-DETR 2X Large

880x880

Object Detection

Roboflow 3.0 - Fast

640×640

Object Detection

Roboflow 3.0 - Accurate

640×640

Object Detection

Roboflow 3.0 - Medium

640×640

Object Detection

Roboflow 3.0 - Large

640×640

Object Detection

Roboflow 3.0 - Extra Large

640×640

Object Detection

YOLOv26(n/s/m/l/x)

640×640

Object Detection

YOLOv12 (n/s/m/l/x)

640×640

Object Detection

YOLOv11 (n/s/m/l/x)

640×640

Object Detection

YOLOv10 (n/s/m/b/l/x)

640×640

Object Detection

YOLOv9 (s/m/c/e)

640×640

Object Detection

YOLOv8 (n/s/m/l/x)

640×640

Object Detection

YOLOv5 (n/s/m/l/x)

640×640

Object Detection

YOLOv7 (legacy)

640×640

Object Detection

YOLO‑NAS Small

640×640

Object Detection

YOLO‑NAS Medium

640×640

Object Detection

Roboflow Instant

1008x1008

Instance Segmentation

Model Type
Family & Size
Default Training Resolution

Instance Segmentation

RF-DETR Seg Nano

312x312

Instance Segmentation

RF-DETR Seg Small

384x384

Instance Segmentation

RF-DETR Seg Medium

432x432

Instance Segmentation

RF-DETR Seg Large

504x504

Instance Segmentation

RF-DETR Seg X Large

624x624

Instance Segmentation

RF-DETR Seg 2X Large

768x768

Instance Segmentation

Roboflow 3.0 - Fast (Seg)

640×640

Instance Segmentation

Roboflow 3.0 - Accurate (Seg)

640×640

Instance Segmentation

Roboflow 3.0 - Medium (Seg)

640×640

Instance Segmentation

Roboflow 3.0 - Large (Seg)

640×640

Instance Segmentation

Roboflow 3.0 - Extra Large (Seg)

640×640

Instance Segmentation

YOLO-seg (v8/10/11/12)

640×640

Instance Segmentation

SAM3 (Segment Anything 3)

1008x1008

Instance Segmentation

Semantic segmentation (DeepLabV3+)

≥ 512×512

Classification & Pose

Model Type
Family & Size
Default Training Resolution

Classification & Pose

Resnet-18/34/50

224x224

Classification & Pose

YOLO-cls (v8/11)

224x224

Classification & Pose

Vision Transformer (ViT)

224x224

Classification & Pose

YOLO-pose (keypoints)

640x640

Multimodal/VLM

Model Type
Family & Size
Default Training Resolution

Multimodal/VLM

PaliGemma 2 - 3 B

448x448

Multimodal/VLM

PaliGemma 2 - 10 B/28 B

448x448

Multimodal/VLM

Florence-2

448x448

Multimodal/VLM

QWEN 2.5 VL

448x448

Multimodal/VLM

SmolVLM2

384x384

Last updated

Was this helpful?