Training Resolution by Model Type
Training resolution affects model accuracy, inference speed, and training time. Each model architecture has a default resolution that balances these factors, and Roboflow suggests that default when you select an architecture.
The tables below list the default training resolution for each model architecture and size. You can override these defaults by configuring a resize preprocessing step when you create a new Dataset Version.
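As a rough illustration of why resolution affects inference speed and training time, the sketch below compares the pixel count of a square input against a 640×640 baseline. The function name `pixel_ratio` is hypothetical, and treating pixel count as a proxy for compute is an assumption made here for illustration; actual cost depends on the architecture and hardware.

```python
def pixel_ratio(resolution: int, baseline: int = 640) -> float:
    """Return how many times more pixels a square input has than the baseline.

    Assumption: pixel count is only a rough proxy for per-image compute.
    """
    return (resolution * resolution) / (baseline * baseline)


if __name__ == "__main__":
    # Compare a few resolutions that appear on this page against 640x640.
    for side in (384, 640, 1008):
        print(f"{side}x{side}: {pixel_ratio(side):.2f}x the pixels of 640x640")
```

Under this proxy, raising the resolution increases the amount of image data the model processes quadratically, which is why a larger input can help with small objects but slows training and inference.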
Object Detection

| Task type | Model | Default training resolution |
| --- | --- | --- |
| Object Detection | RF-DETR Nano | 384×384 |
| Object Detection | RF-DETR Small | 512×512 |
| Object Detection | RF-DETR Medium | 576×576 |
| Object Detection | RF-DETR Large | 704×704 |
| Object Detection | RF-DETR X Large | 700×700 |
| Object Detection | RF-DETR 2X Large | 880×880 |
| Object Detection | Roboflow 3.0 - Fast | 640×640 |
| Object Detection | Roboflow 3.0 - Accurate | 640×640 |
| Object Detection | Roboflow 3.0 - Medium | 640×640 |
| Object Detection | Roboflow 3.0 - Large | 640×640 |
| Object Detection | Roboflow 3.0 - Extra Large | 640×640 |
| Object Detection | YOLOv26 (n/s/m/l/x) | 640×640 |
| Object Detection | YOLOv12 (n/s/m/l/x) | 640×640 |
| Object Detection | YOLOv11 (n/s/m/l/x) | 640×640 |
| Object Detection | YOLOv10 (n/s/m/b/l/x) | 640×640 |
| Object Detection | YOLOv9 (s/m/c/e) | 640×640 |
| Object Detection | YOLOv8 (n/s/m/l/x) | 640×640 |
| Object Detection | YOLOv5 (n/s/m/l/x) | 640×640 |
| Object Detection | YOLOv7 (legacy) | 640×640 |
| Object Detection | YOLO-NAS Small | 640×640 |
| Object Detection | YOLO-NAS Medium | 640×640 |
| Object Detection | Roboflow Instant | 1008×1008 |
Instance Segmentation

| Task type | Model | Default training resolution |
| --- | --- | --- |
| Instance Segmentation | RF-DETR Seg Nano | 312×312 |
| Instance Segmentation | RF-DETR Seg Small | 384×384 |
| Instance Segmentation | RF-DETR Seg Medium | 432×432 |
| Instance Segmentation | RF-DETR Seg Large | 504×504 |
| Instance Segmentation | RF-DETR Seg X Large | 624×624 |
| Instance Segmentation | RF-DETR Seg 2X Large | 768×768 |
| Instance Segmentation | Roboflow 3.0 - Fast (Seg) | 640×640 |
| Instance Segmentation | Roboflow 3.0 - Accurate (Seg) | 640×640 |
| Instance Segmentation | Roboflow 3.0 - Medium (Seg) | 640×640 |
| Instance Segmentation | Roboflow 3.0 - Large (Seg) | 640×640 |
| Instance Segmentation | Roboflow 3.0 - Extra Large (Seg) | 640×640 |
| Instance Segmentation | YOLO-seg (v8/10/11/12) | 640×640 |
| Instance Segmentation | SAM3 (Segment Anything 3) | 1008×1008 |
| Instance Segmentation | Semantic segmentation (DeepLabV3+) | ≥ 512×512 |
Classification & Pose

| Task type | Model | Default training resolution |
| --- | --- | --- |
| Classification & Pose | ResNet-18/34/50 | 224×224 |
| Classification & Pose | YOLO-cls (v8/11) | 224×224 |
| Classification & Pose | Vision Transformer (ViT) | 224×224 |
| Classification & Pose | YOLO-pose (keypoints) | 640×640 |
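Multimodal/VLM

| Task type | Model | Default training resolution |
| --- | --- | --- |
| Multimodal/VLM | PaliGemma 2 - 3B | 448×448 |
| Multimodal/VLM | PaliGemma 2 - 10B/28B | 448×448 |
| Multimodal/VLM | Florence-2 | 448×448 |
| Multimodal/VLM | Qwen 2.5 VL | 448×448 |
| Multimodal/VLM | SmolVLM2 | 384×384 |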
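
If you script your dataset preparation, it can help to keep these defaults in a small lookup. The sketch below is illustrative only: `DEFAULT_RESOLUTION` and `default_resolution` are hypothetical names rather than part of any Roboflow API, and the dictionary holds only a few values copied from the tables on this page.

```python
# Hypothetical lookup: a handful of defaults copied from the tables on this page.
# Keys are model names as listed here; values are the side length in pixels.
DEFAULT_RESOLUTION = {
    "RF-DETR Nano": 384,
    "RF-DETR Small": 512,
    "RF-DETR Medium": 576,
    "RF-DETR Large": 704,
    "Roboflow 3.0 - Fast": 640,
    "Roboflow Instant": 1008,
    "RF-DETR Seg Nano": 312,
    "YOLO-cls (v8/11)": 224,
    "SmolVLM2": 384,
}


def default_resolution(model: str) -> tuple[int, int]:
    """Return the (width, height) default for a model listed in the lookup."""
    try:
        side = DEFAULT_RESOLUTION[model]
    except KeyError:
        # Fail loudly rather than guessing a resolution for an unlisted model.
        raise KeyError(f"No default resolution recorded for {model!r}") from None
    return (side, side)


if __name__ == "__main__":
    print(default_resolution("RF-DETR Medium"))
```

Raising an error for unlisted models, instead of falling back to a value such as 640, keeps a typo in the model name from silently selecting the wrong resize target.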