> For the complete documentation index, see [llms.txt](https://docs.roboflow.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.roboflow.com/roboflow/roboflow-ko/deploy/serverless.md). # (레거시) Serverless Hosted API {% hint style="info" %} 저희는 **권장합니다** 저희 Serverless Hosted API의 V2를 사용하는 것을 권장합니다. V2 API가 더 빠릅니다.\ \ [새 API를 시작하려면 Serverless Hosted API V2 문서를 참조하세요.](/roboflow/roboflow-ko/deploy/serverless-hosted-api-v2.md) {% endhint %} ## 모델 지원 {% hint style="warning" %} Florence-2, SAM 3 및 기타 최신 모델 아키텍처는 Serverless Hosted API V2에서만 사용할 수 있습니다. credit-based plan으로 이전하지 않은 기존 workspaces는 `402` 이 모델을 사용하려고 하면 오류가 발생합니다. [요금제를 업그레이드하세요](/roboflow/roboflow-ko/billing/plans/purchase-a-plan.md) 를 통해 지원되는 전체 모델 세트를 이용하려면 [Serverless Hosted API V2](/roboflow/roboflow-ko/deploy/serverless-hosted-api-v2.md). {% endhint %} Serverless Hosted API(v1)에서 지원되는 모델 유형은 다음과 같습니다: | 작업 유형 | Hosted API(v1)에서 지원됨 | | ---------------------------------------------------------------------------------------------------- | -------------------- | | [객체 탐지](/roboflow/roboflow-ko/deploy/serverless/object-detection.md) | ✅ | | [분류](/roboflow/roboflow-ko/deploy/serverless/classification.md) | ✅ | | [인스턴스 세그멘테이션](/roboflow/roboflow-ko/deploy/serverless/instance-segmentation.md) | ✅ | | [시맨틱 세그멘테이션](/roboflow/roboflow-ko/deploy/serverless/instance-segmentation/semantic-segmentation.md) | ✅ | | [키포인트 감지](/roboflow/roboflow-ko/deploy/serverless/keypoint-detection.md) | ✅ | ## 지연 시간 비교(v1 vs v2) Serverless Hosted API로 전송된 요청의 엔드투엔드 지연 시간은 여러 요인에 따라 달라집니다: 1. 실행 시간에 영향을 미치는 모델 아키텍처 2. 업로드 시간과 실행 중 모델 추론 시간에 영향을 미치는 이미지의 크기와 해상도 3. 요청 업로드 시간과 응답 다운로드 시간에 영향을 미치는 네트워크 지연 시간과 대역폭. 4. 특정 시점의 서비스 구독 및 다른 사용자의 사용으로 인해 대기열 지연이 발생할 수 있습니다

아래 표에는 v1과 v2 Serverless Hosted API의 대표적인 벤치마크를 보여 드립니다. 이 표는 엔드투엔드 지연 시간(E2E)과 실행 시간(Exec)을 모두 보여 줍니다. 이 수치는 참고용이며, 사용자가 다음을 사용해 직접 벤치마크를 수행해 보시기를 권장합니다 [저희 추론 벤치마크 도구](https://inference.roboflow.com/inference_helpers/cli_commands/benchmark/) 또는 자체 사용자 지정 벤치마크를 사용해 보세요.

모델	V2 (엔드투엔드)	V2 (실행)	V1 (엔드투엔드)	V1 (실행)
yolov8x-640	401 ms	29 ms	4084 ms	821 ms
yolov8m-640	757 ms	21 ms	572 ms	265 ms
yolov8n-640	384 ms	17 ms	312 ms	63 ms
yolov8x-1280	483 ms	97 ms	6431 ms	3032 ms
yolov8m-1280	416 ms	52 ms	1841 ms	1006 ms
yolov8n-1280	428 ms	35 ms	464 ms	157 ms

사용자가 자신의 모델 추론과 워크플로에 대해 직접 벤치마크를 실행하여 특정 사용 사례에 대한 실제 지표를 얻어 보시기를 권장합니다. ## 제한 사항 Serverless Hosted API(v1)는 특정 작업 유형과 관계없이 최대 5MB의 파일을 허용합니다. 이 제한에는 이미지 파일 크기와 첨부된 모든 요청 정보가 포함되며, 이에 국한되지 않습니다. {% hint style="info" %} 요청이 너무 큰 경우, 첨부된 이미지를 축소하는 것을 권장합니다. 이미지는 서버에 수신된 후 모델 아키텍처가 허용하는 입력 크기로 다시 축소되므로, 이는 일반적으로 성능 저하를 초래하지 않습니다.\ \ Python SDK와 같은 일부 SDK는 API로 전송하기 전에 이미지를 모델 아키텍처의 입력 크기로 자동 축소합니다. {% endhint %}