Roboflow Docs
YOLO-World


Last updated 1 year ago

YOLO-World is a zero-shot object detection model that allows you to perform object detection without any training, just by describing the items you want to detect.

You can also run YOLO-World locally using Inference, our open-source inference server.

API Reference

The base URL for our hosted API is at https://infer.roboflow.com.

For more information on using YOLO-World with Inference, through the Python SDK or the self-hosted API, see the YOLO-World Inference docs page.

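To illustrate, here is a minimal Python sketch of a hosted API call using the third-party `requests` library. The helper names (`build_payload`, `detect`) and the `"yolo-world-request"` identifier are illustrative, not part of any Roboflow SDK; the field names and defaults come from the request schema documented below.

```python
import requests  # third-party: pip install requests

API_URL = "https://infer.roboflow.com/yolo_world/infer"

def build_payload(api_key, image_url, classes, confidence=0.4, version="l"):
    """Assemble a request body using the documented fields and defaults."""
    return {
        "id": "yolo-world-request",                # required request identifier
        "api_key": api_key,
        "image": [{"type": "url", "value": image_url}],
        "text": classes,                           # classes to detect, e.g. ["person", "dog"]
        "yolo_world_version_id": version,          # model size; documented default is "l"
        "confidence": confidence,                  # minimum confidence; documented default is 0.4
    }

def detect(api_key, image_url, classes):
    """POST the payload and return the list of predictions."""
    resp = requests.post(API_URL, json=build_payload(api_key, image_url, classes))
    resp.raise_for_status()
    return resp.json()["predictions"]
```

Only `id`, `image`, and `text` are required by the schema; the remaining optional fields keep their documented defaults when omitted.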

YOLO-World inference

POST /yolo_world/infer

Run the YOLO-World zero-shot object detection model.

Query parameters

api_key (string | null, optional)
  Roboflow API Key that will be passed to the model during initialization for artifact retrieval.
Body (application/json)

Request for YOLO-World zero-shot predictions.

id (string, required)
api_key (string | null, optional)
  Roboflow API Key that will be passed to the model during initialization for artifact retrieval.
usage_billable (boolean, optional, default: true)
start (number | null, optional)
source (string | null, optional)
source_info (string | null, optional)
model_id (string | null, optional)
model_type (string | null, optional)
  The type of the model, usually referring to what task the model performs. Example: object-detection
image (required)
disable_preproc_auto_orient (boolean | null, optional, default: false)
  If true, the auto orient preprocessing step is disabled for this call.
disable_preproc_contrast (boolean | null, optional, default: false)
  If true, the auto contrast preprocessing step is disabled for this call.
disable_preproc_grayscale (boolean | null, optional, default: false)
  If true, the grayscale preprocessing step is disabled for this call.
disable_preproc_static_crop (boolean | null, optional, default: false)
  If true, the static crop preprocessing step is disabled for this call.
text (string[], required)
  A list of class names to detect. Example: ["person", "dog", "cat"]
yolo_world_version_id (string | null, optional, default: "l")
confidence (number | null, optional, default: 0.4)
Responses

200: Successful Response (application/json)
422: Validation Error (application/json)
Example request:

POST /yolo_world/infer HTTP/1.1
Host: infer.roboflow.com
Content-Type: application/json
Accept: */*
Content-Length: 431

{
  "id": "text",
  "api_key": "text",
  "usage_billable": true,
  "start": 1,
  "source": "text",
  "source_info": "text",
  "model_id": "text",
  "model_type": "object-detection",
  "image": [
    {
      "type": "url",
      "value": "http://www.example-image-url.com"
    }
  ],
  "disable_preproc_auto_orient": false,
  "disable_preproc_contrast": false,
  "disable_preproc_grayscale": false,
  "disable_preproc_static_crop": false,
  "text": [
    "person",
    "dog",
    "cat"
  ],
  "yolo_world_version_id": "l",
  "confidence": 0.4
}
Example 200 response:

{
  "visualization": "text",
  "inference_id": "text",
  "frame_id": 1,
  "time": 1,
  "image": [
    {
      "width": 1,
      "height": 1
    }
  ],
  "predictions": [
    {
      "x": 1,
      "y": 1,
      "width": 1,
      "height": 1,
      "confidence": 1,
      "class": "text",
      "class_confidence": 1,
      "class_id": 1,
      "tracker_id": 1,
      "detection_id": "text",
      "parent_id": "text"
    }
  ]
}
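Each prediction reports its box as a centre point (`x`, `y`) plus `width` and `height` in pixels. If you need corner coordinates instead (for cropping or drawing), a small helper like the hypothetical `to_corner_box` below does the conversion; this is a sketch assuming the centre-based format shown in the response above.

```python
def to_corner_box(pred):
    """Convert one prediction from centre/size to (x_min, y_min, x_max, y_max)."""
    half_w = pred["width"] / 2.0
    half_h = pred["height"] / 2.0
    return (
        pred["x"] - half_w,
        pred["y"] - half_h,
        pred["x"] + half_w,
        pred["y"] + half_h,
    )

# Example using the fields from the response schema above:
pred = {"x": 50, "y": 40, "width": 20, "height": 10, "confidence": 0.9, "class": "dog"}
print(to_corner_box(pred))  # (40.0, 35.0, 60.0, 45.0)
```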