For the complete documentation index, see llms.txt. This page is also available as Markdown.

Manage Dedicated Deployments

Provision and manage Dedicated Deployment GPU machines from Python.

Dedicated Deployments are managed GPU machines that run your Roboflow models with predictable latency and high throughput. The SDK manages them through the roboflow.adapters.deploymentapi adapter - the high-level Workspace class doesn't currently expose deployment methods.

Each function returns a (status_code, body) tuple so you can branch on the HTTP result:

from roboflow.adapters import deploymentapi

status, body = deploymentapi.list_deployment("YOUR_API_KEY")
if status == 200:
    for d in body.get("deployments", []):
        print(d["deployment_name"], d["status"])
else:
    print("Failed:", body)

List available machine types

from roboflow.adapters import deploymentapi

status, body = deploymentapi.list_machine_types("YOUR_API_KEY")
for m in body.get("machine_types", []):
    print(m["name"], m.get("description"))

Create a deployment

The deployment provisions asynchronously. Poll get_deployment until status == "ready".

Get deployment details

Pause / resume / delete

Logs

Usage

Running inference against a dedicated deployment

Once a deployment is ready, point inference SDK calls at its public_url (returned by get_deployment):

REST and CLI equivalents

Last updated

Was this helpful?