Roboflow Docs
DashboardResourcesProducts
  • Product Documentation
  • Developer Reference
  • Changelog
  • Roboflow Documentation
  • Quickstart
  • Workspaces
    • Workspaces, Projects, and Models
    • Create a Workspace
    • Rename a Workspace
    • Delete a Workspace
  • Team Members
    • Invite a Team Member
    • Role-Based Access Control (RBAC)
    • Change a Team Member Role
    • Remove a Team Member
  • Single Sign On (SSO)
  • Workflows
    • What is Workflows?
    • Create a Workflow
    • Build a Workflow
    • Test a Workflow
    • Deploy a Workflow
    • Workflow Examples
      • Multimodal Model Workflow
    • Share a Workflow
    • Workflows AI Assistant
  • Enterprise Integrations
  • Workflow Blocks
    • Run a Model
      • Object Detection Model
      • Single-Label Classification Model
    • Visualize Predictions
      • Bounding Box Visualization
      • Label Visualization
      • Circle Visualization
      • Background Color Visualization
      • Classification Label Visualization
      • Crop Visualization
  • Dataset Management
    • Create a Project
    • Upload Images, Videos, and Annotations
      • Import Data from Cloud Providers
        • AWS S3 Bucket
        • Azure Blob Storage
        • Google Cloud Storage
      • Import from Roboflow Universe
    • Manage Datasets
      • Dataset Batches
      • Search a Dataset
      • Set Dataset Classes
      • Add Tags to Images
      • Create an Annotation Attribute
      • Download an Image
      • Delete an Image
    • Dataset Versions
      • Create a Dataset Version
      • Preprocess Images
      • Augment Images
      • Delete a Version
    • Dataset Analytics
    • Merge Projects
    • Rename a Project
    • Delete a Project
    • Project Folders
    • Make a Project Public
    • Download a Dataset
  • Annotate
    • Introduction to Roboflow Annotate
    • Annotate an Image
      • Keyboard Shortcuts
      • Comment on an Image
      • Annotation History
      • Similarity Search
    • AI Labeling
      • Label Assist
      • Enhanced Smart Polygon with SAM
        • Smart Polygon (Legacy)
      • Box Prompting
      • Auto Label
    • Set Keypoint Skeletons
    • Annotate Keypoints
    • Annotate Multimodal Data
    • Collaborate on Labeling
    • Annotation Insights
  • Managed Labeling
  • Train
    • Train a Model
      • Train from a Universe Checkpoint
      • Train from Azure Vision
      • Train from Google Cloud
    • Roboflow Instant
    • Cancel a Training Job
    • Stop Training Early
    • View Training Results
    • View Trained Models
    • Evaluate Trained Models
  • Deploy
    • Deploy a Model or Workflow
    • Supported Models
    • Managed Deployments
    • Serverless Hosted API V2
      • Use in a Workflow
      • Use with the REST API
      • Run an Instant Model
    • Serverless Hosted API
      • Object Detection
      • Classification
      • Instance Segmentation
        • Semantic Segmentation
      • Keypoint Detection
      • Foundation Models
        • CLIP
        • OCR
        • YOLO-World
      • Video Inference
        • Use a Fine-Tuned Model
        • Use CLIP
        • Use Gaze Detection
        • API Reference
        • Video Inference JSON Output Format
      • Pre-Trained Model APIs
        • Blur People API
        • OCR API
        • Logistics API
        • Image Tagging API
        • People Detection API
        • Fish Detection API
        • Bird Detection API
        • PPE Detection API
        • Barcode Detection API
        • License Plate Detection API
        • Ceramic Defect Detection API
        • Metal Defect Detection API
    • Dedicated Deployments
      • Create a Dedicated Deployment
      • Make Requests to a Dedicated Deployment
      • Manage Dedicated Deployments with an API
    • Batch Processing
    • SDKs
      • Python inference-sdk
      • Web Browser
        • inferencejs Reference
        • inferencejs Requirements
      • Lens Studio
        • Changelog - Lens Studio
      • Luxonis OAK
    • Upload Custom Model Weights
    • Download Model Weights
    • Enterprise Deployment
      • License Server
      • Offline Mode
      • Kubernetes
      • Docker Compose
    • Model Monitoring
      • Alerting
  • Universe
    • What is Roboflow Universe?
    • Find a Dataset on Universe
    • Explore Images in a Universe Dataset
    • Fork a Universe Dataset
    • Find a Model on Universe
    • Download a Universe Dataset
  • Set a Project Description
  • View Project Analytics
  • Support
    • Share a Workspace with Support
    • Delete Your Roboflow Account
    • Apply for Academic Credits
  • Billing
    • Premium Trial
    • Credits
      • View Credit Usage
      • Enable or Disable Flex Billing
      • Purchase Prepaid Credits
    • Plans
      • Purchase a Plan
      • Cancel a Plan
      • Update Billing Details
      • Update Payment Method
      • View Invoices
Powered by GitBook
On this page
  • What are Dedicated Deployments?
  • Benefits of Dedicated Deployments
  • Current Limitations
  • Types of Dedicated Deployments
  • Bill Information
  • Useful Links

Was this helpful?

  1. Deploy

Dedicated Deployments

Run Your Vision Models on Dedicated Servers with Roboflow

What are Dedicated Deployments?

Dedicated Deployments are private cloud servers managed by Roboflow, specifically designed to run your computer vision models. These models can include:

  • Object detection

  • Image segmentation

  • Classification

  • Keypoint detection

  • Foundation models like CLIP (if trained on Roboflow)

  • Roboflow Workflows (low-code vision applications)

  • ...and many others!

Benefits of Dedicated Deployments

  • Focus on your machine vision business problem, leave the infrastructure to us: Spin up inference serving infrastructure with a few clicks and without having to signup with cloud providers, installing and securing servers, managing TLS certificates or worrying about server management, patching, updates etc.

  • Dedicated Resources: Get cloud servers allocated specifically for your use, ensuring consistent performance for your models.

  • Secure Access: Dedicated Deployments are accessible with your workspace's unique API key and utilize HTTPS for secure communication.

  • Easy Integration: Each deployment receives a subdomain within roboflow.cloud, simplifying integration with your applications.

  • Pay-Per-Hour: You're only charged for the duration of the server's existence (billed in 1 minute intervals).

  • Auto Pause & Resume: Your Dedicated Deployments will automatically pause after 1 hour of inactivity. They can be quickly resumed by sending a request with your API key. This feature is designed to help you save on costs.

Current Limitations

  • All dedicated deployments are currently hosted in US-based data centers; users from other Geographies may see higher latencies. Please contact us for a customized solution if you are outside of US, we can help you to reduce the network latency.

Types of Dedicated Deployments

Roboflow offers 4 different types of Dedicated Deployments, i.e., dev-cpu, dev-gpu, prod-cpu, and prod-gpu. While dev-cpu and dev-gpu are designed for development and testing purposes, will be deleted automatically after a few hours, prod-cpu and prod-gpu are persistent, ideally for serving large-scale production traffic.

Type
Features

dev-cpu

Ephemeral: will be automatically deleted after 3 hours

CPU: model inference can be done on the CPU

Ideal for testing integrations and prototyping applications

dev-gpu

Ephemeral: will be automatically deleted after 3 hours

Ideal for testing integrations and prototyping applications

GPU: models need GPU acceleration (like Florence 2)

Ideal for testing integrations and prototyping applications

prod-cpu

Persistent: dedicated subdomain <some-name>.roboflow.cloud

CPU: model inference can be done on the CPU

Ideal for serving production traffic

prod-gpu

Persistent: dedicated subdomain <some-name>.roboflow.cloud

GPU: models need GPU acceleration (like Florence 2)

Ideal for serving production traffic

Bill Information

The rate for GPU deployments (dev-gpu, prod-gpu) is 1 credit/hour, while the rate for CPU deployments (dev-cpu, prod-cpu) is 0.25 credit/hour.

Useful Links

  • How to create a dedicated deployment (Roboflow CLI)

PreviousPre-Trained Model APIsNextCreate a Dedicated Deployment

Last updated 1 month ago

Was this helpful?

Dedicated deployments are available to Basic, Growth, Growth-UBP, and Enterprise plan workspaces. See .

If you prefer to be billed based on number of requests sent to your dedicated deployment server, please .

All dedicated deployment servers will run , our open-source inference server. Review the to learn more about all of the features available.

Roboflow plans
click here to contact our sales
Roboflow Inference
Roboflow Inference documentation
How to use a dedicated deployment
HTTP APIs
How to create a dedicated deployment (Roboflow App)