Roboflow Docs
ResourcesProductsSign InBook a DemoGet Started
  • Product Documentation
  • Developer Reference
  • Changelog
  • Roboflow Documentation
  • Quickstart
  • Workspaces
    • Workspaces, Projects, and Models
    • Create a Workspace
    • Rename a Workspace
    • Delete a Workspace
  • Team Members
    • Invite a Team Member
    • Role-Based Access Control (RBAC)
    • Change a Team Member Role
    • Remove a Team Member
  • Single Sign On (SSO)
  • Workflows
    • What is Workflows?
    • Create a Workflow
    • Build a Workflow
    • Test a Workflow
    • Deploy a Workflow
    • Workflow Examples
      • Multimodal Model Workflow
    • Share a Workflow
    • Workflows AI Assistant
  • Enterprise Integrations
  • Workflow Blocks
    • Run a Model
      • Object Detection Model
      • Single-Label Classification Model
    • Visualize Predictions
      • Bounding Box Visualization
      • Label Visualization
      • Circle Visualization
      • Background Color Visualization
      • Classification Label Visualization
      • Crop Visualization
  • Dataset Management
    • Create a Project
    • Upload Images, Videos, and Annotations
      • Import Data from Cloud Providers
        • AWS S3 Bucket
        • Azure Blob Storage
        • Google Cloud Storage
      • Import from Roboflow Universe
    • Manage Datasets
      • Dataset Batches
      • Search a Dataset
      • Set Dataset Classes
      • Add Tags to Images
      • Create an Annotation Attribute
      • Download an Image
      • Delete an Image
    • Dataset Versions
      • Create a Dataset Version
      • Preprocess Images
      • Image Augmentation
        • Augmentation Types
          • Flip Augmentation
          • 90º Rotate Augmentation
          • Crop Augmentation
          • Rotation Augmentation
          • Shear Augmentation
          • Grayscale Augmentation
          • Hue Augmentation
          • Saturation Augmentation
          • Brightness Augmentation
          • Exposure Augmentation
          • Blur Augmentation
          • Noise Augmentation
          • Cutout Augmentation
          • Mosaic Augmentation
        • Add Augmentations to Images
      • Delete a Version
    • Dataset Analytics
    • Merge Projects
    • Rename a Project
    • Delete a Project
    • Project Folders
    • Make a Project Public
    • Download a Dataset
  • Annotate
    • Introduction to Roboflow Annotate
    • Annotate an Image
      • Keyboard Shortcuts
      • Comment on an Image
      • Annotation History
      • Similarity Search
    • AI Labeling
      • Label Assist
      • Smart Polygon
      • Box Prompting
      • Auto Label
    • Set Keypoint Skeletons
    • Annotate Keypoints
    • Annotate Multimodal Data
    • Collaborate on Labeling
    • Annotation Insights
  • Roboflow Labeling Services
  • Train
    • Train a Model
      • Train from a Universe Checkpoint
      • Train from Azure Vision
      • Train from Google Cloud
    • Roboflow Instant
    • Cancel a Training Job
    • Stop Training Early
    • View Training Results
    • View Trained Models
    • Evaluate Trained Models
  • Deploy
    • Deploy a Model or Workflow
    • Supported Models
    • Serverless Hosted API V2
      • Use in a Workflow
      • Use with the REST API
      • Run an Instant Model
    • Serverless Hosted API
      • Object Detection
      • Classification
      • Instance Segmentation
        • Semantic Segmentation
      • Keypoint Detection
      • Foundation Models
        • CLIP
        • OCR
        • YOLO-World
      • Video Inference
        • Use a Fine-Tuned Model
        • Use CLIP
        • Use Gaze Detection
        • API Reference
        • Video Inference JSON Output Format
      • Pre-Trained Model APIs
        • Blur People API
        • OCR API
        • Logistics API
        • Image Tagging API
        • People Detection API
        • Fish Detection API
        • Bird Detection API
        • PPE Detection API
        • Barcode Detection API
        • License Plate Detection API
        • Ceramic Defect Detection API
        • Metal Defect Detection API
    • Dedicated Deployments
      • Create a Dedicated Deployment
      • Make Requests to a Dedicated Deployment
      • Manage Dedicated Deployments with an API
    • Self-Hosted Deployment
    • Batch Processing
    • SDKs
      • Python inference-sdk
      • Web Browser
        • inferencejs Reference
        • inferencejs Requirements
      • Lens Studio
        • Changelog - Lens Studio
      • Luxonis OAK
    • Upload Custom Model Weights
    • Download Model Weights
    • Enterprise Deployment
      • License Server
      • Offline Mode
      • Kubernetes
      • Docker Compose
    • Device Manager
      • Add a Device
      • Add a Stream
      • View a Stream
    • Model Monitoring
      • Alerting
  • Universe
    • What is Roboflow Universe?
    • Find a Dataset on Universe
    • Explore Images in a Universe Dataset
    • Fork a Universe Dataset
    • Find a Model on Universe
    • Download a Universe Dataset
  • Set a Project Description
  • View Project Analytics
  • Support
    • Share a Workspace with Support
    • Delete Your Roboflow Account
    • Apply for Academic Credits
  • Billing
    • Premium Trial
    • Credits
      • View Credit Usage
      • Enable or Disable Flex Billing
      • Purchase Prepaid Credits
    • Plans
      • Purchase a Plan
      • Cancel a Plan
      • Update Billing Details
      • Update Payment Method
      • View Invoices
Powered by GitBook
On this page
  • Dimension Insights
  • Annotation Heat Map
  • Histogram of Object Count by Image

Was this helpful?

  1. Dataset Management

Dataset Analytics

Assess and improve the quality of your dataset.

PreviousDelete a VersionNextMerge Projects

Last updated 3 days ago

Was this helpful?

Dataset Analytics shows a range of statistics about the dataset associated with a project. You can see the following pieces of information:

  • Number of images in your dataset;

  • Number of annotations;

  • Average image size;

  • Median image ratio;

  • Number of missing annotations;

  • Number of null annotations;

  • Image dimensions across your dataset;

  • Object count histogram, and;

  • A heatmap of annotation locations.

Using Dataset Analytics, you can derive a range of insights about your dataset. For example, if you have no null annotations, you may want to consider adding a few depending on the project on which you are working; if there are images with missing annotations, you can dig deeper to add the requisite annotations.

To see Dataset Analytics for a project, click "Analytics" in the left sidebar of a project:

The Dataset Analytics tab will then open:

On this page, you can see:

  • A breakdown of the number of classes in the images in your train, test, and valid datasets.

  • An overview of the sizes and aspect ratios of the images in your dataset.

  • A heatmap showing where most of your annotations are.

  • A histogram showing how many classes are annotated in each image in your dataset.

Dimension Insights

The Dimension Insights section describes the sizes and aspect ratios of raw images in your dataset.

If you apply the Resize augmentation when you create a project version — which we strongly recommend for almost all use cases — images in your version will be resizes, but the raw images will stay the same.

Annotation Heat Map

When you are training a model, it is important that your dataset is representative of the conditions in which your model will be deployed.

If your model will be deployed in an environment in which annotations may appear anywhere in the camera frame — for example, on a factory line where objects of different sizes are moving in real time, or in an image taken on a phone of an object — it is important that you annotate objects that appear in different places in an image.

Labeling objects in different parts of an image will ensure your model doesn't overfit to learn only how to identify objects in specific places.

The Annotation Heat Map shows where there are more or less annotations in images. This can be used to identify scenarios where your dataset annotations are too concentrated in a particular place.

You can drag over an area in the Heat Map to see images in the chosen range:

Histogram of Object Count by Image

This shows you the distribution of how many annotated objects appear in each image.

If images that you pass through your model may contain multiple instances of an object, we recommend ensuring your dataset contains images with different numbers of object instances. This will help you ensure your model can generalise well to images with no, one, or multiple objects of interest.

Here if an example of a histogram:

You can select any of the bars on the histogram to see images with a given count: