Dataset Health Check
Assessing and improving the quality of your dataset.

The best way to understand Roboflow's Dataset Health Check is by following the Dataset Health Check Guided Tutorial, which walks through a health check on a public dataset of hard hat construction workers.

Understanding your dataset health helps you make informed decisions about preprocessing and augmentation for your dataset.

Images counts the number of images in your dataset, including those images contain missing or null annotations.
  • Missing annotations are images that do not have an accompanying annotation file.
  • Null annotations are images that deliberately do not contain any objects.
Annotations describes the total number of objects annotated (i.e. the number of bounding boxes).
Average Image Size is the size of images in megapixels.

Class Balance shows how many of each object there are and easily visualizes class balance/imbalance. Imbalanced data can yield unfavorable results, especially when measuring models with accuracy.

Roboflow Pro users have access to additional health check features. Get a walkthrough of these additional features here.
Copy link
On this page
Follow the Guided Tutorial
Breaking Down the Health Check (Object Detection)
The Basics
Class Balance
Advanced Health Check