Dataset Analytics

Assess and improve the quality of your dataset.

Dataset Analytics shows a range of statistics about the dataset associated with a project. You can see the following pieces of information:

  • Number of images in your dataset;

  • Number of annotations;

  • Average image size;

  • Median image ratio;

  • Number of missing annotations;

  • Number of null annotations;

  • Image dimensions across your dataset;

  • Object count histogram, and;

  • A heatmap of annotation locations.

Using Dataset Analytics, you can derive a range of insights about your dataset. For example, if you have no null annotations, you may want to consider adding a few depending on the project on which you are working; if there are images with missing annotations, you can dig deeper to add the requisite annotations.

See more on the difference between null and missing annotations.

Dataset Analytics was previously called Health Check

Class Balance

The Dataset Analytics feature also shows class balance across your annotations. Class Balance shows how many of each object there are and easily visualizes class balance/imbalance. Imbalanced data can yield unfavorable results, especially when measuring models with accuracy.

Here is an example of the class balance feature in use:

Last updated