> For the complete documentation index, see [llms.txt](https://docs.roboflow.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.roboflow.com/developer/rest-api/model-evaluations.md).

# Model Evaluations

A model evaluation captures how a model performs on a Version's test split: per-class metrics, confidence-threshold curves, image-embedding clustering, per-image predictions, and improvement recommendations. For object detection and instance segmentation, the headline metric is mAP (mean average precision); for semantic segmentation, it is mIoU (mean intersection over union). Evaluations are produced automatically when a training job completes and can be re-triggered manually from the app.

The Model Evaluations API lets you read everything the app's evaluation page shows. Each panel in the UI maps to a dedicated endpoint:

* [List model evaluations in a workspace](/developer/rest-api/model-evaluations/list-model-evaluations.md)
* [Get one evaluation's metadata and headline metrics](/developer/rest-api/model-evaluations/get-a-model-evaluation.md)
* [Get full per-split metric detail (mAP or mIoU)](/developer/rest-api/model-evaluations/map-results.md)
* [Get the confidence-threshold sweep and F1-optimal thresholds](/developer/rest-api/model-evaluations/confidence-sweep.md)
* [Get per-class performance for one split](/developer/rest-api/model-evaluations/performance-by-class.md)
* [Get the confusion matrix](/developer/rest-api/model-evaluations/confusion-matrix.md)
* [Get the image-embedding clustering (vector analysis)](/developer/rest-api/model-evaluations/vector-analysis.md)
* [Get per-image predictions](/developer/rest-api/model-evaluations/per-image-predictions.md)
* [Get model improvement recommendations](/developer/rest-api/model-evaluations/recommendations.md)

## Authentication

All endpoints require an API key with the `model-eval:read` scope. Pass it as a query parameter or as a `Bearer` token in the `Authorization` header.
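As a minimal sketch of the header option, the snippet below builds (but does not send) a GET request that carries the key as a `Bearer` token. The URL is a hypothetical placeholder and `build_request` is an illustrative helper, not part of any Roboflow SDK; take the real endpoint URLs from the endpoint pages linked above.

```python
import urllib.request

# Hypothetical placeholder URL; the real paths are documented on each endpoint page.
EXAMPLE_URL = "https://api.example.com/model-evaluations"


def build_request(url: str, api_key: str) -> urllib.request.Request:
    """Return a GET request that passes the API key as a Bearer token."""
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


# The key must carry the `model-eval:read` scope.
req = build_request(EXAMPLE_URL, "YOUR_API_KEY")
print(req.get_header("Authorization"))
```

Sending the key in a header rather than in the query string keeps it out of URLs that may end up in logs or browser history.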

## Common errors

| Status | Error code             | When                                                                      |
| ------ | ---------------------- | ------------------------------------------------------------------------- |
| `401`  | `unauthenticated`      | API key missing or invalid                                                |
| `404`  | `model_eval_not_found` | Evaluation does not exist or belongs to a different workspace             |
| `409`  | `model_eval_not_done`  | Evaluation has not completed; the panel data is not yet available         |
| `400`  | `invalid_confidence`   | `confidence` query parameter is not an integer in `[0, 100]`              |
| `400`  | `invalid_split`        | `split` query parameter is not one of the allowed values for the endpoint |
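
One way a client might act on these codes is sketched below: check `confidence` before sending the request, and map each error code from the table to a follow-up action. The action labels and both helper names are illustrative assumptions, and the sketch takes the error code as a plain string because this page does not specify how the code appears in the response body.

```python
def is_valid_confidence(value) -> bool:
    """True if value is an integer in [0, 100], the range the API accepts."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, int):
        return False
    return 0 <= value <= 100


# Illustrative mapping from the error codes in the table to a client-side action.
ACTIONS = {
    "unauthenticated": "check_api_key",
    "model_eval_not_found": "check_id_and_workspace",
    "model_eval_not_done": "retry_later",  # 409: evaluation still running
    "invalid_confidence": "fix_request",
    "invalid_split": "fix_request",
}


def suggest_action(error_code: str) -> str:
    """Return a suggested action for an error code, or 'unknown'."""
    return ACTIONS.get(error_code, "unknown")


print(is_valid_confidence(50), suggest_action("model_eval_not_done"))
```

Of the listed errors, only `model_eval_not_done` describes a condition that clears without changing the request, so it is the one case where retrying the same request later makes sense.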