> For the complete documentation index, see [llms.txt](https://docs.roboflow.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.roboflow.com/roboflow/roboflow-hi/train/evaluate-trained-models.md).

# Trained Models का मूल्यांकन करें

Model evaluations दिखाती हैं:

1. एक production metrics explorer, जो आपको वह optimal confidence threshold खोजने में मदद करता है जिस पर आप अपने model को run कर सकते हैं;
2. Model improvement recommendations, जो इस बारे में सुझाव देती हैं कि आप अपने model की accuracy कैसे बढ़ा सकते हैं;
3. Performance by class, जो दिखाता है कि आपका model अलग-अलग classes को कितनी अच्छी तरह पहचानता है;
4. एक confusion matrix, जिसका उपयोग आप उन specific classes को खोजने के लिए कर सकते हैं जिनमें आपका model अच्छा करता है और जिनमें वह संघर्ष करता है, और;
5. एक interactive vector explorer, जो आपको images के clusters पहचानने देता है जहाँ आपका model अच्छा या खराब प्रदर्शन करता है;

आप model evaluation का उपयोग अपने model में सुधार के क्षेत्रों की पहचान करने के लिए कर सकते हैं।

Paid users द्वारा Roboflow पर train किए गए या upload किए गए सभी versioned models के लिए model evaluations अपने आप run होती हैं। कुछ सौ images वाले dataset के लिए evaluation run होने में कई मिनट लग सकते हैं, और हजारों या उससे अधिक images वाले बड़े datasets के लिए कई घंटे लग सकते हैं।

### Supported Project Types

Model evaluation Object Detection, Instance Segmentation, Classification, और Semantic Segmentation projects को support करती है।

Semantic Segmentation के लिए, प्रमुख metric है **mIoU** (mean Intersection-over-Union) mAP के बजाय। सभी metrics (precision, recall, F1) प्रति-instance के बजाय pixel level पर compute की जाती हैं। per-class breakdown में IoU, precision, recall, F1, और प्रत्येक class के लिए एक optimal confidence threshold दिखता है। Confusion matrix के values object counts के बजाय pixel counts को दर्शाते हैं।

### Open Model Evaluation

अपने model के लिए confusion matrix और vector explorer खोजने के लिए, अपने project में किसी भी trained model version को खोलें। फिर, "View Evaluation" बटन पर क्लिक करें:

<figure><img src="/files/d19d702b435278edd8ab44e2b34bb4e4b50eaf62" alt=""><figcaption></figcaption></figure>

एक window खुलेगी जहाँ आप अपनी confusion matrix और vector analysis देख सकते हैं।

### Production Metrics Explorer

Production metrics explorer आपके model के लिए सभी संभव confidence thresholds पर Precision, Recall, और F1 score दिखाता है। यह जानकारी एक graph पर प्रस्तुत की जाती है।

इन statistics का उपयोग करके, production metrics explorer एक "optimal confidence" की अनुशंसा करेगा। यह वह threshold है जो आपको Precision/Recall/F1 Score का सबसे अच्छा trade-off देगा।

Model evaluation पूरी होने के बाद, optimal confidence threshold अपने आप आपके model के inference requests के default के रूप में लागू हो जाता है। यदि per-class thresholds उपलब्ध हैं, तो वे भी लागू होते हैं, और किसी भी class जिसके लिए अपना value नहीं है, उसके लिए global threshold fallback के रूप में उपयोग होता है।

आप अभी भी किसी भी individual inference request पर confidence threshold को override कर सकते हैं, इसके लिए `confidence` parameter को स्पष्ट रूप से pass करें।

<figure><img src="/files/3950d04a2251eacbe5a3209211f8266e45569b20" alt=""><figcaption></figcaption></figure>

आप slider को drag करके अलग-अलग confidence thresholds पर F1/Precision/Recall values देख सकते हैं:

<figure><img src="/files/b07ede186f569ca1d97d07d8b17521407c79ed30" alt=""><figcaption></figcaption></figure>

### Model Improvement Recommendations

आपके model evaluation का model improvement recommendations section सुझावों की सूची देता है कि आप अपने model की accuracy कैसे बढ़ा सकते हैं। ये improvements आपके model के साथ calculate की गई confusion matrix के results पर आधारित हैं। (इस page पर आगे अपनी confusion matrix के बारे में अधिक जानकारी देखें).

Model improvement recommendations feature इनसे संबंधित सुझाव दे सकता है:

* ऐसे model को कैसे बेहतर करें जो बहुत सारे false negatives predict करता है।
* ऐसे model को कैसे बेहतर करें जो बहुत सारे false positives predict करता है।
* कौन-सी classes अक्सर confuse (गलत पहचानी) जाती हैं।
* कौन-सी classes की accuracy सुधारने के लिए अधिक data की जरूरत है।
* जब test या validation set बहुत छोटा हो सकता है।
* और भी।

<figure><img src="/files/8f681dec82adc1273ddb3cd5fc9793ef6d94b243" alt=""><figcaption></figcaption></figure>

### Performance by Class

performance by class chart दिखाता है कि आपके dataset की सभी classes में कितनी correct predictions, misclassifications, false negatives, और false positives हैं।

आप इस जानकारी का उपयोग एक नज़र में यह देखने के लिए कर सकते हैं कि आपका model किन classes को अच्छी तरह पहचान सकता है और किन classes को पहचानने में हमारे model को संघर्ष होता है।

<figure><img src="/files/5ce7c00a360627eda210d2a5b6db14eda14e1d6d" alt=""><figcaption></figcaption></figure>

यदि आपके dataset में classes की संख्या बहुत अधिक है, तो आप "All Classes" dropdown खोलकर और जिन classes को highlight करना चाहते हैं उन्हें चुनकर chart को विशिष्ट classes पर केंद्रित कर सकते हैं:

<figure><img src="/files/7182f2ef853e5eb27befd49c378131378b2f8c92" alt=""><figcaption></figcaption></figure>

आप Confidence Threshold slider को हिलाकर यह भी देख सकते हैं कि अलग-अलग confidence thresholds पर यह chart कैसे बदलता है:

<figure><img src="/files/c34d186e521c3b796185643699a21c7fd0d5ef60" alt=""><figcaption></figcaption></figure>

डिफ़ॉल्ट रूप से, यह chart उस optimal confidence threshold का उपयोग करेगा जिसकी हम अनुशंसा करते हैं।

### Confusion Matrix

आपकी confusion matrix दिखाती है कि आपका model अलग-अलग classes पर कितना अच्छा प्रदर्शन करता है।

आपकी confusion matrix आपके test और validation sets की images को आपके trained model के साथ run करके calculate की जाती है। इसके बाद आपके model के results की तुलना आपके dataset annotations से प्राप्त "ground truth" के साथ की जाती है।

confusion matrix tool की मदद से, आप पहचान सकते हैं:

* वे classes जिन पर आपका model अच्छा प्रदर्शन करता है।
* वे classes जहाँ आपका model किसी object के लिए गलत class पहचानता है (false positives)।
* वे instances जहाँ आपका model ऐसे object की पहचान करता है जो मौजूद नहीं है (false negatives)।

यहाँ एक उदाहरण confusion matrix है:

<figure><img src="/files/f890564655cb8d718c9f75de09910fa157e39e91" alt=""><figcaption></figcaption></figure>

यदि आपका model कई classes detect करता है, तो scroll bars दिखाई देंगी जो आपको अपनी confusion matrix में नेविगेट करने देती हैं।

डिफ़ॉल्ट रूप से, confusion matrix दिखाती है कि आपका model तब कैसा प्रदर्शन करता है जब उसे आपके model के लिए calculate किए गए optimal threshold पर run किया जाता है।

आप Confidence Threshold slider का उपयोग करके confidence threshold को adjust कर सकते हैं। जैसे-जैसे आप slider configure करेंगे, आपकी confusion matrix, precision, और recall update हो जाएंगे:

<figure><img src="/files/f111576e4673e92717ebeaae52fd1294097c322c" alt=""><figcaption></figcaption></figure>

आप confusion matrix के हर box पर क्लिक करके देख सकते हैं कि संबंधित category में कौन-सी images दिखाई देती हैं।

उदाहरण के लिए, आप "False Positive" column में किसी भी box पर क्लिक करके उन images की पहचान कर सकते हैं जहाँ ground truth data में कोई object मौजूद नहीं था, फिर भी उसकी पहचान की गई।

<figure><img src="/files/e46d724107a4d36cc0f1eeb791bafe70b1041f32" alt=""><figcaption></figcaption></figure>

आप किसी individual image पर क्लिक करके एक interactive view में जा सकते हैं जहाँ आप ground truth (आपकी annotations) और model predictions के बीच toggle कर सकते हैं:

<figure><img src="/files/bff592e7c8ff325b82cae2c8f45f96a8c9cb70c2" alt=""><figcaption></figcaption></figure>

अपनी annotations देखने के लिए "Ground Truth" पर क्लिक करें और आपका model क्या लौटा, यह देखने के लिए "Model Predictions" पर क्लिक करें।


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.roboflow.com/roboflow/roboflow-hi/train/evaluate-trained-models.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.