# Label Error Detection

## Introduction

Datasaur's **Label error detection** feature revolutionizes the way you approach data labeling by automating the identification of inaccuracies in your dataset. It flags potential label errors and suggests alternatives, significantly improving data integrity and model performance.

<figure><img src="https://448889121-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MbjY0HseEqu7LtYAt4d%2Fuploads%2Fgit-blob-e87837f45127a7b47f8c6c21ba78898acfc2422a%2FExtension%20-%20Label%20error%20detection%20-%20cover.png?alt=media" alt=""><figcaption></figcaption></figure>

## **Key Features**

Manual data review is notoriously time-consuming and prone to human error. Leveraging automated error detection:

1. **Boosts efficiency:** Seamlessly pinpoint label errors without sifting through your entire dataset.
2. **Enhances focus:** Apply a specific error threshold to zero in on the most questionable labels, optimizing your review process.
3. **Improves accuracy:** Receive intelligent label suggestions, enabling quick, precise corrections.

## Terminologies

1. **Label errors**: Discrepancies or inaccuracies in the assigned labels of your dataset, which can mislead your model's learning process.
2. **Error possibility**: A probability score that indicates the likelihood of a label being incorrect, allowing for prioritized review.
3. **Label correction suggestions:** Offering alternative labels suggested by the model for accurate and refined dataset curation.

## **Quick Start Guide**

Unlock the full potential of label error detection in your row-based project with four simple steps:

### **Step 1: Enable the Feature**

Activate the **Label error detection** extension from the [Manage extensions](https://docs.datasaur.ai/advanced/extensions) button (gear icon) in the extension panel on the right.

<figure><img src="https://448889121-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MbjY0HseEqu7LtYAt4d%2Fuploads%2Fgit-blob-c6d9dccef5eb34ac30229a257068dee463798393%2FExtension%20-%20Manage%20extensions%20-%20Label%20error%20detection.png?alt=media" alt=""><figcaption><p>Activating label error detection.</p></figcaption></figure>

### **Step 2: Configure Settings**

In the **Label error detection** extensions (on the right side), configure:

1. The **Target column** for input text.
2. The **Target question** for labels.

<figure><img src="https://448889121-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MbjY0HseEqu7LtYAt4d%2Fuploads%2Fgit-blob-08cc6554cd8a63d25b7c171ab825b539ca4968e6%2FExtension%20-%20Label%20error%20detection%20-%20project%20-%20initial.png?alt=media" alt=""><figcaption><p>Setting up your detection parameters.</p></figcaption></figure>

### **Step 3: Run Detection**

Ensure all rows are labeled before clicking **Find label errors**. This might take a moment.

<figure><img src="https://448889121-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MbjY0HseEqu7LtYAt4d%2Fuploads%2Fgit-blob-c7badb8c53976b34e23c86f7d251da32b1340422%2FExtension%20-%20Label%20error%20detection%20-%20project%20-%20predicting.png?alt=media" alt=""><figcaption><p>Initiating error detection.</p></figcaption></figure>

Detected errors in rows will appear in the **Label errors** section. Adjust the error possibility threshold to fine-tune your review focus.

<figure><img src="https://448889121-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MbjY0HseEqu7LtYAt4d%2Fuploads%2Fgit-blob-c1ac5d76de14de46d04299c1a7e063a127facffd%2FExtension%20-%20Label%20error%20detection%20-%20project%20-%20predicted.png?alt=media" alt=""><figcaption><p>Identifying errors in your dataset.</p></figcaption></figure>

### **Step 4: Review and Adjust**

Click the problematic rows from the **Label errors** section and make necessary corrections by editing labels directly or accepting suggested changes.

<figure><img src="https://448889121-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MbjY0HseEqu7LtYAt4d%2Fuploads%2Fgit-blob-9a989f194c6331b1860142dd1937f08b6329abce%2FExtension%20-%20Label%20error%20detection%20-%20review%20suggestions.png?alt=media" alt=""><figcaption><p>Focusing on label errors for correction.</p></figcaption></figure>

## **Troubleshooting Common Issues**

* **Feature not working:** Ensure all rows are labeled, and the extension is properly enabled.
* **Incorrect suggestions:** Adjust the **Error possibility** threshold for more accurate detection.

Happy labeling!
