Label Error Detection

Enhance Your Dataset Quality with Datasaur's Label Error Detection

Introduction

Datasaur's Label Error Detection feature revolutionizes the way you approach data labeling by automating the identification of inaccuracies in your dataset. It flags potential label errors and suggests alternatives, significantly improving data integrity and model performance.

Key Features

Manual data review is notoriously time-consuming and prone to human error. Leveraging automated error detection:

Boosts Efficiency: Seamlessly pinpoint label errors without sifting through your entire dataset.
Enhances Focus: Apply a specific error threshold to zero in on the most questionable labels, optimizing your review process.
Improves Accuracy: Receive intelligent label suggestions, enabling quick, precise corrections.

Terminologies

Label Errors: Discrepancies or inaccuracies in the assigned labels of your dataset, which can mislead your model's learning process.
Error Possibility: A probability score that indicates the likelihood of a label being incorrect, allowing for prioritized review.
Label Correction Suggestions: Offering alternative labels suggested by the model for accurate and refined dataset curation.

Quick Start Guide

Unlock the full potential of label error detection in your row-based project with four simple steps:

Step 1: Enable the Feature

Activate the Label Error Detection extension from the Manage Extensions section in the right sidebar of your project dashboard.

Step 2: Configure Settings

In the Label Error Detection extensions (on the right side), configure:

The Target Column for input text.
The Target Question for labels.

Step 3: Run Detection

Ensure all rows are labeled before initiating the 'Find Label Errors' process. This might take a moment.

Detected errors in rows will appear in the Label Errors section. Adjust the error possibility threshold to fine-tune your review focus.

Step 4: Review and Adjust

Click the problematic rows from Label Errors section and make necessary corrections by editing labels directly or accepting suggested changes.

Troubleshooting Common Issues

Feature Not Working: Ensure all rows are labeled, and the extension is properly enabled.
Incorrect Suggestions: Adjust the error possibility threshold for more accurate detection.

Happy labeling!

Last updated 4 months ago