Labeling Agent

In some projects, ML models are just as important as human labelers. Labeling Agents allows you to assign ML models as labelers in your project and evaluate their performance alongside human labelers. This helps you better understand which labeling approach works best for your needs — whether human, machine, or both.

Why Use Labeling Agents?

Labeling Agents simplify the process of testing and comparing ML models inside Datasaur:

  • You no longer need to create separate accounts or log in as the model to run predictions.

  • Model outputs are now part of the same analytics and comparison tools used for human labelers.

  • It’s easier to measure performance and decide what labeling strategy to use.

Requirements

  • Models must be in the same team workspace as the Data Studio project.

  • ML models must be deployed applications from LLM Labs with “Deployed” status.

  • This feature is currently only supported for Span Labeling project.

    • This is currently a limitation that will be improved in the future.

How to Create a Labeling Agent

Supported Labeling Types: Span labeling, Row labeling

In LLM Labs, create a new sandbox and set up the model to act as a labeling agent. To help the model understand what to label, you’ll need to provide clear system and user instructions. You can see this pagearrow-up-right to learn more.

The output of the LLM Labs Sandbox must be in JSON object format, aligned with the label set defined in your NLP project. This ensures compatibility with regex-based string matching for labeling in your NLP platform.

1. Prepare the label set

Make sure the label set you will be using matches the one you configure in step 2. Below is a simple example of labels that can later be used in Data Studio:

2. Define your instructions

In LLM Labs, create a new sandbox and set up the model to act as a labeling agent. To help the model understand what to label, you’ll need to provide clear system and user instructions. Below is an example setup:

System Instruction

User Instruction

3. Test with a prompt example

To check if your instructions work as expected, you can test them using an example sentence. Here's how you might write a prompt:

After you click the Run button, the expected output will be:

4. Deploy the model

You need to deploy the model first before it becomes available and visible in Data Studio as a Labeling Agent.

Using Labeling Agents

Once you’ve set up the model, you can now assign it as a labeler in Data Studio.

1. Assign models as labelers

You can assign models during the project creation process:

  1. Go to Projects page > Create New Project.

  2. Upload files and select Span Labeling.

  3. In the Assignment step, open the Labeling agents tab.

  4. Once you are in the Labeling agents tab. You can select the deployed LLM Labs Sandbox.

    Selecting deployed LLM Labs Sandbox as labeling agent
    Selecting deployed LLM Labs Sandbox as labeling agent
  5. For Row labeling project, you need to set the agent task by clicking Set a default agent task button.

    Configuring labeling agents tasks

    Configuring labeling agents tasks

    1. Target question: the column(s) you wish to answer.

    2. Input columns: the column(s) your model should use as input.

    Supported Question Types

    The Row-Based Labeling Agent supports the following question types:

    Question Type
    Description

    Radio

    Single-select from a predefined list of options

    Dropdown

    Single-select via a dropdown menu

    Hierarchical Dropdown

    Nested dropdown with parent-child option relationships

    Text

    Free-form text input

    Date

    Date picker input

    Time

    Time picker input

    Checkbox

    Multi-select from a list of options

    Slider

    Numeric value selection via a slider control

    URL

    Text input validated as a URL

  6. Complete the project setup.

circle-info

You can assign both human members and models. Each model counts toward your assignment limit.

2. Launch the project and trigger labeling

When you click Launch Project, models will automatically begin applying labels.

Current limitation:

  • Only the first label set is used.

  • Each span will only have one label.

  • Labeling agents cannot yet draw arrows.

3. Review labels applied by the labeling agent

Once all documents are fully labeled — either through external model assistance or manual input, the project can undergo a final review. This stage typically involves a reviewer ensuring the consistency and accuracy of all annotations before submission or export through Reviewer Mode.

4. View and compare performance

You can track the performance of both human labelers and models from the Analytics page.

From here, you’ll be able to compare IAA scores and other metrics across all labelers — human and models.

Best Practices

  • Use the external model as a timesaving aid but always include a human review step.

  • Train your model with high-quality data to improve suggestion accuracy.

  • Communicate clearly with labelers about how to handle model predictions.

  • Automate some of the work with consensus by using multiple models, e.g., use the consensus of 3 and deploy 3 Labeling Agents, then focus only on those that are not accepted through consensus.

FAQs

  • Can I assign multiple models to the same project?

    • Yes. You can assign up to 10 Labeling Agents.

  • Can I use Labeling Agents in Line Labeling?

    • Not yet. They can be assigned to Span + Line project but will only apply labels for Span Labeling.

  • How are Labeling Agent labels shown in the UI?

    • They are treated like human labelers but are masked. You’ll see their labels in the Reviewer mode and analytics.

Last updated