# Pre-Labeled Project

**Pre-labeled projects** allow you to create a new labeling project using a file that already contains labeled data. This helps you start faster since you don’t have to label everything from scratch.

## Use cases

It is especially useful in the following scenarios:

* **Streamlined onboarding:** When starting a new project that shares the same labeling schema as previous one, you can use a pre-labeled file to quickly set up the new project.
* **Consistency in labeling:** When you need to apply a standard set of labels across multiple projects, pre-labeled projects help ensure uniformity.
* **Efficiency:** Save time by using pre-defined labels for projects with known labeling requirements.
* **Data preparation:** Import data with preliminary labels from external sources directly into Datasaur.

## How to create pre-labeled projects

### Span labeling

1. Prepare the pre-labeled file. Supported formats include:
   * **Span:** [IOB TSV](https://docs.datasaur.ai/data-studio-projects/pages/hnBPIPVkJZS5fLhyUjKO#iob-specialized-.tsv), [TSV non-IOB](/compatibility-and-updates/supported-formats.md#tsv_non_iob), [JSON Simplified](/compatibility-and-updates/supported-formats.md#json), [JSON Advanced](/compatibility-and-updates/supported-formats.md#json_advanced), [CoNLL-U](/compatibility-and-updates/supported-formats.md#conll-u), [CoNLL 2003](/compatibility-and-updates/supported-formats.md#conll_2003), [Datasaur Schema (.json)](/compatibility-and-updates/supported-formats.md#datasaur-schema-format)
   * **Span with arrows:** [TSV non-IOB](/compatibility-and-updates/supported-formats.md#tsv_non_iob), [JSON Advanced](/compatibility-and-updates/supported-formats.md#json_advanced), [CoNLL-U](/compatibility-and-updates/supported-formats.md#conll-u), [Datasaur Schema (.json)](/compatibility-and-updates/supported-formats.md#datasaur-schema-format)
   * **Character-based span labeling:** [JSON Simplified](/compatibility-and-updates/supported-formats.md#json), [JSON Advanced](/compatibility-and-updates/supported-formats.md#json_advanced),[Datasaur Schema (.json)](/compatibility-and-updates/supported-formats.md#datasaur-schema-format)
2. Go to **Projects** page and click **Create project.**
3. In step 1, upload the pre-labeled file.

   <figure><img src="/files/nIBkbHb8dLOk3fvYlfGr" alt=""><figcaption></figcaption></figure>
4. Complete the remaining project setup steps, and your span labeling project will be ready with pre-labeled data.

   <figure><img src="/files/1raJdo1Z6JLbpa1GoHtb" alt=""><figcaption></figcaption></figure>

### Row labeling

1. Prepare the pre-labeled file. Supported formats include: [CSV](/compatibility-and-updates/supported-formats.md#csv), [JSON Tabular](/compatibility-and-updates/supported-formats.md#json_tabular), [TSV](/compatibility-and-updates/supported-formats.md#tsv), [XLS and XLSX](/compatibility-and-updates/supported-formats.md#xls-and-xlsx), [JSON Lines](/compatibility-and-updates/supported-formats.md#jsonl-json-lines), [Datasaur Schema (.json)](/compatibility-and-updates/supported-formats.md#datasaur-schema-format)
   * Include a column in the file for the answers to the questions you will configure in step 3 of project creation.
2. Go to **Projects** page and click **Create project.**
3. In step 1, upload the pre-labeled file.

   <figure><img src="/files/YlxTzIdbTU23Qhv5RVbW" alt=""><figcaption></figcaption></figure>
4. In step 3, create or upload the question set. Then link each answer column to its question by using [**Refer answer to table column**](/data-studio-projects/lets-get-labeling/label-sets/refer-answer-to-table-column.md) in **More settings** accordion.

   <figure><img src="/files/fUCBSGKJh5moEi4Q0G69" alt=""><figcaption></figcaption></figure>
5. Complete the remaining project setup steps, and your row labeling project will be ready with pre-labeled data.

   <figure><img src="/files/G3hyLw2DJAi2h3csBLLi" alt=""><figcaption></figcaption></figure>

### Document labeling

1. Prepare your documents and pre-labeled answer files. Pre-labeling works as long as the media files are uploaded together with their answer files.
   * **Notes:**
     * **Answer file**: A JSON file that contains the answers to your question set. The filename must end with <mark style="color:red;">`.answer.json`</mark>. Example:

       ```json
       {
         "caption": "A realistic photograph of a white sedan parked on an asphalt road, facing the camera at a front and slightly right angle. The car is centered slightly to the left, with a visible license plate reading 'HZ20 SBV'. The natural daylight provides bright and clear lighting across the scene. In the midground, the asphalt road extends horizontally, flanked by green grassy areas with scattered bushes. The background features a clear, blue sky and a line of four wind turbines with white blades and pale orange towers positioned along a grassy landscape with a body of water visible behind them. The entire composition centers on the car with the wind turbines providing a modern, eco-friendly backdrop."
       }
       ```
     * **File naming**: Each document and its answer file must have the same filename. For example, if the document is named <mark style="color:red;">`a.jpg`</mark>, its answer file should be named <mark style="color:red;">`a.answer.json`</mark>.
     * **Multiple documents**: If you have multiple documents and multiple answer files, prepare them like this:
       * <mark style="color:red;">`a.jpg`</mark>, <mark style="color:red;">`a.answer.json`</mark>
       * <mark style="color:red;">`b.jpg`</mark>, <mark style="color:red;">`b.answer.json`</mark>
       * <mark style="color:red;">`c.jpg`</mark>, <mark style="color:red;">`c.answer.json`</mark>
2. Go to **Projects** page and click **Create project.**
3. In step 1, upload the documents along with their answer files.

   <figure><img src="/files/sNarp09Hz1u2p47V71rt" alt=""><figcaption></figcaption></figure>
4. In step 3, create or upload the question set. Ensure that the answers in the pre-labeled answer files are configured in this step.

   <figure><img src="/files/bmUzSCaLdyz2bSro310d" alt=""><figcaption></figcaption></figure>
5. Complete the remaining project setup steps, and your document labeling project will be ready with pre-labeled data.

   <figure><img src="/files/cAmNyDQ2Ipyr5VP8EkQc" alt=""><figcaption></figcaption></figure>

### Bounding box labeling

1. Prepare your documents and pre-labeled answer files. Pre-labeling works as long as the media files are uploaded together with their answer files.
   * **Answer file format**: Supported file formats include: [YOLO (.txt)](/compatibility-and-updates/supported-formats.md#yolo-txt), [LabelMe (.xml)](/compatibility-and-updates/supported-formats.md#labelme-xml), [Pascal VOC (.xml)](/compatibility-and-updates/supported-formats.md#pascal-voc-xml), [Datasaur Schema (.json)](/compatibility-and-updates/supported-formats.md#datasaur-schema-format)
   * **File naming:** Each document and its answer file must have the same filename. For example, if the document file is named <mark style="color:red;">`a.jpg`</mark>, its answer file in YOLO format should be named <mark style="color:red;">`a.txt`</mark>.
   * **Multiple documents**: If you have multiple documents and multiple answer files, prepare them like this:
     * <mark style="color:red;">`a.jpg`</mark>, <mark style="color:red;">`a.txt`</mark>
     * <mark style="color:red;">`b.jpg`</mark>, <mark style="color:red;">`b.txt`</mark>
     * <mark style="color:red;">`c.jpg`</mark>, <mark style="color:red;">`c.txt`</mark>
2. Go to **Projects** page and click **Create project.**
3. In step 1, upload the documents along with their answer files.
   * **Notes:**

     * Labels in your pre-labeled file automatically match the labels in your label set.
     * Extra labels in the file that are not found in the label set will be added as new label classes.

     <figure><img src="/files/NA5WolEKvLeKwfIMcvwK" alt=""><figcaption></figcaption></figure>
4. In step 3, create or upload the label set.
5. Complete the remaining project setup steps, and your bounding box labeling labeling project will be ready with pre-labeled data.

   <figure><img src="/files/P8UeVJGyvRa73z3fq3iK" alt=""><figcaption></figcaption></figure>

## Pre-labeled propagation

By default, pre-labeled data is treated as the **labeler’s submission**. Once applied in labeler mode, it instantly appears in reviewer mode. Depending on the project’s consensus settings, it may appear as as either *accepted* or conflicted labels.

This behavior helps speed up labeling when the pre-labeled data is reliable. If the data is unreliable, turn on the **Require approval for pre-labeled labels** setting to ensure labelers review each label before it is applied.

### Require approval for pre-labeled labels/answers <a href="#require-approval-for-pre-labeled-label-setting" id="require-approval-for-pre-labeled-label-setting"></a>

This project setting adds a quality control step for pre-labeled data. It is useful when the pre-labeled source is less reliable or when verification is required before the data can be accepted.

Instead of automatically accepting all pre-labeled labels or answers, this option requires labelers to **manually** **accept** or **reject** each one before it moves to the review stage. This ensures that only verified data is used.

This setting is useful in the following scenarios:

* **Untrusted external data**\
  When importing labels from external sources, this setting prevents low-quality or inconsistent labels from automatically appearing in reviewer mode before verification.
* **Initial quality verification**\
  Allows labelers to validate pre-labeled data before continuing with more complex labeling tasks, ensuring a reliable baseline.
* **Assisted labeling**\
  Allows labelers to treat pre-labeled labels or answers as suggestions, approving or rejecting them based on project requirements.

#### Differences from the default behavior

<table data-full-width="true"><thead><tr><th width="121.48046875">Feature</th><th width="295.0859375">Default</th><th valign="middle">Require approval</th></tr></thead><tbody><tr><td><strong>Propagation</strong></td><td><strong>Automatic.</strong> Pre-labeled data is treated as the labeler’s submission and appears in reviewer mode.</td><td valign="middle"><strong>Manual approval required.</strong> Pre-labeled data remains in the labeler’s copy and does not appear in reviewer mode until the labeler accepts it.</td></tr><tr><td><strong>Labeler action</strong></td><td><strong>No action required</strong>. Labelers can modify or delete pre-labeled data as needed.</td><td valign="middle"><strong>Manual</strong> <strong>review</strong> <strong>required</strong>. Labelers must accept or reject each pre-labeled label and answer.</td></tr><tr><td><strong>State in reviewer mode</strong></td><td><strong>Immediately visible</strong>, with color depending on consensus rules.</td><td valign="middle"><strong>Not visible</strong> in reviewer mode. They appear only after the labeler accepts them.</td></tr></tbody></table>

You can configure this setting in the step 5 of project creation, by enabling the **Require approval for pre-labeled labels** toggle. This setting can only be defined during project creation and cannot be changed afterward.

![](/files/ug2NEIlMJw4hiveprq2R)

{% hint style="info" %}
This setting is currently available for the following project types:

* Span labeling.
* Row labeling.

Support for additional labeling types will be added in future releases.
{% endhint %}

<figure><img src="/files/1fNSPWxJteMg4tO44vxS" alt=""><figcaption><p>Hovering on a pre-labeled span label requiring approval</p></figcaption></figure>

<figure><img src="/files/nWxoPARXpALPBD7c1hOS" alt=""><figcaption><p>Hovering on a pre-labeled row answer requiring approval</p></figcaption></figure>

#### Labeler workflow and outcomes

When this setting is enabled, pre-labeled labels and answers appear in each labeler’s view with dashed outlines. Labelers must review them before proceeding.

**Accept**

When accepted, the pre-labeled data becomes the labeler’s submission. It is marked as accepted and appears in reviewer mode.

Labelers can accept pre-labeled labels or answers in the following ways:

* **Span labeling**
  * Right-click a pre-labeled label and select **Accept**.
  * Apply the same label to the same span.
  * Use the **Labels** extension to:
    * Accept a single pre-labeled label.
    * Accept multiple pre-labeled labels using bulk actions.
* **Row labeling**
  * Select the corresponding answer in the **Row labeling** extension and submit.

**Reject**

When rejected, the pre-labeled data is removed from the labeler’s view and does not appear in reviewer mode.

Labelers can reject pre-labeled labels or answers in the following ways:

* **Span labeling**
  * Right-click a pre-labeled label and select **Reject**.
  * Use the **Labels** extension to:
    * Reject a single pre-labeled label.
    * Reject multiple pre-labeled labels using bulk actions.
* **Row labeling**
  * Leave the answer unselected and submit.

This workflow ensures that all pre-labeled data is reviewed before reaching the review stage, improving accuracy and data quality.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.datasaur.ai/data-studio-projects/pre-labeled-project.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
