# Audio Project

### Project creation

<figure><img src="/files/z8KBcQieFy0nbGVvt0ck" alt=""><figcaption></figcaption></figure>

Creating an audio project is simple in Datasaur. All the steps are the same as creating other labeling task types.

1. From the **Projects page**, click **Create project**.
2. Upload both your audio file and its transcription.
   * Accepted audio formats: `.mp3`, `.flac`, and `.wav`
   * Accepted transcription formats: `.srt` and `.txt`
   * You can download example audio and transcription files below.

{% file src="/files/KcoYlp3WS2cohl7RHgdW" %}
Audio file sample
{% endfile %}

3. Make sure the transcription file name matches the audio file name. For example: `SampleFile.mp3` and `SampleFile.srt` . When both files have the same name, the system recognizes them as corresponding files.

<figure><img src="/files/Fn4OBkiFIaaI9uhZEW09" alt=""><figcaption></figcaption></figure>

4. Continue with the remaining project setup steps: **Preview**, **Labeler's tasks**, **Assignment**, and **Project Settings**.

![The Preview step will only display the transcription. The Labeler's tasks step will only allow span labeling.](/files/p6sq7oqb4LWa6zHHFkwm)

### Audio interface legend

![](/files/54gSdGhU3Q8uiX4wbT1b)

### Label an audio project

![](/files/E30ex9h0C9o8GgQ6ea5r)

At the top of the interface, you will find an audio player with timestamps. Below it, you will see the transcript, where you can label spans using the label sets you have added.

Between the audio player and the transcript, there is a control panel where you can perform the actions listed below, or [watch this brief video for a visual guide](https://www.loom.com/share/a0078486e0fc4fda8b26f0354e2c2100).

* Rewind 10 seconds
* Play/Pause
* Fast forward 10 seconds
* Adjust volume
* Use the timestamp field to jump to a specific time
* Enter **Create timestamp** mode
* Zoom in or out of the audio timestamps
* Open audio settings (audio speed and auto-scroll)

![The Control Panel](/files/VkbJivu9115DYXktHolj)

The **Create timestamp** button allows you to create a new timestamp and link it to the corresponding text. Select a portion of the audio timeline, then highlight the matching span of tokens in the transcript. The timestamp is now linked to that span of tokens.

![](/files/F87DJwKEpYcOdkWuZY5R)

<figure><img src="/files/TUG5uizkoE7p3aI1D51Z" alt=""><figcaption></figcaption></figure>

### Edit sentences in audio project

When editing transcriptions, the system adjusts timestamp behavior based on how much the text changes.

* If the similarity between the original transcription and the edited version is **above 70%**, the timestamp labels are kept, and you only need to adjust the corresponding text.
* If the similarity falls **below 70%**, the timestamp labels may be removed. This happens when the edited text differs significantly from the original. The threshold ensures that timestamp labels remain for minor edits but are removed for more major changes.

For example, after editing the sentence in the first line, the timestamp labels will disappear because the similarity between the original and edited sentence is below 70%.

{% hint style="info" %}
If you upload an empty transcription, placeholder lines with <mark style="color:red;">**`-`**</mark> will be created and automatically associated with timestamps. Editing the placeholder content will remove the timestamp label, and you will need to create it again manually.
{% endhint %}

### Multi-channel audio

Audio projects can include files with multiple channels, enabling labeling across different audio sources within the same file.

* If an uploaded audio file contains multiple channels, the interface displays:
  * A **main audio track** (combined view).
  * Up to **2 individual channels.** Any additional channels in the source file are not shown.
* If the audio file does not contain channels, only the **main audio track** is displayed.

**Timestamp labeling behavior**

* **Multi-channel audio:**
  * Timestamp labels can be created **only within individual channels.**
  * Labels created in channels are **reflected in the main audio track.**
  * The **main audio track does not support direct timestamp labeling.**
* **Single-channel audio:**
  * Timestamp labels can be created **directly on the main audio track.**
  * Channel-specific labeling is not applicable.

This behavior keeps labeling structured at the channel level while maintaining a unified view in the main audio track.

You can use the sample file below to test multi-channel audio.

{% file src="/files/VkKAqaUBfDLmtPVz9m2Z" %}


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.datasaur.ai/data-studio-projects/nlp-task-types/span-based/audio-project.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
