Audio Project

Project Creation

Let's start an audio project: Creating an audio project is simple in Datasaur. All the steps are the same as creating token-based projects.

From your project home page open “Create a Custom Project.” (You can also watch this Youtube video for instructions on how to create an audio project). In order to begin, you should upload both your audio and transcription on this page:

The type of audio files we accept: .mp3, .flac, and .wav

The type of transcription files we accept: .srt and .txt format. Here is an example:

Before we move on, please make sure the name of the transcription file is the same as the name of the audio file. For example: SampleFile.vtt and SampleFile.flac. Since both the transcription and the audio file have the same name, Datasaur will recognize them as corresponding files.

So now our window should look like this,

For the rest of the Project Creation Process, you will then be able to take all the steps one takes in a token-based project: Preview, Labeler's Tasks, Assignment, and Project Settings:

Audio Interface Legend

Labeling an Audio Project

In the general layout of the text tool, you may regularly label any tokens in the transcript with the label sets that you have either created or uploaded. You will find an audio player on top of the interface with timestamps for your audio. You will also see a control panel between the text interface and the audio timestamps.

The buttons on this control panel enable you to: Rewind, Play/Pause, Fast-Forward, Volume, Control Audio Settings (which toggles audio speed and auto-scroll), Create a Timestamp Label, and Zoom Out/In of the Audio Timestamps. [Please watch this brief video for a visual guide].

You can also create a new timestamp and bound that timestamp to its corresponding text. Click on the 'Create a Timestamp Label' button found on the control panel. Highlight a portion of the audio interface, and then highlight the span of tokens that correspond to your new timestamp. Then you're done! You've now made a new timestamp corresponding to a span of tokens.

Edit Sentence in Audio Project Behavior

When editing transcriptions, we have optimized our behavior to accommodate changes. If the similarity between the original transcription and the edited version is above 70%, we keep the timestamp labels, and you will only need to adjust the corresponding text accordingly.

However, if the similarity falls below 70%, there is a chance that the timestamp labels will be removed. This happens when the edited text differs significantly from the original, indicating substantial changes. Our threshold ensures that timestamp labels remain for minor modifications but are removed for more significant alterations.

After we edit the sentence in the first line, the timestamp labels will be disappeared. It because the similarity between the original sentence and the edited sentence is less than 70%.

If you upload an empty transcription, we will create placeholder lines with - as the content, and they will already be associated with the timestamp. Please note that editing the placeholder content will remove the timestamp label, and you will need to re-draw it manually.

Last updated 3 months ago