For the complete documentation index, see llms.txt. This page is also available as Markdown.

Export Project

Export projects

Both features are supported through API call. Click here for more detailed explanation.

  1. Go to the File menu.

  2. Select Export file or Export all files.

  • Export file: Export only the currently opened file. The output reflects the latest project state and does not include all project data.

  • Export all files: Export all files in the project. For projects with multiple assignees, each labeled version is exported separately. The output is a .zip file containing three folders:

    • DOCUMENT-Labeler-name: Contains files labeled by each labeler.

    • REVIEW: Contains the final version, including auto-accepted labels and reviewer edits.

    • ROOT: Contains the original raw text without any labels or edits.

Include unresolved labels / answers in the export result

You can include unresolved labels or answers inside the export result. This is supported for span, row, and document labeling projects in both Datasaur Schema and CSV formats.

Enable the option

When you select a supported export format, a checkbox will appear. If selected, unresolved labels or answers will be included in the export.

Export result

  • Comma-separated values (.csv)

    • New column: Label Status

      This column indicates whether a line is conflicted or resolved.

    • New column: Line

      This column indicates the line number.

  • Datasaur Schema (.json)

    • New values: rowAnswers, documentAnswers, spanLabels, arrowLabels

      The conflicted values will be added to rowAnswers, documentAnswers, spanLabels, or arrowLabels.

      You can differentiate between resolved and unresolved answer by looking at labeledBy attribute. Unresolved label should have CONFLICT as their labeledBy value.

The following section will give you some illustration on the result.

  • Span labeling

    • Datasaur Schema (.json)

      Conflicted labels will be added to spanLabels or arrowLabels.

    • Comma-separated values (.csv)

      This format is similar to the Amazon Comprehend CSV export format, but with an additional column titled Label Status.

  • Row and document Labeling

    • Datasaur Schema (.json)

      Unresolved answer will not be added to the answer set (rowAnswerSets for Row Labeling, while documentAnswerSets for Document Labeling)

      However, it will be added to rowAnswers for Row Labeling, while documentAnswers for Document Labeling, along with the resolved answers.

    • Comma-separated values (.csv)

      Adds Label Status and Line column.

      There may be cases where a single line contains both resolved and unresolved answers due to consensus. In such cases, the answers are split into two lines: one for resolved answers and one for unresolved answers.

Export methods

When exporting a file, you can choose from several delivery options. All methods are also available via API.

Download

  • The export result is uploaded to Datasaur's storage bucket, and you download it directly to your device via a generated link.

  • Keep in mind that the time needed to generate the link will be directly proportional to the size of the project.

Email

  • Datasaur will generate a download link and send it to the email address used for your account.

  • The link expires in 6 hours.

Webhook

  • The export result is sent as a payload in a webhook request.

  • See the Webhook documentation for full details.

  • The link expires in 6 hours.

External Object Storage

  • The export result is uploaded directly to your selected external storage bucket.

  • You can also add a prefix to the file name. The prefix is appended directly without a separator. For example:

    • Prefix: test

    • File name: name.json

    • Result: testname.json

Export multiple projects from the Projects page

You can export multiple projects with the same project settings in the Projects page.

  1. Select all projects you want to export.

  2. Click Export button. You can choose the export format and method. The output will be a .zip file.

We recommend exporting up to 10 projects at once for better performance.

Last updated