Split Files

With the Split files option, you can divide each file in your project among labelers so they don’t work on the same data. You can also assign specific files to particular labelers.

Split Files

Sample case

Let's say we have a file named 100-rows-reviewed.csv with 100 rows. Our goal is to divide this file equally between 5 labelers.

  1. In step 1 of project creation, upload the files.

  2. In step 2, preview the files.

  3. In step 3, set the task type and the question set.

  4. In step 4, choose 5 members as the labelers.

    1. If the purpose in this project is only to speed up the labeling process and doesn't require the reviewer judgement, set the consensus to 1.

    2. If reviewer judgement is required, set the desired consensus accordingly.

  5. Select Advanced.

  6. In the Advanced menu, select the Split files checkbox. Then enter the number of parts in the Parts field. In this example, we set it to 5. The file will be split into 5 parts, and - Part 1 through - Part 5 will be added as suffixes to the file names.

  7. Select Distribute files automatically. The system will distribute the file parts based on the consensus setting.

    1. In this case, the file parts will be assigned as follows:

      • John Doe will get 100-rows-reviewed.csv - Part 1

      • Johnson Dominic will get 100-rows-reviewed.csv - Part 2

      • James Doe will get 100-rows-reviewed.csv - Part 3

      • Jason Doe will get 100-rows-reviewed.csv - Part 4

      • Jeremy Doe will get 100-rows-reviewed.csv - Part 5

    2. They will only see their assigned file parts in the project.

Additional information

  • The Split files option is only available during project creation. File parts cannot be split or merged after the project is created.

  • Labelers will only receive the assigned parts.

  • Reviewers will receive all file parts.

  • The Split files option is available for span labeling, row labeling, and conversational labeling projects.

  • The Distribute files automatically option is available for span labeling, row labeling, conversational labeling, bounding box labeling, and document labeling projects.

Last updated