Labels
Last updated
Last updated
A core extension to any project, the Labels extension contains the label set or taxonomy used for the project. There are three ways to add a label set:
Upload a label set with a .csv
or .tsv
format
Choose from a library of label sets
Type the labels in manually
Note: maximum label set size is 500 KB
The library has three label sets loaded by default: INACL
, Stanford
, and Default NER
. If you have created a project before, previous label sets will be automatically added to the library.
📌 One important thing to note is that uploading or choosing a new label set will delete all existing labels in the document.
While labeling your project, it is possible to add or remove label items dynamically. Furthermore, you can edit or add label items and edit the label color by clicking on the triple dots and clicking Edit label set, as shown below. We provide twelve colors that you can select from.
You can also move a label by clicking the dots and dragging the label up and down.
Some users find that colors can help them memorize labels and reduce human error while labeling.
We now support labels hierarchies up to 10 levels!
As you can see in the screenshots above, Datasaur count all labels applied to a span of tokens and a relation. If you click on the counter, you can see the instances.
Clicking one of the instances will directly go to the instance's location in the text editor.
While labeling, users may find that certain labels do not apply to a document. For example, if a label set includes a "Signature" label, but it is not applicable to a particular document, there is no straightforward method for users to indicate the absence of certain label classes.
The introduction of this capability allows users to mark labels that are not applied to a document as N/A
(not applicable).
This can be enabled either during the project's creation through the Project Creation Wizard or in the Project Settings within a project.
To enable it through the Project Creation Wizard, proceed to Step 5.
To enable it through Project Settings, navigate to the File, Settings menu and open the Administrator tab. Then, enable the "Allow marking unapplied label classes as N/A" option.
Once enabled, you can mark a label class as N/A
(not applicable) within the extension. A label eligible for being marked as N/A
is one that has not yet been applied to the document, indicated by an instance counter of 0
displayed in the extension.
To mark the label class as “N/A”, click on the instance counter.
Note:
• If you apply a label class that was previously marked as N/A
to a span in the in the document, the N/A
mark will be automatically removed, as it becomes applicable to the document.
• The N/A
mark will only be visible to the user who marks the label class as N/A
. It will not affect other users.
To remove the N/A
mark, simply click again on the label class you previously marked as N/A
.
When you mark a document as complete and there are label classes that have not been applied, a dialog box will appear displaying a list of these unapplied label classes. This list excludes any label classes that have already been marked as N/A
, enabling you to review and take appropriate action.
If you wish to mark all unapplied label classes as N/A
, check the "Mark all unapplied label classes as N/A" option and then click "Mark as complete."