Search
Last updated
Last updated
The Search extension helps users quickly find specific words, phrases, or labeled tokens within their data. It’s useful for navigating both individual documents and entire projects, with features like label-specific searches, regex searches, and exact word matching. Results are clearly displayed in a list, making it easier to analyze and work with large datasets efficiently.
In a Span Labeling project, two types of searches are available: Standard and Advanced.
The Standard Search allows users to perform simple searches based on text and labels using keywords or regular expressions (regex). This search type is intuitive and provides quick access to relevant data by matching the input with the text or labels in the project.
Text-based search allows users to search for specific words or patterns within the data by specifying a word filter and entering a keyword to locate matching text in the project.
Word Filter
This option lets users define how their search keywords are matched to results. The available options are:
Contains any word: Matches results that contain any of the specified words.
Example: Searching for men
will match with men
, mentioned
, abandonment
.
Exact word: Displays only exact matches for the search keyword.
Example: Searching for men
will match men
but not mentioned
.
Regex: Allows users to search using regular expressions for advanced pattern matching.
Example: Searching for men*
will match words starting with men
, such as mentioned
.
Label-based search allows users to find specific labels or categories in the data, with the word filter set to "Contains any word.”
The Advanced Search provides a more sophisticated way to search by allowing the combination of multiple conditions to refine results. This search type supports complex queries using MongoDB query syntax.
Users can create searches with multiple conditions, where each condition includes a search target, a filter operation, and a keyword. These conditions can be combined using logical operators such as "AND" or "AND" to define the relationship between the conditions.
Search target: The focus of the search.
Text
: Matches words or content in the spans.
Label
: Matches the labels applied to the text.
Metadata
: Matches information attached to the line (in key-value pair).
Filter operation: Determines how the search target is matched.
is
: Matches search target that exactly matches the specified keyword.
is not
: Matches search target that explicitly does not match the specified keyword.
contains
: Matches search target that contains the specified keyword.
does not contain
: Matches search target that does not contain the specified keyword
matches regex
: Matches search target that fits the regular expression pattern.
Keyword: The value the search will look for.
For Text and Label, this is the word or phrase to match.
For Metadata, this is the key:value pair used to filter information.
Logical operator: Specifies how multiple conditions are connected.
OR
: Matches results that meet at least one condition.
AND
: Matches results that meet all conditions.
To help users build accurate queries and easily check their work, the Advanced Search offers two preview options:
Conditions preview: Displays a summary of the applied conditions in plain language, helping users confirm that their input aligns with their intent.
Raw query preview: Displays the underlying MongoDB query being executed, providing advanced users with the ability to validate the query syntax.
The search operates at the line level, meaning it evaluates each line individually against the list of specified conditions.
💡Notes
Search results will exclude texts and labels that span across multiple lines.
For conditions with negative operators (is not
, does not contain
), only the lines that meet the specified conditions will be displayed in the results.
Allows users to search within the data of the table, enabling them to find specific information across multiple rows and columns quickly by specifying the search target, word filter, and entering the keyword.
Search target
This option allows users to specify the focus of the search. The available options are:
Text
: Matches the words or content in the data column.
Label
: Matches the words or content in the answer column.
Word Filter
This option lets users define how their search keywords are matched to results. The available options are:
Contains any word: Matches results that contain any of the specified words.
Example: Searching for men
will match with men
, mentioned
, abandonment
.
Exact word: Displays only exact matches for the search keyword.
Example: Searching for men
will match men
but not mentioned
.
Regex: Allows users to search using regular expressions for advanced pattern matching.
Example: Searching for men*
will match words starting with men
, such as mentioned
.
The Search All Files option allows users to search across all files within a project. When this option is checked, the search will include results from every file in the project. If the option is unchecked, the search will be limited to the current file only.
This is useful for users who want to either perform a broad search across all files or focus on a specific file within the project.
Only available for Span labeling projects.
The Label All feature allows users to quickly label all matching results in the project.
For example, searching for the text james
will show the number of instances of james
in the document. After selecting PER
from the dropdown, pressing the Label All button will apply the PER
label to all instances of james
in the document.
This feature is a useful tool for bulk labeling, making the process faster and more efficient. It is especially beneficial for projects that require detailed text analysis, enhancing accuracy and saving time.
Tips & Tricks To make it easier to navigate through the results, you can use the Up Arrow or Down Arrow keys to move to the next or previous result.