Datasaur
Visit our websitePricingBlogPlaygroundAPI Docs
  • Welcome to Datasaur
    • Getting started with Datasaur
  • Data Studio Projects
    • Labeling Task Types
      • Span Based
        • OCR Labeling
        • Audio Project
      • Row Based
      • Document Based
      • Bounding Box
      • Conversational
      • Mixed Labeling
      • Project Templates
        • Test Project
    • Creating a Project
      • Data Formats
      • Data Samples
      • Split Files
      • Consensus
      • Dynamic Review Capabilities
    • Pre-Labeled Project
    • Let's Get Labeling!
      • Span Based
        • Span + Line Labeling
      • Row & Document Based
      • Bounding Box Labeling
      • Conversational Labeling
      • Label Sets / Question Sets
        • Dynamic Question Set
      • Multiple Label Sets
    • Reviewing Projects
      • Review Sampling
    • Adding Documents to an Ongoing Project
    • Export Project
  • LLM Projects
    • LLM Labs Introduction
    • Sandbox
      • Direct Access LLMs
      • File Attachment
      • Conversational Prompt
    • Deployment
      • Deployment API
    • Knowledge base
      • External Object Storage
      • File Properties
    • Models
      • Amazon SageMaker JumpStart
      • Amazon Bedrock
      • Open AI
      • Azure OpenAI
      • Vertex AI
      • Custom model
      • Fine-tuning
      • LLM Comparison Table
    • Evaluation
      • Automated Evaluation
        • Multi-application evaluation
        • Custom metrics
      • Ranking (RLHF)
      • Rating
      • Performance Monitoring
    • Dataset
    • Pricing Plan
  • Workspace Management
    • Workspace
    • Role & Permission
    • Analytics
      • Inter-Annotator Agreement (IAA)
        • Cohen's Kappa Calculation
        • Krippendorff's Alpha Calculation
      • Custom Report Builder
      • Project Report
      • Evaluation Metrics
    • Activity
    • File Transformer
      • Import Transformer
      • Export Transformer
      • Upload File Transformer
      • Running File Transformer
    • Label Management
      • Label Set Management
      • Question Set Management
    • Project Management
      • Self-Assignment
        • Self-Unassign
      • Transfer Assignment Ownership
      • Reset Labeling Work
      • Mark Document as Complete
      • Project Status Workflow
        • Read-only Mode
      • Comment Feature
      • Archive Project
    • Automation
      • Action: Create Projects
  • Assisted Labeling
    • ML Assisted Labeling
      • Amazon Comprehend
      • Amazon SageMaker
      • Azure ML
      • CoreNLP NER
      • CoreNLP POS
      • Custom API
      • FewNERD
      • Google Vertex AI
      • Hugging Face
      • LLM Assisted Labeling
        • Prompt Examples
        • Custom Provider
      • LLM Labs (beta)
      • NLTK
      • Sentiment Analysis
      • spaCy
      • SparkNLP NER
      • SparkNLP POS
    • Data Programming
      • Example of Labeling Functions
      • Labeling Function Analysis
      • Inter-Annotator Agreement for Data Programming
    • Predictive Labeling
  • Assisted Review
    • Label Error Detection
  • Building Your Own Model
    • Datasaur Dinamic
      • Datasaur Dinamic with Hugging Face
      • Datasaur Dinamic with Amazon SageMaker Autopilot
  • Advanced
    • Script-Generated Question
    • Shortcuts
    • Extensions
      • Labels
      • Review
      • Document and Row Labeling
      • Bounding Box Labels
      • List of Files
      • Comments
      • Analytics
      • Dictionary
      • Search
      • Labeling Guidelines
      • Metadata
      • Grammar Checker
      • ML Assisted Labeling
      • Data Programming
      • Datasaur Dinamic
      • Predictive Labeling
      • Label Error Detection
      • LLM Sandbox
    • Tokenizers
  • Integrations
    • External Object Storage
      • AWS S3
        • With IRSA
      • Google Cloud Storage
      • Azure Blob Storage
    • SAML
      • Okta
      • Microsoft Entra ID
    • SCIM
      • Okta
      • Microsoft Entra ID
    • Webhook Notifications
      • Webhook Signature
      • Events
      • Custom Headers
    • Robosaur
      • Commands
        • Create Projects
        • Apply Project Tags
        • Export Projects
        • Generate Time Per Task Report
        • Split Document
      • Storage Options
  • API
    • Datasaur APIs
    • Credentials
    • Create Project
      • New mutation (createProject)
      • Python Script Example
    • Adding Documents
    • Labeling
      • Create Label Set
      • Add Label Sets into Existing Project
      • Get List of Label Sets in a Project
      • Add Label Set Item into Project's Label Set
      • Programmatic API Labeling
      • Inserting Span and Arrow Label into Document
    • Export Project
      • Custom Webhook
    • Get Data
      • Get List of Projects
      • Get Document Information
      • Get List of Tags
      • Get Cabinet
      • Export Team Overview
      • Check Job
    • Custom OCR
      • Importable Format
    • Custom ASR
    • Run ML-Assisted Labeling
  • Security and Compliance
    • Security and Compliance
      • 2FA
  • Compatibility & Updates
    • Common Terminology
    • Recommended Machine Specifications
    • Supported Formats
    • Supported Languages
    • Release Notes
      • Version 6
        • 6.111.0
        • 6.110.0
        • 6.109.0
        • 6.108.0
        • 6.107.0
        • 6.106.0
        • 6.105.0
        • 6.104.0
        • 6.103.0
        • 6.102.0
        • 6.101.0
        • 6.100.0
        • 6.99.0
        • 6.98.0
        • 6.97.0
        • 6.96.0
        • 6.95.0
        • 6.94.0
        • 6.93.0
        • 6.92.0
        • 6.91.0
        • 6.90.0
        • 6.89.0
        • 6.88.0
        • 6.87.0
        • 6.86.0
        • 6.85.0
        • 6.84.0
        • 6.83.0
        • 6.82.0
        • 6.81.0
        • 6.80.0
        • 6.79.0
        • 6.78.0
        • 6.77.0
        • 6.76.0
        • 6.75.0
        • 6.74.0
        • 6.73.0
        • 6.72.0
        • 6.71.0
        • 6.70.0
        • 6.69.0
        • 6.68.0
        • 6.67.0
        • 6.66.0
        • 6.65.0
        • 6.64.0
        • 6.63.0
        • 6.62.0
        • 6.61.0
        • 6.60.0
        • 6.59.0
        • 6.58.0
        • 6.57.0
        • 6.56.0
        • 6.55.0
        • 6.54.0
        • 6.53.0
        • 6.52.0
        • 6.51.0
        • 6.50.0
        • 6.49.0
        • 6.48.0
        • 6.47.0
        • 6.46.0
        • 6.45.0
        • 6.44.0
        • 6.43.0
        • 6.42.0
        • 6.41.0
        • 6.40.0
        • 6.39.0
        • 6.38.0
        • 6.37.0
        • 6.36.0
        • 6.35.0
        • 6.34.0
        • 6.33.0
        • 6.32.0
        • 6.31.0
        • 6.30.0
        • 6.29.0
        • 6.28.0
        • 6.27.0
        • 6.26.0
        • 6.25.0
        • 6.24.0
        • 6.23.0
        • 6.22.0
        • 6.21.0
        • 6.20.0
        • 6.19.0
        • 6.18.0
        • 6.17.0
        • 6.16.0
        • 6.15.0
        • 6.14.0
        • 6.13.0
        • 6.12.0
        • 6.11.0
        • 6.10.0
        • 6.9.0
        • 6.8.0
        • 6.7.0
        • 6.6.0
        • 6.5.0
        • 6.4.0
        • 6.3.0
        • 6.2.0
        • 6.1.0
        • 6.0.0
      • Version 5
        • 5.63.0
        • 5.62.0
        • 5.61.0
        • 5.60.0
  • Deployment
    • Self-Hosted
      • AWS Marketplace
        • Data Studio
        • LLM Labs
Powered by GitBook
On this page
  • Model Details
  • Usage
  1. Assisted Labeling
  2. ML Assisted Labeling

FewNERD

Last updated 1 month ago

Supported Labeling Types: Span Labeling

Datasaur's implementation of FewNERD (Few-shot Named Entity Recognition Dataset) represents a significant advancement in fine-grained entity recognition capability. This state-of-the-art model delivers unprecedented granularity in entity classification, enabling more precise document understanding and information extraction than traditional NER systems. The model used is sourced from .

This provider will support you with Span Labeling task for your Named Entity Recognition model.

You can download the FewNERD label set below:

Model Details

  • Built on SpanMarker, a specialized span-based approach to NER that outperforms traditional token classification methods

  • FewNERD provides a unique combination of 66 fine-grained entity types organized within a hierarchical structure of 8 coarse-grained types.

  • This allows for more precise entity identification, distinguishing between general categories (e.g., people, places) and more specific subtypes (e.g., musicians, companies, medical conditions).

  • The model is optimized for NER tasks, making it particularly effective for structured information extraction.

  • Hosted locally within the Datasaur Intelligence container for consistent performance and data security.

Usage

  • This provider supports Span Labeling tasks for Named Entity Recognition (NER) models.

  • It allows for fine-grained classification of entities within a text, offering more detailed insights compared to traditional NER approaches.

  • The hierarchical entity structure improves classification accuracy and contextual understanding.

  • Here are some examples of labels from the FewNERD label set:

Label
Examples

art-broadcastprogram

"Street Cents", "Corazones", "The Gale Storm Show : Oh , Susanna"

art-film

"Bosch", "L'Atlantide", "Shawshank Redemption"

art-music

"Atkinson , Danko and Ford ( with Brockie and Hilton )", "Champion Lover", "Hollywood Studio Symphony"

art-other

"Aphrodite of Milos", "Venus de Milo", "The Today Show"

art-painting

"Production/Reproduction", "Touit", "Cofiwch Dryweryn"

art-writtenart

"Imelda de ' Lambertazzi", "Time", "The Seven Year Itch"

building-airport

"Luton Airport", "Newark Liberty International Airport", "Sheremetyevo International Airport"

building-hospital

"Hokkaido University Hospital", "Yeungnam University Hospital", "Memorial Sloan-Kettering Cancer Center"

building-hotel

"The Standard Hotel", "Radisson Blu Sea Plaza Hotel", "Flamingo Hotel"

building-library

"British Library", "Berlin State Library", "Bayerische Staatsbibliothek"

building-other

"Communiplex", "Alpha Recording Studios", "Henry Ford Museum"

building-restaurant

"Fatburger", "Carnegie Deli", "Trumbull"

building-sportsfacility

"Glenn Warner Soccer Facility", "Boston Garden", "Sports Center"

building-theater

"Pittsburgh Civic Light Opera", "Sanders Theatre", "National Paris Opera"

event-attack/battle/war/militaryconflict

"Easter Offensive", "Vietnam War", "Jurist"

event-disaster

"the 1912 North Mount Lyell Disaster", "1693 Sicily earthquake", "1990s North Korean famine"

event-election

"March 1898 elections", "1982 Mitcham and Morden by-election", "Elections to the European Parliament"

event-other

"Eastwood Scoring Stage", "Union for a Popular Movement", "Masaryk Democratic Movement"

event-protest

"French Revolution", "Russian Revolution", "Iranian Constitutional Revolution"

event-sportsevent

"National Champions", "World Cup", "Stanley Cup"

location-GPE

"Mediterranean Basin", "the Republic of Croatia", "Croatian"

location-bodiesofwater

"Atatürk Dam Lake", "Norfolk coast", "Arthur Kill"

location-island

"Laccadives", "Staten Island", "new Samsat district"

location-mountain

"Salamander Glacier", "Miteirya Ridge", "Ruweisat Ridge"

location-other

"Northern City Line", "Victoria line", "Cartuther"

location-park

"Gramercy Park", "Painted Desert Community Complex Historic District", "Shenandoah National Park"

location-road/railway/highway/transit

"Friern Barnet Road", "Newark-Elizabeth Rail Link", "NJT"

organization-company

"Dixy Chicken", "Texas Chicken", "Church 's Chicken"

organization-education

"MIT", "Belfast Royal Academy and the Ulster College of Physical Education", "Barnard College"

organization-government/governmentagency

"Congregazione dei Nobili", "Diet", "Supreme Court"

organization-media/newspaper

"TimeOut Melbourne", "Clash", "Al Jazeera"

organization-other

"Defence Sector C", "IAEA", "4th Army"

organization-politicalparty

"Shimpotō", "Al Wafa ' Islamic", "Kenseitō"

organization-religion

"Jewish", "Christian", "UPCUSA"

organization-showorganization

"Lizzy", "Bochumer Symphoniker", "Mr. Mister"

organization-sportsleague

"China League One", "First Division", "NHL"

organization-sportsteam

"Tottenham", "Arsenal", "Luc Alphand Aventures"

other-astronomything

"Zodiac", "Algol", "`` Caput Larvae ''"

other-award

"GCON", "Order of the Republic of Guinea and Nigeria", "Grand Commander of the Order of the Niger"

other-biologything

"N-terminal lipid", "BAR", "Amphiphysin"

other-chemicalthing

"uranium", "carbon dioxide", "sulfur"

other-currency

"$", "Travancore Rupee", "lac crore"

other-disease

"French Dysentery Epidemic of 1779", "hypothyroidism", "bladder cancer"

other-educationaldegree

"Master", "Bachelor", "BSc ( Hons ) in physics"

other-god

"El", "Fujin", "Raijin"

other-language

"Breton-speaking", "English", "Latin"

other-law

"Thirty Years ' Peace", "Leahy–Smith America Invents Act ( AIA", "United States Freedom Support Act"

other-livingthing

"insects", "monkeys", "patchouli"

other-medical

"Pediatrics", "amitriptyline", "pediatrician"

person-actor

"Ellaline Terriss", "Tchéky Karyo", "Edmund Payne"

person-artist/author

"George Axelrod", "Gaetano Donizett", "Hicks"

person-athlete

"Jaguar", "Neville", "Tozawa"

person-director

"Bob Swaim", "Richard Quine", "Frank Darabont"

person-other

"Richard Benson", "Holden", "Campbell"

person-politician

"William", "Rivière", "Emeric"

person-scholar

"Stedman", "Wurdack", "Stalmine"

person-soldier

"Helmuth Weidling", "Krukenberg", "Joachim Ziegler"

product-airplane

"Luton", "Spey-equipped FGR.2s", "EC135T2 CPDS"

product-car

"100EX", "Corvettes - GT1 C6R", "Phantom"

product-food

"red grape", "yakiniku", "V. labrusca"

product-game

"Airforce Delta", "Hardcore RPG", "Splinter Cell"

product-other

"Fairbottom Bobs", "X11", "PDP-1"

product-ship

"Congress", "Essex", "HMS `` Chinkara ''"

product-software

"AmiPDF", "Apdf", "Wikipedia"

product-train

"High Speed Trains", "55022", "Royal Scots Grey"

product-weapon

"AR-15 's", "ZU-23-2M Wróbel", "ZU-23-2MR Wróbel II"

SpanMarker-BERT-Base-FewNERD-Fine-Super
1KB
fewnerd-labelset.csv
FewNERD Labelset
ML Assisted with FewNERD
Image of ML Assisted with FewNERD