Hugging Face
Supported Labeling Types: Span Labeling, Row Labeling
Datasaur integrates directly with Hugging Face, providing access to their 10k+ models.
After choosing Hugging Face as the option, you can browse Hugging Face and pick any available model. If you already host your own private models on Hugging Face, you can use those as well.
For Span Labeling, enter either the model name or, if you're using a self-hosted model, the endpoint URL. There's no need to provide a model name or API token when using your own endpoint. You can also set the confidence score to manually adjust the prediction threshold based on your needs.
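To illustrate what a confidence threshold does to span predictions, here is a minimal sketch. It assumes the model returns entries in the shape produced by Hugging Face token-classification models with aggregation enabled (`entity_group`, `score`, `word`, `start`, `end`). The `filter_spans` helper, the sample predictions, and the "greater than or equal" comparison are illustrative assumptions, not Datasaur's actual implementation.

```python
from typing import Dict, List


def filter_spans(predictions: List[Dict], min_score: float) -> List[Dict]:
    """Keep only predicted spans whose score meets the threshold (hypothetical helper)."""
    return [p for p in predictions if p["score"] >= min_score]


# Made-up model output for the text "Alice moved to Berlin".
predictions = [
    {"entity_group": "PER", "score": 0.98, "word": "Alice", "start": 0, "end": 5},
    {"entity_group": "LOC", "score": 0.41, "word": "Berlin", "start": 15, "end": 21},
]

# With a threshold of 0.5, only the high-confidence "Alice" span is kept.
kept = filter_spans(predictions, min_score=0.5)
```

Raising the threshold yields fewer but more reliable labels, while lowering it applies more labels at the cost of more corrections.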
In Row Labeling, you can choose the Target Text as your input and the Target Question as your desired output. To get started, enter either the model name or the Hugging Face Inference Endpoint URL, along with your API token.
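For readers who want to check an endpoint before entering it, the sketch below builds (but does not send) a request of the kind a Hugging Face Inference Endpoint typically accepts: a JSON body with an `inputs` field and an optional bearer token. The `build_request` helper, the example URL, and the placeholder token are assumptions for illustration only, and how Datasaur calls the endpoint internally may differ.

```python
import json
import urllib.request
from typing import Optional


def build_request(endpoint_url: str, text: str,
                  api_token: Optional[str] = None) -> urllib.request.Request:
    """Build a POST request for a text-classification endpoint (hypothetical helper)."""
    headers = {"Content-Type": "application/json"}
    if api_token:
        # Only attach the token when one is provided.
        headers["Authorization"] = f"Bearer {api_token}"
    body = json.dumps({"inputs": text}).encode("utf-8")
    return urllib.request.Request(endpoint_url, data=body, headers=headers, method="POST")


# Placeholder URL and token; replace them with your own values.
request = build_request(
    "https://example.invalid/my-endpoint",
    "The delivery was fast and the packaging was great.",
    api_token="hf_xxx",
)
# Sending is left out on purpose; urllib.request.urlopen(request) would perform the call.
```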
When predicting labels, use a text-classification model. The model should return one of two response shapes: a list of objects in which each entry contains all predictions (for example positive, negative, and neutral) with their scores, or a single list/array containing one object for the single highest-scoring prediction.
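The two response shapes can be sketched as follows. The label names, scores, and exact nesting are assumptions modeled on typical Hugging Face text-classification output (`label` and `score` keys), and `top_prediction` is a hypothetical helper showing how either shape reduces to one label once a confidence threshold is applied.

```python
from typing import Dict, List, Optional, Union

Prediction = Dict[str, Union[str, float]]

# Shape 1 (assumed): every candidate label with its score, nested one level.
all_scores = [[
    {"label": "positive", "score": 0.91},
    {"label": "neutral", "score": 0.06},
    {"label": "negative", "score": 0.03},
]]

# Shape 2 (assumed): only the highest-scoring prediction.
top_only = [{"label": "positive", "score": 0.91}]


def top_prediction(response: List, min_score: float = 0.0) -> Optional[Prediction]:
    """Return the best prediction from either shape, or None if below the threshold."""
    # Unwrap the nested form [[...]] so both shapes become a flat list of objects.
    candidates = response[0] if response and isinstance(response[0], list) else response
    if not candidates:
        return None
    best = max(candidates, key=lambda p: p["score"])
    return best if best["score"] >= min_score else None


best_from_all = top_prediction(all_scores)
best_from_top = top_prediction(top_only)
```

Both calls return the same `positive` prediction, and passing a `min_score` above 0.91 would return `None` for either shape.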
This feature also includes an option for Faster Prediction Speed, which significantly improves performance by processing entire rows at once. However, this action can’t be undone.
Finally, you can adjust the Confidence Score to manually set the prediction threshold according to your preference.
When you click the Predict Labels button, the project automatically applies labels to the document using the loaded model.