LLM Labs (beta)

Enable your integration with models from Datasaur LLM Lab

Easily integrate with models from Datasaur's LLM Labs. If you've already tested and deployed your experiment in the Datasaur LLM Labs Sandbox, we're here to help! Our integration allows you to use your deployed LLM Sandbox from Datasaur LLM Labs to enhance your labeling process.

Creating an LLM Labs Sandbox

To begin using ML-Assisted Labeling with LLM Labs, you first need to create and deploy a Sandbox. You can see this page to learn more details about how to deploy an LLM Labs Sandbox.

The output of the LLM Labs Sandbox must be in JSON object format, aligned with the label set defined in your NLP project. This ensures compatibility with regex-based string matching for labeling in your NLP platform.

  1. We have label set of party, date, contextual_reference, signatory.

  2. Therefore, the expected output or result from the LLM Labs Sandbox should be something like this

{
  "party": ["Quercus Capital Innovations, L.P", "Aethelberht Funds SPC"],
  "date": ["October 10, 2023"],
  "contextual_reference": [
    "Article IX Section 7(b).ii(b) of the Aethelberht Funds SPC Limited Partnership Agreement"
  ],
  "signatory": [
    "Quercus Capital Innovations, L.P. by Dr. Maximilian Edgewater, Principal"
  ]
}

Prompt example to generate the correct JSON Object format

Given the document text, please extract the following information and present it in JSON format as shown below:
- *party*: A list of parties firstly mentioned in the document.
- *date*: first mentioned date in the document. Only contain one value
- *contextual reference*: Any specific sections, articles, clauses, or contextual references cited EXPLICITLY in the document. Return complete answer
- *signatory*: One line that include name, title, organization in signatory part. Please make sure the signatory answer is precisely written in the document

Instructions Summary:
1. Extract and present the information in the specified JSON format.
2. Ensure that all extracted data is accurate and corresponds directly to the content of each document.

Return the value of extracted fields in JSON structure in plain text, following this JSON FORMAT
{
"party": [list of party answer],
"date": [list of date answer],
"contextual_reference": [list of contextual reference answer],
"signatory": [list of signatory answer]
}

VERY IMPORTANT
RETURN THE ANSWER WITHOUT ```json
EXTRACT ANSWER THAT PRECISELY WRITTEN IN DOCUMENT CONTEXT

Accessing Your Deployed LLM Labs Sandbox in ML Assisted Labelling

Follow these steps to access your deployed LLM Labs Sandbox (from Datasaur LLM Labs) on your ML Assisted Labelling:

  1. Create a custom project for Row Labeling or Span Labeling.

  2. Click "Manage Extension" on your right bar.

  3. Pop Up Manage Extension will appear and you can enable the Datasaur ML Assisted.

  4. Once enable it, select "LLM Labs" as provider and you will see the following menu:

  5. Target text: the column(s) of your targeted data for this ML assistance are based on.

  6. Target question: the column(s) you wish to answer.

  7. LLM application: your deployed LLM Sandbox name from LLM Labs.

  8. API token: your API keys to access the deployed LLM Application. You can visit LLM Labs and go to `Settings` on the left sidebar, then select the `API Keys` menu.

Prediction Process

After setting up the above options, simply click “Predict Labels” to start predicting and obtaining labels from your deployed LLM Application from Datasaur LLM Labs.

Last updated