Dataset

Overview

The Dataset page in LLM Labs collects all datasets available for automated evaluation or fine-tuning, providing a centralized location for managing your data.

Prerequisites

Dataset must be formatted as a CSV (Comma Separated Value) file with the following two columns:

  1. prompt: This column contains the input prompt that you will feed to your LLM.

  2. expected completion: This column holds the desired or ideal output that your LLM should generate in response to the given prompt.

Create dataset

  1. Navigate to the Dataset menu on the left sidebar.

  2. Click the Create dataset button.

    Create dataset
  3. Enter a dataset name, then click Create.

    Dataset name
  4. Click Upload dataset button and select a .csv containing 2 columns: prompts and expected completion.

  5. Once the file is uploaded, the dataset will be automatically added to the table.

    Dataset created

Modify dataset item

Once a dataset is uploaded, you can add more, edit, or delete dataset items.

Add more dataset items

  1. Click Add dataset button next to the Search field.

  2. Upload the .csv file.

  3. The additional datasets will be added to the table, and the existing ones remain.

    Adding more dataset

Edit dataset item

  1. Find the dataset item you want to edit using the search field, right-click on the row, then choose Edit.

  2. Modify the necessary details, then press Enter to apply the updates.

Delete dataset item

  1. Find the dataset item you want to edit using the search field, right-click on the row, then choose Delete.

  2. The dataset item will be deleted immediately.

Please note that this action cannot be undone.

Delete entire dataset

  1. In the main Dataset page, find the dataset you want to delete using the search field or filter options. Click the More menu (three-dots icon), then select Delete option.

    Delete dataset option
  2. Confirm the deletion by clicking the Delete button.

To delete multiple datasets:

  1. Select the datasets and click the Delete button above the table.

  2. Confirm the deletion by clicking the Delete button.

Access via Automated evaluation

Once you've created the dataset, it will be available for use in the Automated evaluation projects. Learn more about Automated evaluation.

Click Use existing dataset in Step 1 when creating an automated evaluation project.

A dialog will appear where you can choose the dataset to use for the project.

Access via Fine-tuning

Once you've created the dataset, it will be available for use for fine-tuning base models. Learn more about Fine-tuning models.

Click Use existing dataset in Step 1 when configuring fine-tuning.

A dialog will appear where you can choose a dataset for fine-tuning.

Last updated