# Labeling Function Analysis

Labeling function analysis is available only after you generate predictions using labeling functions. You can open the analysis in two ways:

* Click **Labeling functions**, then go to the **Analysis** tab
* Click **See labeling function analysis** after running predictions

<figure><img src="/files/WAbHK8Zswg6RZsEWvVcg" alt=""><figcaption></figcaption></figure>

If predictions have not been generated, the analysis page will show an empty state.

<figure><img src="/files/vXjhb7aEU6XR7k3e46On" alt=""><figcaption><p>Empty labeling function analysis</p></figcaption></figure>

## Metrics

<figure><img src="/files/dA36t24f96DjBNR9r4Bn" alt=""><figcaption><p>Labeling function analysis</p></figcaption></figure>

After running predictions, the following metrics are available:

* **Coverage:** The proportion of data labeled by each labeling function.
* **Overlap:** The proportion of data labeled by multiple labeling functions.
* **Conflicts:** The proportion of data where labeling functions assign different labels.

If you update or add new labeling functions, run **Predict labels** again to refresh the analysis.

<figure><img src="/files/4vZX5RVtE4ziuylimm4O" alt=""><figcaption><p>Outdated labeling function analysis</p></figcaption></figure>

## Improve labeling function performance

A good setup typically has high coverage, high overlap, and low conflicts.

### Scenario 1: Fairly high coverage, high overlap, and high conflicts

This means many data points are labeled, and multiple labeling functions often assign labels to the same data, but they frequently disagree.

**How to improve:**

We need to train the label model to get the performance value between labeling functions. The performance value of labeling functions could estimate accuracies and correlations between labeling functions since we know some labeling functions could give high or low signals regarding the label.

### Scenario 2: Low coverage, high overlap, and high conflicts

This means only a small portion of data is labeled, but existing labeling functions often overlap and disagree.

**How to improve:**

* Add more labeling functions to increase coverage.
* Identify labeling functions that cause conflicts and refine or remove them.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.datasaur.ai/assisted-labeling/data-programming/labeling-function-analysis.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
