
Human Annotation for LLM apps

Collaborate with your team and add scores via the Langfuse UI. You can add scores to both traces and observations within a trace.

Why label data manually?
  • Collaboration: Enable team collaboration by inviting other internal members to annotate a subset of traces and observations. This manual evaluation can enhance the overall accuracy and reliability of your results by incorporating diverse perspectives and expertise. See Annotation queues to manage and prioritize the annotation tasks effectively.
  • Annotation data consistency: Create score configurations for annotation workflows to ensure that all team members are using standardized scoring criteria. Configure categorical, numeric, or boolean score types to capture different aspects of your data.
  • Evaluation of new product features: This feature can be useful for new use cases where no other scores have been collected yet.
  • Benchmarking of other scores: Establish a human baseline score that can be used as a benchmark to compare and evaluate other scores. This can provide a clear standard of reference and enhance the objectivity of your performance evaluations.

Langfuse supports two annotation workflows: annotating single traces and observations directly from the trace detail view, and managing annotation tasks at scale via annotation queues.

From any trace, you can annotate scores on different dimensions. See the step-by-step guide below for details.

Annotate

1. Annotation of single traces

Where is this feature available?
  • Hobby
  • Pro
  • Team
  • Self Hosted

Create score configurations

To use annotation, you need to create a score configuration. You can create multiple score configurations for different types of scores. Score configurations are immutable. However, you can archive configs if you no longer want to use them in annotation. Archived configs can be restored at any time.

To create a score configuration:

  • Navigate to Settings, locate the Score Configs table, and click Add new score config.
  • Specify the name of the score configuration and the type of score you want to create. You can choose between Categorical, Numeric, and Boolean score types.
  • Optionally, add a description to provide additional context for your team members.

Your configs are now available for annotation of traces and observations.

Create config
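
If you prefer to manage score configurations as code, they can also be created programmatically. The sketch below is a minimal example in Python and assumes a POST /api/public/score-configs endpoint with Basic Auth (public key as username, secret key as password); the field names used here (dataType, minValue, maxValue) are assumptions, so check the API reference for the authoritative schema.

```python
import os
import requests

# Assumption: the public API exposes /api/public/score-configs and uses Basic Auth.
LANGFUSE_HOST = os.environ.get("LANGFUSE_HOST", "https://cloud.langfuse.com")
auth = (os.environ["LANGFUSE_PUBLIC_KEY"], os.environ["LANGFUSE_SECRET_KEY"])

# A numeric config with a bounded range, e.g. a 1-5 accuracy rating.
numeric_config = {
    "name": "accuracy",
    "dataType": "NUMERIC",
    "minValue": 1,
    "maxValue": 5,
    "description": "How factually accurate is the response? (1 = poor, 5 = excellent)",
}

resp = requests.post(
    f"{LANGFUSE_HOST}/api/public/score-configs",
    auth=auth,
    json=numeric_config,
    timeout=10,
)
resp.raise_for_status()
print(resp.json())
```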

Data Labelling on LLM traces or observations

To annotate a trace or observation:

  • Navigate to the trace detail view.
  • Click on the Annotate button.
  • Select the scores you want to add.
  • Fill in the score values. The scores will be saved automatically. Annotation scores can be edited or deleted at any time.
  • Optionally, add comments to individual scores.

Annotate
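
If you want to attach scores programmatically instead of via the Annotate drawer (for example, to backfill labels collected outside Langfuse), a minimal sketch using the Python SDK is shown below. It assumes the v2-style langfuse.score method; scores created via the SDK are typically recorded with an API source rather than as UI annotations.

```python
from langfuse import Langfuse

# Reads LANGFUSE_PUBLIC_KEY, LANGFUSE_SECRET_KEY, LANGFUSE_HOST from the environment.
langfuse = Langfuse()

# Attach a numeric score to a trace. trace_id is a placeholder; use the id of the
# trace you want to label. An optional observation_id scopes the score to a single
# observation within the trace.
langfuse.score(
    trace_id="trace-id-to-label",
    name="accuracy",
    value=4,
    comment="Mostly correct, one minor factual slip.",
)
```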

View scores on trace or observation

Upon completing annotation, click on the Scores tab to view a table of all the scores that have been added to the trace or observation.

Detail scores table
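
The same scores can also be retrieved programmatically, for example to export annotation results for analysis. The sketch below assumes a GET /api/public/scores endpoint with Basic Auth and a name filter; verify the exact parameter names in the API reference.

```python
import os
import requests

LANGFUSE_HOST = os.environ.get("LANGFUSE_HOST", "https://cloud.langfuse.com")
auth = (os.environ["LANGFUSE_PUBLIC_KEY"], os.environ["LANGFUSE_SECRET_KEY"])

# Fetch scores filtered by name; the pagination parameters are assumptions.
resp = requests.get(
    f"{LANGFUSE_HOST}/api/public/scores",
    auth=auth,
    params={"name": "accuracy", "page": 1, "limit": 50},
    timeout=10,
)
resp.raise_for_status()
for score in resp.json().get("data", []):
    print(score.get("traceId"), score.get("name"), score.get("value"))
```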

2. Annotation Queues

Where is this feature available?
  • Hobby
  • Pro
  • Team
  • Self Hosted (Enterprise)

Annotation queues allow you to manage and prioritize your annotation tasks in a structured way. This feature is particularly useful for large-scale projects that benefit from human-in-the-loop evaluation at scale. Queues streamline this process by letting you specify which traces/observations you’d like to annotate and on which dimensions.

Create annotation queues

Set up annotation queues to specify which traces/observations you’d like to annotate on which dimensions. Queues are fully mutable and editable even after annotation tasks have been created. You may also add/remove annotation tasks to/from queues at any time.

Populate annotation queues

Once you have created annotation queues, you can assign traces or observations to them. The easiest way to do this at scale is to navigate to the trace table view, optionally adjust the filters to narrow down the traces/observations you’d like to annotate, and then use the Actions > Add to queue button in the top right corner. You can also add single traces and observations to queues via the Annotate dropdown on the detail view.
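
For automated workflows (for example, routing low-confidence traces from an automated evaluation into a queue for human review), queue items can likely also be created via the public API. The sketch below is assumption-heavy: it assumes a POST /api/public/annotation-queues/{queueId}/items endpoint accepting an objectId and objectType; verify the exact path and payload in the API reference.

```python
import os
import requests

LANGFUSE_HOST = os.environ.get("LANGFUSE_HOST", "https://cloud.langfuse.com")
auth = (os.environ["LANGFUSE_PUBLIC_KEY"], os.environ["LANGFUSE_SECRET_KEY"])

# Both the endpoint path and the payload fields below are assumptions;
# queue_id is the id of an existing annotation queue.
queue_id = "my-annotation-queue-id"
item = {"objectId": "trace-id-to-review", "objectType": "TRACE"}

resp = requests.post(
    f"{LANGFUSE_HOST}/api/public/annotation-queues/{queue_id}/items",
    auth=auth,
    json=item,
    timeout=10,
)
resp.raise_for_status()
print(resp.json())
```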

Process annotation tasks

Navigate to the Annotate tab to view all the annotation queues that have been created. Select the queue you’d like to process. You will see all annotation tasks sequentially in the queue. Our queues are designed to be first-in-first-out (FIFO) and are fully concurrency safe. After adding scores on the defined dimensions, click Complete + next to move to the next annotation task. To see an overview of all annotation tasks in a given queue, click on the queue name in the queue table. This overview shows the status of each task.
