Online Evaluation

⚠️

This page is a work in progress and will be released in the coming days.

Online evaluation is a way to evaluate the quality of your LLM application in real-time. The core motivation is to:

Monitor quality, cost, latency, and security issues in real-time
Learn from production data to improve prompts, tools, and other application components
Debug issues upon negative evaluation scores or user feedback

How to get started

Production traces are the source of truth for online evaluation. See Observability section on how to get started with tracing.

Depending on your use case, you can set up online evals in the following ways:

Was this page helpful?