Sampling

Sampling can be used to control the volume of traces collected by Langfuse. Sampling is handled client-side.

You can configure the sample rate by setting the LANGFUSE_SAMPLE_RATE environment variable or by using the sample_rate/sampleRate constructor parameter. The value has to be between 0 and 1.

The default value is 1, meaning that all traces are collected. A value of 0.2 means that only 20% of the traces are collected. The SDK samples on the trace level meaning that if a trace is sampled, all observations and scores within that trace will be sampled as well.

With Python SDK v3, you can configure sampling when initializing the client:

from langfuse import Langfuse, get_client
import os
 
# Method 1: Set environment variable
os.environ["LANGFUSE_SAMPLE_RATE"] = "0.5"  # As string in env var
langfuse = get_client()
 
# Method 2: Initialize with constructor parameter then get client
Langfuse(sample_rate=0.5)  # 50% of traces will be sampled
langfuse = get_client()

When using the @observe() decorator:

from langfuse import observe, Langfuse, get_client
 
# Initialize the client with sampling
Langfuse(sample_rate=0.3)  # 30% of traces will be sampled
 
@observe()
def process_data():
    # Only ~30% of calls to this function will generate traces
    # The decision is made at the trace level (first span)
    pass

If a trace is not sampled, none of its observations (spans or generations) or associated scores will be sent to Langfuse, which can significantly reduce data volume for high-traffic applications.

When using the @observe() decorator:

from langfuse.decorators import langfuse_context, observe
 
os.environ["LANGFUSE_SAMPLE_RATE"] = '0.5'
 
@observe()
def fn():
    pass
 
fn()

When using the low-level SDK:

from langfuse import Langfuse
 
# Either set the environment variable or the constructor parameter. The latter takes precedence.
os.environ["LANGFUSE_SAMPLE_RATE"] = '0.5'
langfuse = Langfuse(sample_rate=0.5)
 
trace = langfuse.trace(
  name="Rap Battle",
)

import { Langfuse } from "langfuse";
 
const langfuse = new Langfuse({
  sampleRate: 0.5,
});

See JS/TS SDK docs for more details.

When using the Python SDK v3, the sample rate provided on client initialization will apply to all event inputs and outputs regardless of the Langfuse-maintained integration you are using.

See the Python SDK v3 tab for more details.

When using the OpenAI SDK Integration with Python SDK v2:

# Either set the environment variable or configure the openai import. The latter takes precedence.
os.environ["LANGFUSE_SAMPLE_RATE"] = '0.5'
 
from langfuse.openai import openai
openai.langfuse_sample_rate = 0.5
 
completion = openai.chat.completions.create(
  name="test-chat",
  model="gpt-3.5-turbo",
  messages=[
    {"role": "system", "content": "You are a calculator."},
    {"role": "user", "content": "1 + 1 = "}],
)

import OpenAI from "openai";
import { observeOpenAI } from "langfuse";
 
const openai = observeOpenAI(new OpenAI(), {
  clientInitParams: {
    sampleRate: 0.5,
  },
});

See OpenAI Integration (JS/TS) for more details.

When using the Python SDK v3, the sample rate provided on client initialization will apply to all event inputs and outputs regardless of the Langfuse-maintained integration you are using.

See the Python SDK v3 tab for more details.

When using the CallbackHandler with Python SDK v2:

from langfuse.callback import CallbackHandler
 
# Either set the environment variable or the constructor parameter. The latter takes precedence.
os.environ["LANGFUSE_SAMPLE_RATE"] = '0.5'
handler = CallbackHandler(
  sample_rate=0.5
)

import { CallbackHandler } from "langfuse-langchain";
 
const handler = new CallbackHandler({
  sampleRate: 0.5,
});

See Langchain Integration (JS/TS) for more details.

When using the Vercel AI SDK Integration

instrumentation.ts

import { registerOTel } from "@vercel/otel";
import { LangfuseExporter } from "langfuse-vercel";
 
export function register() {
  registerOTel({
    serviceName: "langfuse-vercel-ai-nextjs-example",
    traceExporter: new LangfuseExporter({ sampleRate: 0.5 }),
  });
}

GitHub Discussions

Releases & Versioning Token & Cost Tracking

Was this page helpful?

Support