05 Dataset
Workshop source
Workshop material is maintained in the public langfuse/langfuse-workshop repository. Use the repository for the runnable app, checkpoint branches, and local setup.
Learner guide: 05 Dataset
Instructor notes
- Frame the dataset as product scope written down: expected user inputs, expected behavior, and metadata for slicing.
- The code path is intentionally simple: seed curated JSON into Langfuse, then inspect the hosted dataset.
- Connect this chapter back to monitoring: good dataset items often come from surprising production traces.
Demo rhythm
- Open
data/seed-dataset.jsonand point outinput,expectedOutput, andmetadata. - Run
npm run dataset:seed. - Open the dataset list and item table in Langfuse.
Watch for
- Learners expecting exact-answer matching. The ideal answer is reference material for evaluators, not a string equality target.
- Duplicate dataset items if the seed script is run repeatedly against the same project.
Was this page helpful?