Evals on Traces
Run evaluations on your trace and span data
Evaluations help you understand how your LLM application performs. You can measure it across several dimensions, such as correctness, hallucination, relevance, faithfulness, and latency. These measurements help you ship LLM applications that are reliable, accurate, and fast.
As your application grows and the volume of production logs increases, manually managing the data can become challenging. Online evaluation tasks automatically tag new spans with evaluation labels as soon as the data arrives in the platform.
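To make the idea of automatic tagging concrete, here is a minimal, platform-independent sketch in plain Python. It is not the platform's API: the `Span` class, the `eval_labels` field, the two evaluator functions, and the 2000 ms threshold are all hypothetical names and values chosen for illustration. It only shows the general pattern of running a set of evaluators over each newly arrived span and attaching the resulting labels.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List


@dataclass
class Span:
    """Hypothetical, simplified span record (not a real platform schema)."""
    span_id: str
    input: str
    output: str
    latency_ms: float
    eval_labels: Dict[str, str] = field(default_factory=dict)


# An evaluator takes a span and returns a label string.
Evaluator = Callable[[Span], str]


def latency_eval(span: Span, threshold_ms: float = 2000.0) -> str:
    """Label a span by latency; the threshold is an arbitrary example value."""
    return "fast" if span.latency_ms <= threshold_ms else "slow"


def non_empty_eval(span: Span) -> str:
    """Label a span by whether the model produced any non-whitespace output."""
    return "answered" if span.output.strip() else "empty"


def tag_new_spans(spans: Iterable[Span], evaluators: Dict[str, Evaluator]) -> List[Span]:
    """Run every evaluator on every incoming span and store the labels on it."""
    tagged = []
    for span in spans:
        for name, evaluator in evaluators.items():
            span.eval_labels[name] = evaluator(span)
        tagged.append(span)
    return tagged


if __name__ == "__main__":
    incoming = [
        Span("s1", "What is 2 + 2?", "4", latency_ms=350.0),
        Span("s2", "Summarize this document.", "   ", latency_ms=5200.0),
    ]
    evaluators = {"latency": latency_eval, "response": non_empty_eval}
    for span in tag_new_spans(incoming, evaluators):
        print(span.span_id, span.eval_labels)
```

In a real deployment the evaluators for dimensions such as hallucination or relevance would typically call an LLM or another model rather than apply fixed rules; the simple rules here just keep the sketch self-contained.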