LogoLogo
Python SDKSlack
  • Documentation
  • Cookbooks
  • Self-Hosting
  • Release Notes
  • Reference
  • Code Examples
    • Applications
      • Agents
      • RAG
      • Voice
    • Tracing
    • Evaluations
    • Experiments
      • Summarization
      • Text2SQL
    • Guardrails
  • Guides
    • AI Research
Powered by GitBook

Support

  • Chat Us On Slack
  • support@arize.com

Get Started

  • Signup For Free
  • Book A Demo

Copyright © 2025 Arize AI, Inc

On this page

Was this helpful?

  1. Code Examples

Evaluations

Last updated 28 days ago

Was this helpful?

Run code and LLM evaluations to measure performance

Run online evals in the Arize UI

Guide

Run offline evals in code

Evaluate code functionality

Evaluate hallucination

Evlauate human ground truth vs. AI

Evaluate Q&A correctness

Evaluate RAG

Evaluate reference links

Evaluate relevance

Evaluate SQL correctness

Evaluate tool calling

Evaluate toxicity

Evaluate user frustration

Handle errors with evals

Colab Link
Colab Link
Colab Link
Colab Link
Colab Link
Colab Link
Colab Link
Colab Link
Colab Link
Colab Link
Colab Link
Colab Link
Colab Link