Evaluations
Run code and LLM evaluations to measure performance
Run online evals in the Arize UI
Run offline evals in code (a minimal sketch follows the guide list below)
Evaluate code functionality
Evaluate hallucination
Evaluate human ground truth vs. AI
Evaluate Q&A correctness
Evaluate RAG
Evaluate reference links
Evaluate relevance
Evaluate SQL correctness
Evaluate tool calling
Evaluate toxicity
Evaluate user frustration
Handle errors with evals
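As a quick illustration of running offline evals in code, here is a minimal sketch using the open-source `arize-phoenix-evals` package (`pip install arize-phoenix-evals openai pandas`). The dataframe columns, the `gpt-4o-mini` model choice, and the sample rows are placeholder assumptions; the built-in hallucination template and rails shown here come from `phoenix.evals`, but check the guides above for the exact setup that matches your version.

```python
# Minimal offline-eval sketch (assumptions: arize-phoenix-evals installed,
# OPENAI_API_KEY set, and a newer package version whose argument names match).
import pandas as pd
from phoenix.evals import (
    HALLUCINATION_PROMPT_RAILS_MAP,
    HALLUCINATION_PROMPT_TEMPLATE,
    OpenAIModel,
    llm_classify,
)

# Example data: each row pairs a question, the retrieved context, and the
# LLM's answer. Column names match the template's {input}/{reference}/{output}.
df = pd.DataFrame(
    {
        "input": ["What is the capital of France?"],
        "reference": ["France's capital city is Paris."],
        "output": ["The capital of France is Paris."],
    }
)

model = OpenAIModel(model="gpt-4o-mini")           # LLM used as the judge
rails = list(HALLUCINATION_PROMPT_RAILS_MAP.values())  # allowed output labels

# Run the LLM-as-a-judge classifier over every row of the dataframe.
results = llm_classify(
    dataframe=df,
    model=model,
    template=HALLUCINATION_PROMPT_TEMPLATE,
    rails=rails,
    provide_explanation=True,
)
print(results[["label", "explanation"]])
```

The same pattern applies to the other evaluators listed above: swap in the matching prompt template and rails map, and keep your dataframe columns aligned with the template's variables.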