07.02.2024: Function Call Evaluations ⚒️

Available in Phoenix 4.6+

We are introducing a new built-in function call evaluator that scores the function/tool-calling capabilities of your LLMs. This off-the-shelf evaluator will help you ensure that your models are not just generating text but also effectively interacting with tools and functions as intended.

This evaluator checks for issues arising from function routing, parameter extraction, and function generation.

Check out a full walkthrough of the evaluator.

Previous07.03.2024: Datasets & Experiments 🧪

Last updated 14 days ago

Was this helpful?