Log Evaluation Results
This guide shows how LLM evaluation results in dataframes can be sent to Phoenix.
An evaluation must have a name
(e.g. "Q&A Correctness") and its DataFrame must contain identifiers for the subject of evaluation, e.g. a span or a document (more on that below), and values under either the score
, label
, or explanation
columns. See Evaluations for more information.
Connect to Phoenix
Before accessing px.Client(), be sure you've set the following environment variables:
If you're self-hosting Phoenix, ignore the client headers and change the collector endpoint to your endpoint.
Span Evaluations
A dataframe of span evaluations would look similar like the table below. It must contain span_id
as an index or as a column. Once ingested, Phoenix uses the span_id
to associate the evaluation with its target span.
5B8EF798A381
correct
"this is correct ..."
E19B7EC3GG02
incorrect
"this is incorrect ..."
The evaluations dataframe can be sent to Phoenix as follows. Note that the name of the evaluation must be supplied through the eval_name=
parameter. In this case we name it "Q&A Correctness".
Document Evaluations
A dataframe of document evaluations would look something like the table below. It must contain span_id
and document_position
as either indices or columns. document_position
is the document's (zero-based) index in the span's list of retrieved documents. Once ingested, Phoenix uses the span_id
and document_position
to associate the evaluation with its target span and document.
5B8EF798A381
relevant
"this is ..."
5B8EF798A381
irrelevant
"this is ..."
E19B7EC3GG02
relevant
"this is ..."
The evaluations dataframe can be sent to Phoenix as follows. Note that the name of the evaluation must be supplied through the eval_name=
parameter. In this case we name it "Relevance".
Logging Multiple Evaluation DataFrames
Multiple sets of Evaluations can be logged by the same px.Client().log_evaluations()
function call.
Specifying A Project for the Evaluations
By default the client will push traces to the project specified in the PHOENIX_PROJECT_NAME
environment variable or to the default
project. If you want to specify the destination project explicitly, you can pass the project name as a parameter.
Last updated