Changelog

See the latest new features released in Arize

Column selection in prompt playground

May 5, 2025

You can now view all of your prompt variables and dataset values directly in playground!

Latency and token counts in prompt playground

May 2, 2025

We've added latency and token counts to prompt playground runs! Currently supported for OpenAI, with more providers to come!

Major design refresh in Arize AX

April 28, 2025

We've refreshed Arize AX with polished fonts, spacing, color, and iconography throughout the whole platform.

Custom code evaluators

April 26, 2025

You can now run your own custom Python code evaluators in Arize against your data in a secure environment. Use background tasks to run any custom code, such as URL validation or keyword matching.
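
As a rough illustration of the kind of evaluator this enables, here is a minimal keyword-and-URL check. The row field names and return shape are assumptions for the sketch, not the exact interface Arize passes to custom evaluators:

import re

URL_PATTERN = re.compile(r"https?://\S+")

def evaluate(row: dict) -> dict:
    """Hypothetical custom code evaluator: flag outputs that miss a
    required keyword or contain non-HTTPS links."""
    output = (row.get("output") or "").lower()  # assumed field name
    has_keyword = "refund" in output            # keyword match check
    urls_ok = all(u.startswith("https://") for u in URL_PATTERN.findall(output))
    passed = has_keyword and urls_ok
    return {"label": "pass" if passed else "fail", "score": float(passed)}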

Security audit logs for enterprise customers

April 25, 2025

Improve your compliance and policy adherence: you can now use audit logs to monitor data access in Arize. Note: this feature is completely opt-in, and tracking is not enabled unless a customer explicitly asks for it.

Larger dataset runs in prompt playground

April 24, 2025

We've increased the row limit for datasets in the playground, so you can run prompts in parallel on up to 100 examples.

Evaluations on experiments

April 24, 2025

You can now create and run evals on your experiments from the UI. Compare performance across different prompt templates, models, or configurations without code.

Cancel running background tasks

April 24, 2025

When running evaluations using background tasks, you can now cancel them mid-flight while observing task logs.

Improved UI for functions in prompt playground

April 21, 2025

We've made it easier to view, test, and validate your tool calls in prompt playground.

Compare prompts side by side

April 15, 2025

Compare the outputs of a new prompt and the original prompt side-by-side. Tweak model parameters and compare results across your datasets.

Image segmentation support for CV models

April 14, 2025

We now support logging image segmentation to Arize. Log your segmentation coordinates and compare your predictions vs. your actuals.

New time selector on your traces

April 11, 2025

We've made it way easier to drill into specific time ranges, with quick presets like "last 15 minutes" and custom shorthand for specific dates and times, such as 10d, 4/1 - 4/6, or 4/1 3:00am.

Prompt hub python SDK

April 7, 2025

Access and manage your prompts in code with support for OpenAI and VertexAI.

pip install "arize[PromptHub]"
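
As a sketch of what this enables in code (the client and method names below are assumptions about the SDK's shape, so consult the SDK reference for the exact interface):

from arize.experimental.prompt_hub import ArizePromptClient  # assumed import path

# Hypothetical flow: connect to the hub, pull a prompt by name, render it.
client = ArizePromptClient(
    space_id="YOUR_SPACE_ID",            # placeholder
    developer_key="YOUR_DEVELOPER_KEY",  # placeholder
)
prompt = client.pull_prompt("support-triage")  # assumed method name
messages = prompt.format(question="Where is my order?")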

View task run history and errors

April 4, 2025

Get full visibility into your evaluation task runs, including when they ran, what triggered them, and whether there were errors.

Run evals and tasks over a date range

April 2, 2025

Easily run your online evaluation tasks over historical data.

Test online evaluation tasks in playground

March 24, 2025

Quickly debug and refine the prompts used by your online evaluators by loading them prefilled into prompt playground.

Select metadata on the sessions page

March 1, 2025

Dynamically select the fields you want to see in your sessions view.

Labeling queues

February 27, 2025

Use Arize to annotate your data with 3rd parties.

Expand and collapse your traces

February 20, 2025

You can now collapse rows to see more data at a glance or expand them to view more text.

Schedule your monitors

February 14, 2025

Schedule monitors to run hourly, daily, weekly, or monthly.

Improved traces export

February 14, 2025

Specify which columns of data you'd like to export when exporting via the ArizeExportClient by passing columns.

from datetime import datetime

from arize.exporter import ArizeExportClient
from arize.utils.types import Environments

client = ArizeExportClient()  # API key can also be passed explicitly

primary_df = client.export_model_to_df(
    columns=['context.span_id', 'attributes.llm.input'],  # <---- HERE
    space_id='',
    model_id='',
    environment=Environments.TRACING,
    start_time=datetime(2025, 3, 25),
    end_time=datetime(2025, 4, 25),
)

Create dataset from CSVs

February 14, 2025

You can now create datasets through many methods: from traces, from code, manually in the UI, or via CSV upload.

OTEL tracing via HTTP

February 14, 2025

Support for HTTP when sending traces to Arize! See GitHub for more info.

from arize.otel import register, Transport

tracer_provider = register(
    endpoint="https://otlp.arize.com/v1/traces",     # NEW
    transport=Transport.HTTP,                        # NEW
    space_id=SPACE_ID,
    api_key=API_KEY,
    project_name="test-project-http",
)

Voice application tracing and evaluation

January 21, 2025

Audio tracing: capture, process, and send audio data to Arize and observe your application behavior.

Evaluation: assess how well your models identify emotional tones like frustration, joy, or neutrality.

Dashboard colors

January 21, 2025

We’ve added new ways to plot your charts, with custom colors and better UX!

Prompt hub

December 19, 2024

Manage, iterate, and deploy your prompts in one place. Version control your templates and use them across playground, tasks, and APIs.

Managed code evaluators

December 19, 2024

Use our pre-built, off-the-shelf evaluators to evaluate spans without requiring requests to an LLM-as-a-Judge. These include Regex matching, JSON validation, Contains keyword, and more!

Create experiments from playground

December 19, 2024

Quickly experiment with your prompts across your datasets. All you have to do is click "Save as experiment".

Monitor alert status

December 19, 2024

See exactly how and when your monitors are triggered.

LangChain Instrumentation

December 19, 2024

Support for sessions via LangChain native thread tracking in TypeScript is now available. Easily track multi-turn conversations / threads using LangChain.js.

Analyze your spans with Copilot

December 05, 2024

Extract key insights quickly from your spans instead of trying to decipher meaning in hundreds of spans. Ask questions and run evals right in the trace view.

Generate dashboards with Copilot

December 05, 2024

Building dashboard plots just got way easier. Create time series plots and even translate code into ready-to-go visualizations.

The Custom Metric skill now supports a conversational flow, making it easier for users to iterate and refine metrics dynamically.

View your experiment traces

December 05, 2024

Experiment traces for a dataset are now consolidated and accessed under "Experiment Projects".

Multi-class calibration chart

December 05, 2024

For your multi-class ML models, you can now see how your model is calibrated in one visualization.

Log experiments in Python SDK

December 05, 2024

You can now log experiment data manually using a dataframe, instead of running an experiment. This is useful if you already have the data you need, and re-running the query would be expensive. See the SDK Reference.

arize_client.log_experiment(
    space_id=SPACE_ID,
    experiment_name="my_experiment",
    experiment_df=experiment_run_df,
    task_columns=task_columns,
    evaluator_columns={"correctness": evaluator_columns},
    dataset_name=dataset_name,
)

Create custom metrics with Copilot

November 07, 2024

Users can generate their desired metric by having Copilot translate natural language descriptions or existing code (e.g., SQL, Python) into AQL.

Summarize embeddings with Copilot

November 07, 2024

Copilot now works for embeddings! Users can select embedding data points, and Copilot will analyze them for patterns and insights.

Local explainability support for ML models

November 07, 2024

Local explainability is now live, providing both a table view and a waterfall-style plot for detailed, per-feature SHAP values on individual predictions.

See experiment results over time

November 07, 2024

Visualize specific evaluations over time in dashboards.

Function calling replay in prompt playground

November 07, 2024

Now users can follow the full function calling tutorial from OpenAI and iterate on different functions in different messages from within the Prompt Playground.

Vercel AI auto-instrumentation

November 07, 2024

Users can now ingest traces created by the Vercel AI SDK into Arize.

Track sessions and context attributes in instrumentation

November 07, 2024

You can add metadata and context that will be picked up by all of our auto-instrumentations and added to spans.
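
For example, with the OpenInference instrumentation helpers you can wrap instrumented calls in a context manager so every span created inside it carries the same session and metadata. A minimal sketch; the IDs, metadata values, and the run_my_llm_call helper are placeholders:

from openinference.instrumentation import using_attributes

# Spans emitted by auto-instrumented calls inside this block pick up
# the session ID, user ID, and metadata below.
with using_attributes(
    session_id="session-abc-123",                # placeholder
    user_id="user-456",                          # placeholder
    metadata={"experiment": "onboarding-flow"},  # placeholder
):
    run_my_llm_call()  # any instrumented call, e.g. an OpenAI request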

Easily test your online tasks and evals

October 24, 2024

Users now have the option to test a task, such as an online eval, by running it once on existing data or by applying evaluation labels to older traces.

Experiment filters

October 24, 2024

Users can now filter experiments based on dataset attributes or experiment results, making it easy to identify areas for improvement and track their experiment progress with more precision.

Embedding traces

October 03, 2024

With Embeddings Tracing, you can effortlessly select embedding spans and dive straight into the UMAP visualizer, simplifying troubleshooting for your genAI applications.

Experiments Details Visualization

October 03, 2024

Users can now view a detailed breakdown of labels for their experiments on the Experiments Details page.

Support for o1-mini and o1-preview in playground

October 03, 2024

We've added full support for all available OpenAI models in the playground, including o1-mini and o1-preview.

Improved auto-complete in playground

October 03, 2024

We've added better input variable behavior, autocompletion enhancements, support for mustache/f-string input variables, and more.

Filter history

October 03, 2024

We now store the last three filters used by a user! Users can easily access their filter history in the query filters dropdown, making it simpler to reuse filters for future queries.

Tracing quick filters

October 03, 2024

Apply filters directly from the table by hovering over the text to reveal the filter icon.

New arize-otel package

October 03, 2024

We made it way simpler to add automatic tracing to your applications! It's now just a few lines of code to use OpenTelemetry to trace your LLM application. Check out our new quickstart guide, which uses our arize-otel package.
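
A minimal sketch of that setup, mirroring the register call shown in the OTEL tracing entry above. The space ID and API key are placeholders, and the OpenAI instrumentor is one example of an auto-instrumentation you might attach:

from arize.otel import register
from openinference.instrumentation.openai import OpenAIInstrumentor

# Create an OpenTelemetry tracer provider that exports spans to Arize.
tracer_provider = register(
    space_id="YOUR_SPACE_ID",     # placeholder
    api_key="YOUR_API_KEY",       # placeholder
    project_name="my-llm-app",
)

# Auto-instrument OpenAI calls so they appear as traces in Arize.
OpenAIInstrumentor().instrument(tracer_provider=tracer_provider)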

Easily add spans to datasets

October 03, 2024

Easily add spans to a dataset from the Traces page using the "Add to Dataset" button.

See more

2024
2023
2022
2021