12.07.2023
Last updated
Was this helpful?
Last updated
Was this helpful?
Users can now iterate on prompts in the Prompt Playground using the AWS Bedrock integration. This integration allows users to iterate on prompt templates, parameters, and variables in the platform and compare responses. Additionally, users can now compare LLM providers by comparing prompt runs between LLMs. Learn more here.
Users can now visualize an ROC curve with calculated AUC for their model from within Arize on the Performance Tracing page. Simply select the "More Charts" tab, and then "ROC Curve".
A new dashboard template has been added for easier analysis of drift across all your model features. To access this new template, create a new dashboard, select the "Drift" tab, and then select the "Feature Drift" template.
Able to ingest a list of strings ([item_1, item_2, item_3])
and view metrics
New metrics support: Percent Empty
and Average List Length
(data quality metric)
Filters: on every filter page you can select contains
Valid for all ingestion methods (except CSV)
Model navigation can now be found (and collapsed) on the left side of the UI for an improved experience. With this design, we have also categorized groupings of tabs so you can more easily find what you're looking for.
Arize's new greyscale dark mode new dark mode introduces a richer, greyscale palette that minimizes distractions and is optimized for work in low-light environments, reducing eye strain for extended evening hours.
To understand model performance across features/tags of your data, users now can group by features or tags on dashboard line plots. To do this, simply create a Time Series plot, and turn on "Group metric by feature or tag".
To help users better understand where there data is during data ingestion, users will now be navigated to a loading page as Arize ingests their data.
The latest ebooks, self-guided course modules, and technical posts on topics like LLM evaluation and beyond:
The Definitive LLM Observability Checklist
Build Versus Buy: LLMOps and Observability
LLM Observability 101: Ebook
Prompt Templates, Functions and Prompt Window Management
RAG Retrieval: Benchmarking Evaluation
LLM Semantic Search: VectorFlow Code-Along Tutorial