Export Data to Notebook
Easily share data when you discover interesting insights so your data science team can perform further investigation or kickoff retraining workflows.
Oftentimes, the team that notices an issue in their model, for example a prompt/response LLM model, may not be the same team that continues the investigations or kicks off retraining workflows.
To help connect teams and workflows, Arize enables continued analysis of production data in a notebook environment for fine tuning workflows.
For example, a user may have noticed in Arize that this prompt template is not performing well.
Prompt Template: "You are an agent created to accurately translate sentences into the desired language."
With a few lines of Python code, users can export this data into Phoenix or a Jupyter notebook for further analysis. This allows team members, such as data scientists, who may not have access to production data today, an easy way to access relevant product data for further analysis in an environment they are familiar with.
They can then easily augment and fine tune the data and verify improved performance, before deploying back to production.
Phoenix is Arize's open source ML observability library designed for the notebook, helping visualize, troubleshoot, and monitor your LLM, CV, NLP and tabular models.
There are two ways export data for further investigation:
- 1.The easiest way is to click the export button on the Embeddings and Datasets pages. This will produce a code snippet that you can copy into a Python environment and install Phoenix. This code snippet will include the date range you have selected in the Arize platform, in addition to the datasets you have selected.
Export button on Embeddings tab
Export to Phoenix example
- 2.Users can also query Arize for data directly using the Arize Python export client. We recommend doing this once you're more comfortable with the in-platform export functionality, as you will need to manually enter in the data ranges and datasets you want to export.
os.environ['ARIZE_API_KEY'] = ARIZE_API_KEY
from datetime import datetime
from arize.exporter import ArizeExportClient
from arize.utils.types import Environments
client = ArizeExportClient()
primary_df = client.export_model_to_df(