Export Data to Notebook

Easily share data when you discover interesting insights so your data science team can perform further investigation or kickoff retraining workflows.

Oftentimes, the team that notices an issue in their model, for example a prompt/response LLM model, may not be the same team that continues the investigations or kicks off retraining workflows.

To help connect teams and workflows, Arize enables continued analysis of production data in a notebook environment for fine tuning workflows.

For example, a user may have noticed in Arize that this prompt template is not performing well.

They can then easily augment and fine tune the data and verify improved performance, before deploying back to production.

There are two ways export data for further investigation:

The easiest way is to click the export button on the Embeddings and Datasets pages. This will produce a code snippet that you can copy into a Python environment and install Phoenix. This code snippet will include the date range you have selected in the Arize platform, in addition to the datasets you have selected.

Users can also query Arize for data directly using the Arize Python export client. We recommend doing this once you're more comfortable with the in-platform export functionality, as you will need to manually enter in the data ranges and datasets you want to export.

os.environ['ARIZE_API_KEY'] = ARIZE_API_KEY

from datetime import datetime

from arize.exporter import ArizeExportClient
from arize.utils.types import Environments

client = ArizeExportClient()

primary_df = client.export_model_to_df(
    space_id='U3BhY2U6NzU0',
    model_id='test_home_prices_LLM',
    environment=Environments.PRODUCTION,
    start_time=datetime.fromisoformat('2023-02-11T07:00:00.000+00:00'),
    end_time=datetime.fromisoformat('2023-03-14T00:59:59.999+00:00'),
    # optional arguments
    include_actuals = False,
    model_version = None,
    batch_id = None
)

Last updated 1 year ago

Was this helpful?

os.environ['ARIZE_API_KEY'] = ARIZE_API_KEY from datetime import datetime from arize.exporter import ArizeExportClient from arize.utils.types import Environments client = ArizeExportClient() primary_df = client.export_model_to_df( space_id='U3BhY2U6NzU0', model_id='test_home_prices_LLM', environment=Environments.PRODUCTION, start_time=datetime.fromisoformat('2023-02-11T07:00:00.000+00:00'), end_time=datetime.fromisoformat('2023-03-14T00:59:59.999+00:00'), # optional arguments include_actuals = False, model_version = None, batch_id = None )