Similarity Search
The Similarity Search feature allows you to find items that are similar to a set of reference embeddings using cosine similarity. This feature supports both image and text embeddings.
Copyright © 2023 Arize AI, Inc
Currently, similarity search is only available for images, text, and inferences. Support for traces is coming soon.
Reference Embedding: The embedding vector that serves as the baseline for similarity comparisons. Select the column containing these vectors, representing the characteristics or features you are interested in matching.
Search Embedding: The column containing embedding vectors of items to be compared against the reference embedding using cosine similarity.
Threshold: A user-defined value that determines the minimum similarity score required for an item to be considered similar to the reference embeddings.
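To make the three terms above concrete, here is a minimal pure-Python sketch of a cosine similarity search with a threshold. It is an illustration only, not the Arize implementation: the function names `cosine_similarity` and `find_similar`, the dictionary layout, and the choice to treat a score equal to the threshold as a match are all assumptions made for this example.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def find_similar(reference, search_embeddings, threshold):
    """Return (item_id, score) pairs scoring at least `threshold`, best first.

    `search_embeddings` is a hypothetical mapping of item ID to vector,
    standing in for the search embedding column.
    """
    scored = [
        (item_id, cosine_similarity(reference, vector))
        for item_id, vector in search_embeddings.items()
    ]
    # Assumption: a score exactly equal to the threshold counts as similar.
    matches = [(item_id, score) for item_id, score in scored if score >= threshold]
    return sorted(matches, key=lambda pair: pair[1], reverse=True)


reference = [1.0, 0.0]
search_embeddings = {"a": [2.0, 0.0], "b": [1.0, 1.0], "c": [0.0, 3.0]}
matches = find_similar(reference, search_embeddings, threshold=0.7)
for item_id, score in matches:
    print(item_id, round(score, 4))
```

Because cosine similarity depends only on direction, item "a" scores 1.0 even though its vector is twice as long as the reference, while the orthogonal item "c" scores 0 and is filtered out.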
Selecting an Embedding Cell Directly
Hover over an embedding column in the table view and click the “Find Similar” button.
Select points in UMAP and then press the “Find Similar” button.
Press the “Find Similar” button in dimension details after selecting an embedding or row.
Any selection automatically updates the reference object with the prediction ID and the name of the embedding column.
Multiple Embeddings
Add multiple items from any of the entry points.
When multiple embeddings are selected, their vectors will be averaged to form the reference embedding.
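The averaging step can be sketched as an element-wise mean over the selected vectors. This is a minimal illustration that assumes a plain, unweighted mean; the helper name `average_embeddings` is hypothetical, and the product may normalize or weight vectors differently.

```python
def average_embeddings(vectors):
    """Return the element-wise mean of a non-empty list of equal-length vectors."""
    if not vectors:
        raise ValueError("at least one embedding is required")
    dim = len(vectors[0])
    if any(len(v) != dim for v in vectors):
        raise ValueError("all embeddings must have the same dimension")
    # zip(*vectors) groups the i-th component of every vector together.
    return [sum(component) / len(vectors) for component in zip(*vectors)]


selected = [[1.0, 0.0, 2.0], [3.0, 4.0, 0.0]]
reference = average_embeddings(selected)
print(reference)
```

The resulting vector is then used as the single reference embedding for the cosine similarity comparison.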
Limitations
The search and reference embeddings can come from different columns, but all reference points must come from the same column; adding a reference point from a different column triggers an error modal.
Similarity search is only supported in performance tracing and embedding views.
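The same-column rule for reference points can be modeled as a small validation step. The sketch below is hypothetical: `ReferencePoint` and `ReferenceSet` are names invented for this example, mirroring the prediction ID and embedding column name that a selection records, and a `ValueError` stands in for the error shown in the UI.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class ReferencePoint:
    """One selected item: its prediction ID and the embedding column it came from."""
    prediction_id: str
    embedding_column: str


@dataclass
class ReferenceSet:
    """Collects reference points and rejects any from a different column."""
    points: list = field(default_factory=list)

    def add(self, point):
        if self.points and point.embedding_column != self.points[0].embedding_column:
            raise ValueError(
                f"reference points must share one column: expected "
                f"{self.points[0].embedding_column!r}, got {point.embedding_column!r}"
            )
        self.points.append(point)


refs = ReferenceSet()
refs.add(ReferencePoint("pred-1", "image_embedding"))
refs.add(ReferencePoint("pred-2", "image_embedding"))
try:
    refs.add(ReferencePoint("pred-3", "text_embedding"))
    rejected = False
except ValueError as err:
    rejected = True
    print(err)
```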
Define Reference Embeddings: Specify the embeddings you want to use as references. Ensure that all reference embeddings are in the same column.
Set Search Parameters: Define the search embedding column and the similarity threshold.
Execute the Search: Use the provided API to perform the similarity search and retrieve the results.
Make sure you have Arize version 7.18.1 or later installed.
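One way to check the installed version from Python is sketched below. It assumes the package is distributed under the name `arize` and uses plain `X.Y.Z` version numbers; `parse_version`, `meets_minimum`, and `installed_arize_version` are helper names invented for this example.

```python
from importlib.metadata import PackageNotFoundError, version

MINIMUM = (7, 18, 1)


def parse_version(text):
    """Turn a string like '7.18.1' into a tuple of ints, ignoring non-numeric suffixes."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum=MINIMUM):
    """Return True if the installed version string is at least `minimum`."""
    return parse_version(installed) >= minimum


def installed_arize_version():
    """Return the installed version string, or None if the package is missing."""
    try:
        return version("arize")  # assumption: the distribution name is 'arize'
    except PackageNotFoundError:
        return None


installed = installed_arize_version()
if installed is None:
    print("arize is not installed")
elif meets_minimum(installed):
    print(f"arize {installed} meets the 7.18.1 minimum")
else:
    print(f"arize {installed} is older than 7.18.1; upgrade before continuing")
```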