Summarization Eval
When To Use Summarization Eval Template
This Eval helps evaluate the summarization results of a summarization task. The template variables are:
document: The document text to summarize
summary: The summary of the document
Summarization Eval Template
We are continually iterating our templates, view the most up-to-date template on GitHub. Last updated on 10/12/2023
Benchmark Results
GPT-4 Results
GPT-3.5 Results
Claud V2 Results
How To Run the Eval
The above shows how to use the summarization Eval template.
Eval Summary | GPT-4 | GPT-3.5 | GPT-3.5 Instruct | Palm 2 (Text Bison) | Claud V2 | Llama 7b (soon) |
---|---|---|---|---|---|---|
Precision | 0.79 | 1 | 1 | 0.57 | 0.75 | |
Recall | 0.88 | 0.1 | 0.16 | 0.7 | 0.61 | |
F1 | 0.83 | 0.18 | 0.280 | 0.63 | 0.67 |
Last updated