Summarization
When To Use Summarization Eval Template
This Eval helps evaluate the summarization results of a summarization task. The template variables are:
document: The document text to summarize
summary: The summary of the document
Summarization Eval Template
We are continually iterating our templates, view the most up-to-date template on GitHub.
Benchmark Results
GPT-4 Results
GPT-3.5 Results
Claud V2 Results
GPT-4 Turbo
How To Run the Eval
The above shows how to use the summarization Eval template.
Eval
GPT-4o
GPT-4
GPT-4 Turbo
Gemini Pro
GPT-3.5
GPT-3.5 Instruct
Palm 2 (Text Bison)
Claud V2
Llama 7b (soon)
Precision
0.87
0.79
0.94
0.61
1
1
0.57
0.75
Recall
0.63
0.88
0.641
1.0
0.1
0.16
0.7
0.61
F1
0.73
0.83
0.76
0.76
0.18
0.280
0.63
0.67
Last updated