AGIEval (English)
reasoningPending Human Review
A benchmark for evaluating foundation models on human-centric tasks.
Published: 2023
Score Range: 0-100
Top Score: N/A
AGIEval (English) Leaderboard
| Rank | Model | Provider | Score | Parameters | Released | Type |
|---|---|---|---|---|---|---|
| No models found with scores for this benchmark. | ||||||
About AGIEval (English)
Methodology
AGIEval (English) evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.
Publication
This benchmark was published in 2023.Paper