AGIEval (English)

reasoning | Pending Human Review

A benchmark for evaluating foundation models on human-centric tasks.

Published: 2023

Score Range: 0-100

Top Score: N/A

AGIEval (English) Leaderboard

Rank	Model	Provider	Score	Parameters	Released	Type
No models found with scores for this benchmark.

About AGIEval (English)

Methodology

AGIEval (English) reports scores on a scale of 0 to 100, where higher scores indicate better performance. This page does not describe the scoring system itself; for details of the scoring and evaluation methodology, refer to the original paper.
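
As an illustration only, a 0 to 100 score of this kind is often a percent accuracy over a set of questions. The sketch below assumes that interpretation, which this page does not confirm; the function name `percent_accuracy` and the sample answers are hypothetical and are not taken from the benchmark.

```python
def percent_accuracy(predictions, answers):
    """Return the share of exact matches as a score between 0 and 100.

    Assumption: the score is plain percent accuracy. The actual AGIEval
    scoring rules are defined in the original paper, not here.
    """
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("cannot score an empty answer list")
    # Count positions where the predicted label equals the reference label.
    correct = sum(p == a for p, a in zip(predictions, answers))
    return 100.0 * correct / len(answers)


# Hypothetical data: four multiple-choice questions, three answered correctly.
score = percent_accuracy(["A", "C", "B", "D"], ["A", "C", "B", "A"])
print(score)  # 75.0
```

Under this assumption, a model that answers every question correctly scores 100 and one that answers none correctly scores 0, which matches the score range listed above.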