MGSM
mathematicsPending Verification
Multilingual Grade School Math (MGSM) extends GSM8K to 10 languages.
Published: 2022
Score Range: 0-100
Top Score: 91.6
MGSM Leaderboard
Rank | Model | Provider | Score | Parameters | Released | Type |
---|---|---|---|---|---|---|
1 | Claude 3.5 Sonnet | Anthropic | 91.6 | 2024-06-20 | Multimodal | |
2 | Claude 3 Opus | Anthropic | 90.7 | 2024-03-04 | Multimodal | |
3 | GPT-4o | OpenAI | 90.5 | 2024-05-13 | Multimodal | |
4 | Claude 3.5 Haiku | Anthropic | 85.6 | 2024-10-22 | Multimodal | |
5 | Claude 3 Sonnet | Anthropic | 83.5 | 2024-03-04 | Multimodal | |
6 | DeepSeek-V3 | DeepSeek | 79.8 | 671B total, 37B activated | 2024-12-26 | Text |
7 | Claude 3 Haiku | Anthropic | 75.1 | 2024-03-04 | Multimodal |
About MGSM
Methodology
MGSM evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.
Publication
This benchmark was published in 2022.Read the full paper