MGSM

mathematics | Pending Verification

Multilingual Grade School Math (MGSM) extends the GSM8K benchmark of grade-school math word problems to 10 languages by translating a subset of its problems.

Published: 2022
Score Range: 0-100
Top Score: 91.6

MGSM Leaderboard

Rank  Model              Provider   Score  Parameters                 Released    Type
1     Claude 3.5 Sonnet  Anthropic  91.6   -                          2024-06-20  Multimodal
2     Claude 3 Opus      Anthropic  90.7   -                          2024-03-04  Multimodal
3     GPT-4o             OpenAI     90.5   -                          2024-05-13  Multimodal
4     Claude 3.5 Haiku   Anthropic  85.6   -                          2024-10-22  Multimodal
5     Claude 3 Sonnet    Anthropic  83.5   -                          2024-03-04  Multimodal
6     DeepSeek-V3        DeepSeek   79.8   671B total, 37B activated  2024-12-26  Text
7     Claude 3 Haiku     Anthropic  75.1   -                          2024-03-04  Multimodal

About MGSM

Methodology

MGSM scores are reported on a scale of 0 to 100, with higher scores indicating better performance. This page does not describe how the scores are computed; for details of the scoring method, refer to the original paper.
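As a rough illustration of how a 0 to 100 score of this kind can be computed, the sketch below treats the score as exact-match accuracy on final answers, averaged with equal weight across languages. Both the aggregation rule and the names `language_accuracy`, `mgsm_style_score`, and `toy_results` are assumptions made for this example, not the procedure used for the leaderboard above. The data is invented toy data.

```python
from statistics import mean


def language_accuracy(predictions, references):
    """Percentage (0-100) of problems whose predicted answer equals the reference."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("at least one problem is required")
    correct = sum(p == r for p, r in zip(predictions, references))
    return 100.0 * correct / len(references)


def mgsm_style_score(results):
    """Unweighted mean of per-language accuracies (assumed aggregation rule)."""
    return mean(language_accuracy(preds, refs) for preds, refs in results.values())


# Hypothetical toy data: language code -> (model answers, reference answers).
toy_results = {
    "en": ([18, 3, 70, 5], [18, 3, 70, 5]),  # 4 of 4 correct
    "de": ([18, 3, 64, 5], [18, 3, 70, 5]),  # 3 of 4 correct
}

score = mgsm_style_score(toy_results)
print(f"{score:.1f}")
```

Averaging per language rather than pooling all problems gives every language the same weight regardless of how many problems each contains. A real evaluation would also need to extract the final numeric answer from the model's free-text output, which this sketch skips.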

Publication

This benchmark was published in 2022.

Related Benchmarks