AIME-2024
mathematicsPending Verification
American Invitational Mathematics Examination (AIME) 2024 problems.
Published: 2024
Score Range: 0-100
Top Score: 95.8
AIME-2024 Leaderboard
Rank | Model | Provider | Score | Parameters | Released | Type |
---|---|---|---|---|---|---|
1 | Grok 3 Mini | xAI | 95.8 | Unknown | 2025-02-19 | Multimodal |
2 | o4-mini | OpenAI | 93.4 | 2025-04-16 | Multimodal | |
3 | Qwen-3 | Alibaba | 85.7 | 235B (22B active) | 2025-04-29 | Text |
4 | o1 | OpenAI | 83.3 | 2024-09-12 | Multimodal | |
5 | Claude 3.7 Sonnet | Anthropic | 80 | 2025-02-24 | Multimodal | |
6 | DeepSeek-R1 | DeepSeek | 79.8 | 671B (37B activated) | 2025-01-20 | Text |
7 | Kimi K2 | Moonshot AI | 69.6 | 1T total, 32B activated | 2025-07-11 | Text |
8 | Grok 3 | xAI | 52.2 | Unknown (multi-trillion estimated) | 2025-02-19 | Multimodal |
9 | GPT-4.1 | OpenAI | 48.1 | 2025-04-14 | Multimodal | |
10 | DeepSeek-V3 | DeepSeek | 39.2 | 671B total, 37B activated | 2024-12-26 | Text |
About AIME-2024
Methodology
AIME-2024 evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.
Publication
This benchmark was published in 2024.Read the full paper