AIME-2025

mathematicsPending Human Review

American Invitational Mathematics Examination (AIME) 2025 problems.

Published: 2025
Score Range: 0-100
Top Score: 99.2

AIME-2025 Leaderboard

RankModelProviderScoreParametersReleasedType
1Nemotron 3 NanoNVIDIA
99.2
31.6B (Total), ~3.2B (Active)2025-12-15Text
2GPT-OSS-20BOpenAI
98.7
21B total (3.6B active per token)2025-08-05Text
3GPT-OSS-120BOpenAI
97.9
117B total (5.1B active per token)2025-08-05Text
4GLM-4.7Z.ai
95.7
Unreleased2025-12-22Text
5Gemini 3 ProGoogle
95
Proprietary2025-11-18Multimodal
6Kimi K2Moonshot AI
94.5
1T total, 32B activated2025-07-11Text
7Grok 3xAI
93.3
Unknown (multi-trillion estimated)2025-02-19Multimodal
8o4-miniOpenAI
92.7
2025-04-16Multimodal
9Grok 4xAI
91.7
Unknown2025-07-09Multimodal
10Grok 3 MinixAI
90.8
Unknown2025-02-19Multimodal

About AIME-2025

Methodology

AIME-2025 evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.

Publication

This benchmark was published in 2025.Technical Paper

Related Benchmarks