MathVista

multimodalPending Verification

Evaluates mathematical reasoning in visual contexts, combining vision and mathematical problem-solving.

Published: 2023
Score Range: 0-100
Top Score: 86.8

MathVista Leaderboard

RankModelProviderScoreParametersReleasedType
1o3OpenAI
86.8
2025-04-16Multimodal
2o4-miniOpenAI
84.3
2025-04-16Multimodal
3o1OpenAI
73.9
2024-09-12Multimodal
4GPT-4.1OpenAI
72.2
2025-04-14Multimodal
5Grok-2xAI
69
Unknown2024-08-13Multimodal
6Grok-2 minixAI
68.1
Unknown2024-08-13Multimodal

About MathVista

Methodology

MathVista evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.

Publication

This benchmark was published in 2023.Technical Paper

Related Benchmarks