MathVista

multimodal

Evaluates mathematical reasoning in visual contexts, combining vision and mathematical problem-solving.

Published: 2023
Scale: 0-100
Top Score: 86.8

MathVista Leaderboard

Rank  Model    Provider  Score  Parameters  Released    Type
1     o3       OpenAI    86.8   -           2025-04-16  Multimodal
2     o4-mini  OpenAI    84.3   -           2025-04-16  Multimodal
3     o1       OpenAI    73.9   -           2024-09-12  Multimodal

About MathVista

Methodology

MathVista scores models on a scale of 0 to 100, where higher scores indicate better performance. For details of the methodology, refer to the original paper.
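
One way to read a 0 to 100 score is as the percentage of problems answered correctly. The sketch below illustrates that reading only. It assumes exact-match scoring after trimming whitespace, which this page does not confirm, and the function name `score_percent` and the toy answers are hypothetical, not taken from MathVista.

```python
def score_percent(predictions, references):
    """Return the share of exact-match answers on a 0-100 scale.

    Assumption: an answer counts as correct only if it equals the
    reference string after trimming surrounding whitespace.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("no items to score")
    correct = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return round(100.0 * correct / len(references), 1)


# Hypothetical toy data, not actual benchmark items.
preds = ["4", "B", "12.5", "A"]
golds = ["4", "C", "12.5", "A"]
print(score_percent(preds, golds))  # 3 of 4 correct -> 75.0
```

Under this reading, a score of 86.8 would correspond to roughly 87 of every 100 problems answered correctly. The original paper remains the authority on how answers are actually extracted and matched.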

Publication

This benchmark was published in 2023.

Related Benchmarks