MMLU-Pro

knowledge | Verified

MMLU-Pro is an enhanced version of the MMLU benchmark with over 12,000 challenging questions across 14 domains: Biology, Business, Chemistry, Computer Science, Economics, Engineering, Health, History, Law, Math, Philosophy, Physics, Psychology, and Other. Each question has up to 10 answer choices (vs. 4 in MMLU), and the benchmark focuses on complex reasoning tasks.

Published: 2024
Scale: 0-100
Top Score: 84

MMLU-Pro Leaderboard

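| Rank | Model | Provider | Score | Parameters | Released | Type |
|------|-------|----------|-------|------------|----------|------|
| 1 | DeepSeek-R1 | DeepSeek | 84 | 671B total, 37B activated | 2025-01-20 | Text |
| 2 | Gemini 2.0 Pro | Google | 79.1 | | 2025-02-05 | Multimodal |
| 3 | Gemini 2.0 Flash | Google | 77.6 | | 2025-02-25 | Multimodal |
| 4 | DeepSeek-V3 | DeepSeek | 75.9 | 671B total, 37B activated | 2024-12-26 | Text |
| 5 | Gemini 2.0 Flash-Lite | Google | 71.6 | | 2025-02-25 | Multimodal |
| 6 | Qwen-2 | Alibaba | 55.6 | 72B | 2024-06-11 | Text |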

About MMLU-Pro

Methodology

MMLU-Pro scores range from 0 to 100, and higher scores indicate better performance. For details on the methodology, refer to the original paper.
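The page does not spell out how the 0 to 100 score is computed. As a minimal sketch, assuming the score is plain percentage accuracy over multiple-choice questions, it could be calculated as below. The function names `accuracy_score` and `chance_baseline` are hypothetical and chosen for illustration only; they are not part of any official evaluation harness.

```python
from typing import Sequence


def accuracy_score(predictions: Sequence[str], answers: Sequence[str]) -> float:
    """Return the percentage (0-100) of predictions that match the answer key.

    Assumption: each prediction and answer is a single option letter such as "A".
    """
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("at least one question is required")
    correct = sum(
        p.strip().upper() == a.strip().upper()
        for p, a in zip(predictions, answers)
    )
    return 100.0 * correct / len(answers)


def chance_baseline(num_choices: int) -> float:
    """Expected score (0-100) from uniform random guessing among num_choices options."""
    if num_choices < 1:
        raise ValueError("num_choices must be positive")
    return 100.0 / num_choices


# Toy data, invented for illustration: three of four predictions are correct.
answers = ["A", "J", "C", "F"]
predictions = ["A", "B", "C", "F"]
print(accuracy_score(predictions, answers))

# Random guessing on a 10-option question vs. a 4-option question.
print(chance_baseline(10), chance_baseline(4))
```

Under this assumption, moving from 4 to 10 options lowers the random-guessing floor from 25 to 10 points, which is one reason scores on the two benchmarks are not directly comparable.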

Publication

This benchmark was published in 2024.

Related Benchmarks