Global-MMLU-Lite

knowledgeVerified

A balanced collection of culturally sensitive and culturally agnostic MMLU tasks designed for efficient evaluation of multilingual models in 15 languages (including English).

Published: 2025
Score Range: 0-100
Top Score: 86.5

Global-MMLU-Lite Leaderboard

RankModelProviderScoreParametersReleasedType
1Gemini 2.0 ProGoogle
86.5
2025-02-05Multimodal
2Gemini 2.5 Flash-LiteGoogle
84.5
2025-06-17Multimodal
3Gemini 2.0 FlashGoogle
83.4
2025-02-25Multimodal
4Gemini 2.0 Flash-LiteGoogle
78.2
2025-02-25Multimodal

About Global-MMLU-Lite

Methodology

Global-MMLU-Lite evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.

Publication

This benchmark was published in 2025.Technical Paper

Related Benchmarks