Global-MMLU-Lite

knowledgeVerified

A balanced collection of culturally sensitive and culturally agnostic MMLU tasks designed for efficient evaluation of multilingual models in 15 languages (including English).

Published: 2025
Scale: 0-100
Top Score: 86.5

Global-MMLU-Lite Leaderboard

RankModelProviderScoreParametersReleasedType
1Gemini 2.0 ProGoogle
86.5
2025-02-05Multimodal
2Gemini 2.0 FlashGoogle
83.4
2025-02-25Multimodal
3Gemini 2.0 Flash-LiteGoogle
78.2
2025-02-25Multimodal

About Global-MMLU-Lite

Description

A balanced collection of culturally sensitive and culturally agnostic MMLU tasks designed for efficient evaluation of multilingual models in 15 languages (including English).

Methodology

Global-MMLU-Lite evaluates models on a scale of 0 to 100. Higher scores indicate better performance. For detailed information about the methodology, please refer to the original paper.

Publication

This benchmark was published in 2025.Read the full paper

Related Benchmarks