LiveCodeBench-v5

codingPending Verification

Evaluates models on their ability to solve coding problems in real-time.

Published: 2024
Score Range: 0-100
Top Score: 79.4

LiveCodeBench-v5 Leaderboard

RankModelProviderScoreParametersReleasedType
1Grok 4xAI
79.4
Unknown2025-07-09Multimodal
2Gemini 2.5 ProGoogle
75.6
2025-05-06Multimodal
3Qwen-3Alibaba
70.7
235B (22B active)2025-04-29Text
4Gemini 2.5 FlashGoogle
63.9
2025-05-20Multimodal
5Grok 3xAI
57
Unknown (multi-trillion estimated)2025-02-19Multimodal
6Grok 3 MinixAI
41.5
Unknown2025-02-19Multimodal
7Gemini 2.0 ProGoogle
36
2025-02-05Multimodal
8Gemini 2.0 FlashGoogle
34.5
2025-02-25Multimodal
9Gemini 2.5 Flash-LiteGoogle
34.3
2025-06-17Multimodal
10Gemini 2.0 Flash-LiteGoogle
28.9
2025-02-25Multimodal

About LiveCodeBench-v5

Methodology

LiveCodeBench-v5 evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.

Publication

This benchmark was published in 2024.Technical Paper

Related Benchmarks