Codeforces

coding

Evaluates models on competitive programming problems from the Codeforces platform.

Published: 2023
Scale: 0-3000
Top Score: 2,719

Codeforces Leaderboard

RankModelProviderScoreParametersReleasedType
1o4-miniOpenAI
2,719
2025-04-16Multimodal
2o3OpenAI
2,706
2025-04-16Multimodal
3Qwen-3Alibaba
2,056
235B (22B active)2025-04-29Text
4DeepSeek-R1DeepSeek
2,029
671B (37B activated)2025-01-20Text
5o1OpenAI
1,673
2024-09-12Multimodal
6o1-miniOpenAI
1,650
2024-09-12Text
7o1-previewOpenAI
1,258
2024-09-12Text
8GPT-4oOpenAI
900
2024-05-13Multimodal
9DeepSeek-V3DeepSeek
51.6
671B total, 37B activated2024-12-26Text

About Codeforces

Description

Evaluates models on competitive programming problems from the Codeforces platform.

Methodology

Codeforces evaluates models on a scale of 0 to 3000. Higher scores indicate better performance. For detailed information about the methodology, please refer to the original paper.

Publication

This benchmark was published in 2023.Read the full paper

Related Benchmarks