ARC

reasoning

AI2 Reasoning Challenge (ARC) tests reasoning through grade-school science questions.

Published: 2018
Scale: 0-100
Top Score: 96.4

ARC Leaderboard

RankModelProviderScoreParametersReleasedType
1Claude 3 OpusAnthropic
96.4
2024-03-04Multimodal
2Claude 3 SonnetAnthropic
93.2
2024-03-04Multimodal
3Claude 3 HaikuAnthropic
89.2
2024-03-04Multimodal

About ARC

Description

AI2 Reasoning Challenge (ARC) tests reasoning through grade-school science questions.

Methodology

ARC evaluates models on a scale of 0 to 100. Higher scores indicate better performance. For detailed information about the methodology, please refer to the original paper.

Publication

This benchmark was published in 2018.Read the full paper

Related Benchmarks