PIQA

reasoning, Pending Human Review

Physical Interaction Question Answering.

Published: 2019
Score Range: 0-100
Top Score: 84.7

PIQA Leaderboard

Rank  Model        Provider  Score  Parameters                 Released    Type
1     DeepSeek-V3  DeepSeek  84.7   671B total, 37B activated  2024-12-26  Text

About PIQA

Methodology

PIQA scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For details of the scoring system and methodology, refer to the original paper.
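As a rough illustration of how a score on a 0 to 100 scale could arise, the sketch below assumes the score is plain accuracy expressed as a percentage. This page does not confirm the metric, and the function name `accuracy_score_0_100` and the toy data are hypothetical.

```python
def accuracy_score_0_100(predictions, labels):
    """Return the percentage of predictions that match the labels (0 to 100)."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("at least one example is required")
    # Count positions where the predicted choice equals the gold choice.
    correct = sum(p == y for p, y in zip(predictions, labels))
    return 100.0 * correct / len(labels)


# Toy data (made up for illustration): each entry is a choice index.
toy_labels = [0, 1, 1, 0]
toy_predictions = [0, 1, 0, 0]
score = accuracy_score_0_100(toy_predictions, toy_labels)
print(score)  # 3 of 4 correct -> 75.0
```

Under this assumption, a score of 84.7 would correspond to answering 84.7% of the evaluated questions correctly.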

Publication

This benchmark was published in 2019.

Related Benchmarks