CommonsenseQA

Unknown

12,247 multiple choice questions requiring commonsense reasoning about everyday concepts.

Benchmark Stats

Models3
Papers3
Metrics1

SOTA History

Not enough data to show trend.

Only 3 models on this benchmark

Help build the community leaderboard — submit your model results.

accuracy

accuracy

Higher is better

RankModelSourceScoreYearPaper
1gpt-4o

Commonsense reasoning QA from ConceptNet.

Editorial85.42025Source
2claude-35-sonnetEditorial83.22025Source
3llama-3-70bEditorial80.92025Source

Submit a Result