3 results · 3 benchmarks
Model card

Apertus-70B.

unknown
§ 02 · Benchmarks

Every benchmark Apertus-70B has a recorded score for.

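| # | Benchmark | Area · Task | Metric | Value | Rank | Date | Source |
|---|---|---|---|---|---|---|---|
| 01 | WinoGrande | Reasoning · Commonsense Reasoning | accuracy | 73.3% | #8/13 | | source ↗ |
| 02 | HellaSwag | Reasoning · Commonsense Reasoning | accuracy | 64.0% | #15/17 | | source ↗ |
| 03 | MMLU | Reasoning · Commonsense Reasoning | accuracy | 65.2% | #57/64 | | source ↗ |
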
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where Apertus-70B actually performs.

Reasoning · 3 benchmarks · avg rank #26.7
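
The average rank above appears to be the plain arithmetic mean of the three rank positions in the benchmark table (8, 15, and 57). The page does not state how it aggregates, so the sketch below assumes that method; the `ranks` dictionary and the `average_rank` helper are illustrative names, not part of the site.

```python
# Ranks copied from the benchmark table: (rank position, figure after the slash).
ranks = {
    "WinoGrande": (8, 13),
    "HellaSwag": (15, 17),
    "MMLU": (57, 64),
}


def average_rank(entries):
    """Arithmetic mean of the rank positions (assumed aggregation method)."""
    positions = [rank for rank, _field_size in entries.values()]
    return sum(positions) / len(positions)


avg = average_rank(ranks)
print(f"avg rank #{avg:.1f}")  # prints "avg rank #26.7"
```

A plain mean ignores how many models were scored on each benchmark, so the MMLU rank (#57/64) dominates the figure even though the WinoGrande and HellaSwag fields are much smaller.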
§ 04 · Papers

1 paper with results for Apertus-70B.

  1. 2025-09-17 · 3 results · Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

§ 06 · Sources & freshness

Where these numbers come from.

pwc-dump · 3 results
0 of 3 rows marked verified.