Codesota · Registry log11,851 rows · 2707 new this monthShowing 14
Editorial · Registry log
Every score we've added, in order.
The append-only public ledger of every benchmark result on Codesota. When a row was written, when the result itself is dated, who the model was, what value was claimed, and where the citation lives. New-SOTA rows are marked in colour; unverified rows still show, but labelled.
This is the audit trail. If a score is wrong, this is where the error will be visible; if a source is missing, this is where you'll see the gap.
2026-05-26 · 1 row
- 13:27Gemini 3.1 ProLiveCodeBench Pro2887.00NEW SOTA+448.00source ↗· verified
2026-05-20 · 13 rows
- 16:00Kimi K2.6HLE54.0%NEW SOTA+6.00source ↗· unverified
- 16:00MiMo-V2.5-ProHLE48.0%NEW SOTA+9.70source ↗· unverified
- 16:00Wav2vec2-base-960hVoxPopuli32.5%NEW SOTA+2.39source ↗· unverified
- 16:00Wav2vec2-base-960hSPGISpeech27.6%NEW SOTA+1.35source ↗· unverified
- 16:00Mms-1b-fl102SPGISpeech26.2%NEW SOTA+0.75source ↗· unverified
- 16:00Mms-1b-fl102TED-LIUM32.4%NEW SOTA+11.30source ↗· unverified
- 16:00Wav2vec2-base-960hTED-LIUM21.1%NEW SOTA+1.56source ↗· unverified
- 16:00Mms-1b-fl102AMI-IHM86.8%NEW SOTA+39.51source ↗· unverified
- 16:00Data2vec-audio-base-960hTED-LIUM19.5%NEW SOTA+0.64source ↗· unverified
- 16:00wav2vec 2.0 Large (960h)TED-LIUM18.9%NEW SOTA+1.37source ↗· unverified
- 16:00wav2vec 2.0 Large (960h)VoxPopuli30.1%NEW SOTA+6.23source ↗· unverified
- 16:00Data2vec-audio-base-960hSPGISpeech25.5%NEW SOTA+2.64source ↗· unverified
- 16:00Stt_en_fastconformer_ctc_largeOpen ASR Leaderboard6399.25NEW SOTA+1054.11source ↗· unverified
Showing the 200 most-recent rows. To inspect a single dataset’s history, append ?dataset=ID (e.g. /log?dataset=mmmu). Delta compares each row to the prior-best value on the same dataset at the moment this row was added. Hidden datasets and hidden models are not shown.