Codesota · Benchmark · AMI-IHMHome/Leaderboards/Audio & Speech/Automatic Speech Recognition/AMI-IHM
Unknown

AMI-IHM.

The AMI Meeting Corpus IHM subset consists of ~100 hours of recorded English meetings captured with individual headset microphones. Long-form spontaneous speech across overlapping speakers makes it a standard stress-test for ASR systems beyond clean read speech.

Paper Leaderboard
§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

Found a wrong score or missing run?
Use row edits to send a sourced correction into moderation.
Add / edit result Report issue

Wer

Wer is the reported evaluation metric for AMI-IHM. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Lower is better

Trust tiers for Werverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

RankModelTrustScoreYearLinksFix
01Granite Speech 4.1 2Bunverified8.092025Paper ↗Source ↗Looks wrong?
02Cohere Transcribe (Mar 2026)unverified8.132026Paper ↗Looks wrong?
03Granite Speech 3.3 2Bunverified8.902025Paper ↗Source ↗Looks wrong?
04SYMPHONY-ASRunverified9.562026Paper ↗Looks wrong?
05Qwen3-ASR-1.7Bunverified10.562026Paper ↗Code ↗Source ↗Looks wrong?
06Phi-4 Multimodal Instructunverified11.092025Paper ↗Source ↗Looks wrong?
07Parakeet-tdt-0.6b-v2unverified11.162023Paper ↗Code ↗Source ↗Looks wrong?
08Moonshine-streaming-smallunverified12.532026Paper ↗Source ↗Looks wrong?
09Distil-large-v2unverified14.672023Paper ↗Code ↗Source ↗Looks wrong?
10Distil-large-v3unverified15.162023Paper ↗Code ↗Source ↗Looks wrong?
11Owsm_ctc_v3.1_1Bunverified15.612024Paper ↗Code ↗Source ↗Looks wrong?
12Niagara-38m-batch.enunverified15.872026Paper ↗Source ↗Looks wrong?
13Parakeet-tdt_ctc-110munverified15.892023Paper ↗Code ↗Source ↗Looks wrong?
14Stt_en_conformer_ctc_largeunverified15.952020Paper ↗Code ↗Source ↗Looks wrong?
15Distil-medium.enunverified16.122023Paper ↗Code ↗Source ↗Looks wrong?
16Whisper Large v3 Turbounverified16.132022Paper ↗Code ↗Source ↗Looks wrong?
17Distil-small.enunverified16.162023Paper ↗Code ↗Source ↗Looks wrong?
18Parakeet-ctc-0.6bunverified16.462023Paper ↗Source ↗Looks wrong?
19Whisper-medium.enunverified16.682022Paper ↗Code ↗Source ↗Looks wrong?
20Whisper Largeunverified16.732022Paper ↗Code ↗Source ↗Looks wrong?
21Whisper Large v2unverified16.742022Paper ↗Code ↗Source ↗Looks wrong?
22Lite-whisper-large-v3-turbo-accunverified16.972025Paper ↗Code ↗Source ↗Looks wrong?
23Voxtral-Mini-4B-Realtime-2602unverified17.072026Paper ↗Source ↗Looks wrong?
24VibeVoice-ASR-HFunverified17.22026Paper ↗Source ↗Looks wrong?
25Parakeet-rnnt-0.6bunverified17.42023Paper ↗Source ↗Looks wrong?
26Moonshine-baseunverified17.492024Paper ↗Code ↗Source ↗Looks wrong?
27Whisper-small.enunverified17.932022Paper ↗Code ↗Source ↗Looks wrong?
28Stt_en_fastconformer_ctc_largeunverified18.612023Paper ↗Source ↗Looks wrong?
29Niagara-19m-batch.enunverified18.862026Paper ↗Looks wrong?
30Moonshine-streaming-tinyunverified19.022026Paper ↗Looks wrong?
31Stt_en_fastconformer_transducer_largeunverified19.092023Paper ↗Source ↗Looks wrong?
32Asr-conformer-loquaciousunverified19.62025Paper ↗Looks wrong?
33Lite-whisper-large-v3-fastunverified19.872025Paper ↗Code ↗Source ↗Looks wrong?
34Stt_en_conformer_ctc_smallunverified20.432020Paper ↗Code ↗Source ↗Looks wrong?
35Whisper-base.enunverified21.132022Paper ↗Code ↗Source ↗Looks wrong?
36Moonshine-tinyunverified22.842024Paper ↗Code ↗Source ↗Looks wrong?
37Whisper-tiny.enunverified24.242022Paper ↗Code ↗Source ↗Looks wrong?
38Asr-wav2vec2-librispeechunverified32.052021Paper ↗Code ↗Source ↗Looks wrong?
39Wav2vec2-large-960h-lv60-selfunverified36.772020Paper ↗Code ↗Source ↗Looks wrong?
40Wav2vec2-large-robust-ft-libri-960hunverified37.752021Paper ↗Code ↗Source ↗Looks wrong?
41Hubert-xlarge-ls960-ftunverified39.112021Paper ↗Code ↗Source ↗Looks wrong?
42Hubert-large-ls960-ftunverified39.722021Paper ↗Code ↗Source ↗Looks wrong?
43Data2vec-audio-large-960hunverified40.512022Paper ↗Code ↗Source ↗Looks wrong?
44Mms-1b-allunverified42.022023Paper ↗Code ↗Source ↗Looks wrong?
45Wav2vec2-conformer-rel-pos-large-960h-ftunverified42.392020Paper ↗Code ↗Source ↗Looks wrong?
46Wav2vec2-conformer-rope-large-960h-ftunverified42.472020Paper ↗Code ↗Source ↗Looks wrong?
47wav2vec 2.0 Large (960h)unverified42.662020Paper ↗Code ↗Source ↗Looks wrong?
48Wav2vec2-base-960hunverified45.562020Paper ↗Code ↗Source ↗Looks wrong?
49Data2vec-audio-base-960hunverified47.272022Paper ↗Code ↗Source ↗Looks wrong?
50Mms-1b-fl102unverified86.782023Paper ↗Code ↗Source ↗Looks wrong?
§ 04 · Submit a result

Add to the leaderboard.

← Back to Automatic Speech Recognition
AMI-IHM Leaderboard | CodeSOTA | CodeSOTA