LibriSpeech test-clean is a classic ASR benchmark, but it no longer separates the strongest speech-to-text systems by itself. Current frontier systems cluster around very low WER on clean audiobook speech, so CodeSOTA uses the broader HF Open ASR Leaderboard as the headline ranking and treats LibriSpeech as one diagnostic slice.
Use LibriSpeech when your workload is clean, read English audio. For meetings, calls, accents, long-form podcasts, or noisy streaming, check AMI, Earnings-22, GigaSpeech, TED-LIUM, VoxPopuli and latency features before choosing a model.
LibriSpeech test-clean · 34 systems · ranked low-to-high
LibriSpeech test-clean WER from the HF Open ASR Leaderboard. Lower is better; the frontier is saturated near 1.3%, so treat sub-2% gaps as noise and rank on the eight-dataset mean WER above. Cloud-API rows show vendor- or AA-reported figures.