Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Tasks · Automatic Speech RecognitionHome/Tasks/Audio/Automatic Speech Recognition

Automatic Speech Recognition.

Automatic Speech Recognition (ASR) is the technology that converts spoken language into written text. ASR systems process audio signals containing human speech and transcribe them into readable text format. These systems use acoustic models, language models, and often neural networks to recognize phonemes, words, and sentences from audio input. ASR is foundational for applications like voice assistants (Siri, Alexa), transcription services, voice-controlled systems, and accessibility tools for the hearing impaired.

25
Datasets
0
Results
Canonical metric
§ 02 · Canonical benchmark

The reference dataset.

Seeking canonical benchmark for this task.

Suggest one →
§ 03 · Top 10

Leading models.

Leading models across all datasets in this task.

No results yet. Be the first to contribute.

What were you looking for on Automatic Speech Recognition?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

25 datasets tracked for this task.

AMI IHM
0 results
AMI SDM1
0 results
Artie
0 results
CHiME6
0 results
CORAAL
0 results
CallHome
0 results
CoVost2 (en→zh)
0 results
Common Voice
0 results
CosyVoice3 Cross-Lingual Test Set zh to en
0 results
Earnings-22
0 results
Fleurs
0 results
Fleurs En
0 results
GigaSpeech
0 results
LibriSpeech Clean
0 results
LibriSpeech Other
0 results
MiniMax Multilingual Test Set - Chinese
0 results
Open ASR Leaderboard
0 results
SEED Seed-TTS test-zh
0 results
SPGISpeech
0 results
Switchboard
0 results
Tedlium
0 results
VoiceBench Overall
0 results
VoxPopuli
0 results
VoxPopuli En
0 results
WSJ
0 results
§ 05 · Related tasks

Other tasks in Audio.

Audio ClassificationAudio-Language ModelsText-to-speechVoice cloning
Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Automatic Speech Recognition? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.