Codesota · Tasks · Automatic Speech RecognitionHome/Tasks/Audio/Automatic Speech Recognition

Automatic Speech Recognition.

Automatic Speech Recognition (ASR) is the technology that converts spoken language into written text. ASR systems process audio signals containing human speech and transcribe them into readable text format. These systems use acoustic models, language models, and often neural networks to recognize phonemes, words, and sentences from audio input. ASR is foundational for applications like voice assistants (Siri, Alexa), transcription services, voice-controlled systems, and accessibility tools for the hearing impaired.

Datasets

Results

—

Canonical metric

§ 02 · Canonical benchmark

The reference dataset.

Seeking canonical benchmark for this task.

Suggest one →

§ 03 · Top 10

Leading models.

Leading models across all datasets in this task.

No results yet. Be the first to contribute.

What were you looking for on Automatic Speech Recognition?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

25 datasets tracked for this task.

Automatic Speech Recognition.

The reference dataset.

Leading models.

What were you looking for on Automatic Speech Recognition?

Tracked datasets.

Other tasks in Audio.

Didn't find what you came for?