Codesota · Tasks · Speech Translation

Speech Translation.

Translating spoken audio directly to another language.

Datasets: 1 · Results: 3 · Canonical metric: BLEU
§ 02 · Canonical benchmark

The reference dataset.

MuST-C En-De tst-COMMON

MuST-C is a multilingual speech translation corpus built from TED talks. Its English-German tst-COMMON split is the de facto benchmark for end-to-end speech translation, and BLEU on that split is the primary metric.

Primary metric: bleu
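
For readers unfamiliar with the metric, here is a minimal sketch of corpus-level BLEU: the geometric mean of clipped 1- to 4-gram precisions, multiplied by a brevity penalty. It assumes whitespace tokenization, one reference per hypothesis, and no smoothing. The function names `ngrams` and `corpus_bleu` are illustrative, not part of any leaderboard tooling. Published scores are normally computed with a standardized tool such as SacreBLEU, whose tokenization differs, so this sketch will not reproduce the numbers reported on this page.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Corpus-level BLEU on a 0-100 scale, one reference per hypothesis.

    Assumption: plain whitespace tokenization and no smoothing.
    """
    matches = [0] * max_n  # clipped n-gram matches, per n-gram order
    totals = [0] * max_n   # hypothesis n-gram counts, per n-gram order
    hyp_len = ref_len = 0

    for hyp, ref in zip(hypotheses, references):
        hyp_tok, ref_tok = hyp.split(), ref.split()
        hyp_len += len(hyp_tok)
        ref_len += len(ref_tok)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp_tok, n)
            ref_counts = ngrams(ref_tok, n)
            # Counter intersection keeps the minimum count (clipping).
            matches[n - 1] += sum((hyp_counts & ref_counts).values())
            totals[n - 1] += sum(hyp_counts.values())

    # Without smoothing, any order with zero matches gives BLEU = 0.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0

    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Penalize hypotheses that are shorter than the references overall.
    brevity_penalty = 1.0 if hyp_len >= ref_len else math.exp(1 - ref_len / hyp_len)
    return 100 * brevity_penalty * math.exp(log_precision)


# Hypothetical system output and reference translation, for illustration only.
hyps = ["das ist ein kleiner Test"]
refs = ["das ist ein kleiner Test heute"]
print(round(corpus_bleu(hyps, refs), 1))
```

Because the statistics are pooled over the whole test set before the precisions are computed, the corpus score is not the average of per-sentence scores.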
§ 03 · Top 10

Leading models.

Leading models on MuST-C En-De tst-COMMON.

#   Model                   BLEU   Year   Source
1   SeamlessM4T v2 Large    37.1   2026   paper
2   Whisper Large-v2        29.0   2026   paper
3   Fairseq S2T (MuST-C)    22.7   2026   paper


§ 04 · All datasets

Tracked datasets.

1 dataset tracked for this task.

MuST-C En-De tst-COMMON (canonical)
3 results · BLEU
Top: SeamlessM4T v2 Large, 37.1
§ 05 · Related tasks

Other tasks in Speech.

Speaker Verification · Speech Recognition