Codesota · Tasks · Speech TranslationHome/Tasks/Speech/Speech Translation

Speech Translation.

Translating spoken audio directly to another language.

Datasets

Results

bleu

Canonical metric

§ 02 · Canonical benchmark

The reference dataset.

MuST-C En-De tst-COMMON

Multilingual Speech Translation Corpus built from TED talks. The English-German tst-COMMON split is the de-facto benchmark for end-to-end speech translation. BLEU on tst-COMMON is the primary metric.

Primary metric: bleu

View full leaderboard →

§ 03 · Top 10

Leading models.

Leading models on MuST-C En-De tst-COMMON.

#	Model	bleu	Year	Source
★	SeamlessM4T v2 Large	37.1	2026	paper ↗
2	Whisper Large v2	29.0	2026	paper ↗
3	Fairseq S2T (MuST-C)	22.7	2026	paper ↗

What were you looking for on Speech Translation?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

1 dataset tracked for this task.

MuST-C En-De tst-COMMON

CANONICAL

3 results · bleu

Top: SeamlessM4T v2 Large — 37.1

§ 05 · Related tasks

Other tasks in Speech.

Speaker Verification Speech Enhancement Speech Recognition

Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Speech Translation? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.