How many models are tracked on WMT'23?

Codesota tracks 4 models on WMT'23.

When was the WMT'23 leaderboard last updated?

The WMT'23 leaderboard on Codesota includes results through 2023.

Codesota · Natural Language Processing · Machine Translation · WMT'23Tasks/Natural Language Processing/Machine Translation

Machine Translation · benchmark dataset · 2023 · EN

WMT'23.

Name: WMT'23 Benchmark Results
Creator: Codesota
Published: 2023-01-01
License: https://creativecommons.org/licenses/by/4.0/

State-of-the-art machine translation evaluation from WMT 2023 shared task

Submit a result ↵

§ 01 · Leaderboard

Best published scores.

4 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.

Primary: bleu · higher is better

comet

4 rows

#	Model	Org	Submitted	Paper / code	comet
01	GPT-4	OpenAI	Dec 2023	arxiv	84.10
02	Google Translate	Google	Dec 2023	arxiv	83.80
03	DeepL	DeepL SE	Dec 2023	arxiv	83.50
04	NLLB-3.3BOpen	Meta AI	Dec 2023	arxiv	81.60

Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.

§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result ↵Read submission guide

What a submission needs

01A public checkpoint or API endpoint
02A reproduction script with frozen commit + seed
03Declared evaluation environment (Python, deps)
04One row per metric declared by this dataset
05A contact so we can follow up on discrepancies

WMT'23.

Best published scores.

Neighbouring benchmarks.

Have a score that beatsthis table?

Have a score that beats
this table?