Home/Browse/wmt23

wmt23

Unknown

wmt23 is a state-of-the-art machine learning benchmark indexed on Codesota. This page tracks published model results, top scores per metric, and the SOTA timeline for wmt23.

Benchmark Stats

Models4
Papers4
Metrics1

SOTA History

Not enough data to show trend.

Only 4 models on this benchmark

Help build the community leaderboard — submit your model results.

comet

Higher is better

RankModelSourceScoreYearPaper
1GPT-4

GPT-4 COMET-22 score on WMT23 en→de test set. From WMT23 General MT findings.

Community84.12023Source
2Google Translate

Google Translate (ONLINE-B) COMET-22 on WMT23 en→de. Consistently top-tier online system.

Community83.82023Source
3DeepL

DeepL (ONLINE-W) COMET-22 on WMT23 en→de. From WMT23 findings paper.

Community83.52023Source
4NLLB-3.3B

NLLB-3.3B COMET-22 on WMT23 en→de. Open-source strong baseline.

Community81.62023Source

Submit a Result