Unknown
wmt23 is a state-of-the-art machine learning benchmark indexed on Codesota. This page tracks published model results, top scores per metric, and the SOTA timeline for wmt23.
Only 4 models on this benchmark
Help build the community leaderboard — submit your model results.
Higher is better
| Rank | Model | Source | Score | Year | Paper |
|---|---|---|---|---|---|
| 1 | GPT-4 GPT-4 COMET-22 score on WMT23 en→de test set. From WMT23 General MT findings. | Community | 84.1 | 2023 | Source |
| 2 | Google Translate Google Translate (ONLINE-B) COMET-22 on WMT23 en→de. Consistently top-tier online system. | Community | 83.8 | 2023 | Source |
| 3 | DeepL DeepL (ONLINE-W) COMET-22 on WMT23 en→de. From WMT23 findings paper. | Community | 83.5 | 2023 | Source |
| 4 | NLLB-3.3B NLLB-3.3B COMET-22 on WMT23 en→de. Open-source strong baseline. | Community | 81.6 | 2023 | Source |