Unknown
xnli is a state-of-the-art machine learning benchmark indexed on Codesota. This page tracks published model results, top scores per metric, and the SOTA timeline for xnli.
Only 3 models on this benchmark
Help build the community leaderboard — submit your model results.
Higher is better
| Rank | Model | Source | Score | Year | Paper |
|---|---|---|---|---|---|
| 1 | GPT-4 GPT-4 average XNLI accuracy (15 languages). From GPT-4 Technical Report and cross-lingual evaluation studies. | Community | 87.4 | 2023 | Source |
| 2 | XLM-RoBERTa-large XLM-RoBERTa-large XNLI avg accuracy (15 languages). State-of-the-art at publication. From Table 3. | Community | 83.6 | 2019 | Source |
| 3 | mDeBERTa-v3-base mDeBERTa-v3-base average XNLI accuracy across 15 languages. From HF model card. | Community | 80.8 | 2022 | Source |