General Language Understanding Evaluation for masked language models
Avg Score is the reported evaluation metric for GLUE. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | DeBERTa-v3-large | verified | 91.37 | 2023 | Source ↗ | Looks wrong? |
| 02 | ALBERT-xxlarge-v2 | verified | 89.4 | 2020 | Source ↗ | Looks wrong? |
| 03 | RoBERTa-large | verified | 88.5 | 2019 | Source ↗ | Looks wrong? |