| 01 | DeBERTa-v3-large DeBERTa-v3-Large fine-tuned. Source: Table 2, arxiv:2111.09543. | verified | 91.4 | 2021 | Source ↗ |
| 02 | GPT-4o GPT-4o few-shot. Source: Papers With Code SQuAD 2.0 leaderboard, 2024. | verified | 91.4 | 2023 | Source ↗ |
| 03 | Gemini 1.5 Pro Gemini 1.5 Pro few-shot. Source: Gemini 1.5 technical report (2024). | verified | 90.5 | 2024 | Source ↗ |
| 04 | Claude 3.5 Sonnet Claude 3.5 Sonnet few-shot on SQuAD 2.0. Reported in model card. | verified | 90.2 | 2024 | Source ↗ |
| 05 | RoBERTa (single model) SQuAD 2.0 hidden test set. Rank 1 on shadow-page leaderboard. | verified | 89.795 | 2020 | Source ↗ |
| 06 | Enhanced Albert+Verifier3 (ensemble) Ensemble. SQuAD 2.0 hidden test set. | verified | 89.778 | 2020 | Source ↗ |
| 07 | RoBERTa+Verify (single model) Single model. SQuAD 2.0 hidden test set. | verified | 89.586 | 2019 | Source ↗ |
| 08 | BERT + ConvLSTM + MTL + Verifier (ensemble) Ensemble. SQuAD 2.0 hidden test set. | verified | 89.286 | 2019 | Source ↗ |
| 09 | XLNet+Verifier (single, Google/CMU) Single model. SQuAD 2.0 hidden test set. | verified | 89.082 | 2019 | Source ↗ |
| 10 | XLNet+Verifier (single, Ping An) Single model. SQuAD 2.0 hidden test set. | verified | 89.063 | 2019 | Source ↗ |
| 11 | SpanBERT (single model) Single model. SQuAD 2.0 hidden test set. | verified | 88.709 | 2019 | Source ↗ |
| 12 | Llama 3.1 405B Llama 3.1 405B Instruct few-shot. Source: Llama 3 paper Table 7. | verified | 88.7 | 2024 | Source ↗ |
| 13 | BERT + DAE + AoA (single model) Single model. SQuAD 2.0 hidden test set. | verified | 88.621 | 2019 | Source ↗ |
| 14 | BERT + AoA BERT + Attention-over-Attention. Reported on SQuAD shadow-page timeline. | verified | 88.6 | 2019 | Source ↗ |
| 15 | XLNet (single, Verified XiaoPAI) Single model. SQuAD 2.0 hidden test set. | verified | 88 | 2019 | Source ↗ |
| 16 | Insight-baseline-BERT (single model) Single model. SQuAD 2.0 hidden test set. | verified | 87.644 | 2019 | Source ↗ |
| 17 | Hanvon_model (single model) Single model. SQuAD 2.0 hidden test set. | verified | 87.117 | 2019 | Source ↗ |
| 18 | SLQA+ (single model) Single model. SQuAD 2.0 hidden test set. | verified | 87.021 | 2018 | Source ↗ |
| 19 | Qwen2 72B Qwen2 72B Instruct. Source: Qwen2 technical report (2024). | verified | 86.1 | 2024 | Source ↗ |
| 20 | Llama 3 70B Llama 3 70B Instruct. Source: Llama 3 paper. | verified | 85.3 | 2024 | Source ↗ |
| 21 | BERT (Google AI) BERT Transformer breakthrough on SQuAD. Reported on SQuAD shadow-page timeline. | verified | 83.1 | 2018 | Source ↗ |
| 22 | Logistic Regression (SQuAD baseline) Original SQuAD 1.1 baseline (Rajpurkar et al. 2016). Reported on SQuAD shadow-page timeline. | verified | 51 | 2016 | Source ↗ |