Question answering over Wikipedia tables requiring compositional reasoning
Accuracy is the reported evaluation metric for WikiTableQuestions. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | GPT-4 | verified | 75.3 | 2024 | Source ↗ | Looks wrong? |
| 02 | Claude 3.5 Sonnet | verified | 73 | 2025 | Source ↗ | Looks wrong? |
| 03 | TAPAS-large | verified | 48.7 | 2020 | Paper ↗ | Looks wrong? |