See historical SOTA progress on classic benchmarks. Compare how different models performed on the same datasets. 1,500+ results across 140+ datasets from the Papers With Code archive.
Data source: paperswithcode/paperswithcode-data
324 results tracked
289 results tracked
178 results tracked
156 results tracked
143 results tracked
112 results tracked
98 results tracked
87 results tracked
65 results tracked
48 results tracked
This data is sourced from the Papers With Code open dataset. It includes historical benchmark results from published papers, allowing you to track how model performance has improved over time on standard academic benchmarks.