Codesota · Papers453 papers · 15 shown
Editorial · Papers
Papers, with their scores.
Every paper on this page has at least one benchmark result we've recorded against it — title, abstract, code link, paper link, and a direct cross-link to the leaderboard position that score earned. Sorted by the most recently-updated benchmark result first.
This is the Codesota replacement for the Papers With Code discovery feed. If a paper has no verified benchmark result yet it won't appear here — that's the point.
§ 01 · Computer Code papers
15 papers in Computer Code.
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao et al.arxiv ↗no code link
Missing a paper? Submit it with its benchmark scores and we'll add it. Papers without any verified benchmark results are not shown — this is intentional.