13,610 competitive programming problems from CodeForces. ~200 private test cases per problem. 12+ programming languages.
Pass@1 is the reported evaluation metric for CodeContests. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better