Code Generation | 2023 | Python

HumanEval+ Extended Version

HumanEval extended with 80x more test cases, designed to test code robustness and edge-case handling.
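As a rough illustration of why a larger test suite matters, the sketch below shows a solution that passes a small base test set but fails once edge cases are added. Everything here is hypothetical: `buggy_median`, `BASE_TESTS`, `EXTENDED_TESTS`, and `passes_all` are made-up names for illustration, not problems or harness code taken from HumanEval+.

```python
from typing import Callable, Sequence

def buggy_median(xs: Sequence[float]) -> float:
    """Hypothetical generated solution: ignores the even-length case."""
    s = sorted(xs)
    return s[len(s) // 2]

# (input, expected) pairs. The base set is small and only covers odd lengths.
BASE_TESTS = [([3, 1, 2], 2), ([5], 5)]

# The extended set (an assumption for this sketch) adds even-length inputs.
EXTENDED_TESTS = BASE_TESTS + [([1, 2, 3, 4], 2.5), ([-1, 1], 0)]

def passes_all(fn: Callable, tests) -> bool:
    """Return True only if fn matches the expected output on every test."""
    return all(fn(inp) == expected for inp, expected in tests)

base_ok = passes_all(buggy_median, BASE_TESTS)          # True
extended_ok = passes_all(buggy_median, EXTENDED_TESTS)  # False
print(base_ok, extended_ok)
```

The same solution counts as correct under the base tests and incorrect under the extended ones, which is the kind of gap an extended test suite is meant to expose.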

No benchmark results indexed for this dataset yet.



HumanEval+ Benchmark - Code Generation | CodeSOTA