Code Generation | 2023 | python
HumanEval+ Extended Version
Extended HumanEval with 80x more test cases. Tests code robustness and edge-case handling.
No benchmark results indexed for this dataset yet.
Contribute results on GitHub