Pass@1 is the reported evaluation metric for HumanEval+. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
Trust tiers for pass@1verifiedpapervendorcommunityunverified
Rank
Model
Trust
Score
Year
Source
01
Qwen2.5-Coder-32B
Qwen2.5-Coder-32B-Instruct (Alibaba, Nov 2024). HumanEval+ pass@1 87.2%. Table 16 of Qwen2.5-Coder technical report.