Bug Detection
Identifying bugs and vulnerabilities in code.
1
Datasets
0
Results
accuracy
Canonical metric
Canonical Benchmark
Bugs2Fix
Bug detection and repair benchmark with ~2.4M Java methods mined from GitHub commits labeled as bug fixes. Used widely to evaluate LLM bug detection capabilities. Primary metric is Accuracy (correct bug classification).
Primary metric: accuracy
Top 10
Leading models on Bugs2Fix.
No results yet. Be the first to contribute.
All datasets
1 dataset tracked for this task.
Related tasks
Other tasks in Computer Code.