Bug Detection

Identifying bugs and vulnerabilities in code.

1
Datasets
0
Results
accuracy
Canonical metric
Canonical Benchmark

Bugs2Fix

Bug detection and repair benchmark with ~2.4M Java methods mined from GitHub commits labeled as bug fixes. Used widely to evaluate LLM bug detection capabilities. Primary metric is Accuracy (correct bug classification).

Primary metric: accuracy
View full leaderboard

Top 10

Leading models on Bugs2Fix.

No results yet. Be the first to contribute.

All datasets

1 dataset tracked for this task.

Related tasks

Other tasks in Computer Code.