Program Repair

Automatically fixing bugs in code.

1
Datasets
0
Results
correct-patches
Canonical metric
Canonical Benchmark

Defects4J

Standard program repair benchmark with 835 real bugs from 17 open-source Java projects. Each bug has a fix and triggering test suite. Primary metric is the number of correctly fixed bugs (plausible and correct patches).

Primary metric: correct-patches
View full leaderboard

Top 10

Leading models on Defects4J.

No results yet. Be the first to contribute.

What were you looking for on Program Repair?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

All datasets

1 dataset tracked for this task.

Related tasks

Other tasks in Computer Code.

Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Program Repair? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.