Codesota · Medical · Disease Classification · CheXpertTasks/Medical/Disease Classification
Disease Classification · benchmark dataset · 2019 · EN

CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels.

224,316 chest radiographs from 65,240 patients with 14 pathology labels. Includes uncertainty labels and expert radiologist annotations for validation set. The gold standard for chest X-ray classification.

Paper Download datasetSubmit a result
§ 01 · Leaderboard

Best published scores.

7 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.


Primary
auroc · higher is better
auroc· primary
7 rows
#ModelOrgSubmittedPaper / codeauroc
01CheXpert AUC MaximizerOSSStanfordDec 2025stanford-leaderboard93
02BioViLOSSMicrosoftDec 2025microsoft-research89.10
03CheXzeroOSSHarvard/MITDec 2025research-paper88.60
04GLoRIAOSSStanfordDec 2025research-paper88.20
05MedCLIPOSSResearchDec 2025research-paper87.80
06TorchXRayVisionOSSCohen LabDec 2025github-readme87.40
07DenseNet-121 (Chest X-ray)OSSResearchDec 2025research-paper86.50
Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.
§ 03 · Progress

1 steps
of state of the art.

Each row below marks a model that broke the previous record on auroc. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.

Higher scores win. Each subsequent entry improved upon the previous best.

SOTA line · auroc
  1. Dec 19, 2025CheXpert AUC MaximizerStanford93
Fig 3 · SOTA-setting models only. 1 entries span Dec 2025 Dec 2025.
§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result Read submission guide
What a submission needs
  • 01A public checkpoint or API endpoint
  • 02A reproduction script with frozen commit + seed
  • 03Declared evaluation environment (Python, deps)
  • 04One row per metric declared by this dataset
  • 05A contact so we can follow up on discrepancies
CheXpert — Disease Classification benchmark · Codesota | CodeSOTA