Music generation evaluated on 5.5K expert-annotated music clips
Fad is the reported evaluation metric for MusicCaps. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Fix |
|---|---|---|---|---|---|---|
| 01 | MusicGen-Medium | verified | 4.89 | 2023 | Source ↗ | Looks wrong? |
| 02 | AudioLDM 2-MSD | verified | 4.47 | 2024 | Source ↗ | Looks wrong? |
| 03 | MusicLM | verified | 4.00 | 2023 | Source ↗ | Looks wrong? |
| 04 | MusicGen Large | paper | 3.80 | 2026 | Source ↗ | Looks wrong? |
| 05 | AudioLDM-M | verified | 3.20 | 2023 | Source ↗ | Looks wrong? |
| 06 | AudioLDM 2-Full | verified | 3.13 | 2024 | Source ↗ | Looks wrong? |
| 07 | Noise2Music | paper | 2.13 | 2026 | Source ↗ | Looks wrong? |