Code Generation2024en

LiveCodeBench

Contamination-free coding benchmark collecting new problems from LeetCode, AtCoder, and CodeForces after model knowledge cutoffs. Updated continuously with fresh problems. Primary metric is pass@1 on the full test set.

Samples:400
Metrics:pass@1
Paper / Website
Current State of the Art

DeepSeek R1-0528

DeepSeek

73.3

pass@1

pass@1 Progress Over Time

Showing 4 breakthroughs from Sep 2024 to May 2025

27.239.852.364.977.5Sep 2024Nov 2024Feb 2025May 2025pass@1Date

Key Milestones

Sep 2024
Qwen2.5-Coder-32B-Instruct

Qwen2.5-Coder tech report Table 16, earlier window

31.4
Jan 2025
DeepSeek-R1

LCB window Aug 2024–Jan 2025, pass@1-COT

65.9
+109.9%
May 2025
Qwen3-235B-A22B

Qwen3 tech report, LCB v5

70.7
+7.3%
May 2025
DeepSeek R1-0528Current SOTA

LCB window Aug 2024–May 2025, pass@1-COT

73.3
+3.7%
Total Improvement
133.4%
Time Span
9m
Breakthroughs
4
Current SOTA
73.3

Top Models Performance Comparison

Top 10 models ranked by pass@1

pass@11DeepSeek R1-052873.3100.0%2o4-mini72.899.3%3Qwen3-235B-A22B70.796.5%4o3-mini66.991.3%5DeepSeek-R165.989.9%6o365.389.1%7DeepSeek-R1-Distill-Llama...65.288.9%8Kimi k1.562.585.3%9DeepSeek-R1-Distill-Qwen-32B62.184.7%10Claude Opus 457.878.9%0%25%50%75%100%% of best
Best Score
73.3
Top Model
DeepSeek R1-0528
Models Compared
10
Score Range
15.5

pass@1Primary

#ModelScorePaper / CodeDate
1
DeepSeek R1-0528Open Source
DeepSeek
73.3May 2025
2
o4-miniAPI
OpenAI
72.8Mar 2024
3
Qwen3-235B-A22B
Alibaba
70.7May 2025
4
o3-miniAPI
OpenAI
66.9Mar 2024
5
DeepSeek-R1Open Source
DeepSeek
65.9Jan 2025
6
o3API
OpenAI
65.3Mar 2024
7
DeepSeek-R1-Distill-Llama-70BOpen Source
DeepSeek
65.2Jan 2025
8
Kimi k1.5API
Moonshot AI
62.5Jan 2025
9
DeepSeek-R1-Distill-Qwen-32BOpen Source
DeepSeek
62.1Jan 2025
10
Claude Opus 4API
Anthropic
57.8Mar 2024
11
GPT-4.1API
OpenAI
54.4Mar 2024
12
Claude Sonnet 4API
Anthropic
52.8Mar 2024
13
DeepSeek V3Open Source
DeepSeek
49.2Mar 2024
14
DeepSeek-V3-0324Open Source
DeepSeek
49.2Mar 2025
15
Qwen2.5-Coder-32B-InstructOpen Source
Alibaba
47.8Mar 2024
16
DeepSeek-Coder-V2-InstructOpen Source
DeepSeek
43.4Mar 2024
17
Llama 4 MaverickOpen Source
Meta
43.4Apr 2025
18
GPT-4oAPI
OpenAI
40.8Mar 2024
19
DeepSeek V3Open Source
DeepSeek
40.5Dec 2024
20
Gemma 3 27B IT
Google DeepMind
39Mar 2025
21
Llama 4 ScoutOpen Source
Meta
32.8Apr 2025
22
Gemma 3 12B IT
Google DeepMind
32Mar 2025
23
Qwen2.5-Coder-32B-InstructOpen Source
Alibaba
31.4Nov 2024
24
Codestral 22B
Mistral
29.5Mar 2024
25
Gemma 3 4B IT
Google DeepMind
23Mar 2025

Related Papers1

Other Code Generation Datasets