DeepSeek-R1-0528 · DeepSeek · 11 results · 5 benchmarks
Model card

DeepSeek-R1-0528.

DeepSeek · open-source
§ 01 · Benchmarks

Every benchmark DeepSeek-R1-0528 has a recorded score for.

| # | Benchmark | Area · Task | Metric | Value | Rank | Date | Source |
|---|---|---|---|---|---|---|---|
| 01 | LiveCodeBench | Computer Code · Code Generation | pass@1 | 73.3% | #6/30 | | source ↗ |
| 02 | PLCC | Natural Language Processing · Polish Cultural Competency | history | 91.0% | #11/165 | | source ↗ |
| 03 | MMLU-Pro | Reasoning · Commonsense Reasoning | accuracy | 85.0% | #17/20 | 2026-04-20 | source ↗ |
| 04 | SWE-Bench Verified | Computer Code · Code Generation | resolve-rate | 57.6% | #29/39 | | source ↗ |
| 05 | PLCC | Natural Language Processing · Polish Cultural Competency | grammar | 73.0% | #39/165 | | source ↗ |
| 06 | PLCC | Natural Language Processing · Polish Cultural Competency | geography | 85.0% | #41/165 | | source ↗ |
| 07 | PLCC | Natural Language Processing · Polish Cultural Competency | average | 76.2% | #44/165 | | source ↗ |
| 08 | PLCC | Natural Language Processing · Polish Cultural Competency | vocabulary | 68.0% | #48/165 | | source ↗ |
| 09 | PLCC | Natural Language Processing · Polish Cultural Competency | art-and-entertainment | 65.0% | #49/165 | | source ↗ |
| 10 | PLCC | Natural Language Processing · Polish Cultural Competency | culture-and-tradition | 75.0% | #53/165 | | source ↗ |
| 11 | SWE-bench Verified | Agentic AI · SWE-bench | resolve-rate | 44.6% | #69/81 | | source ↗ |
The Rank column shows this model’s position among all models scored on the same benchmark and metric; the number after the slash is the number of competitors. A #1 in red marks the current SOTA. Rows are sorted by rank, then by newest result.
§ 02 · Strengths by area

Where DeepSeek-R1-0528 actually performs.

Reasoning · 1 benchmark · avg rank #17.0
Computer Code · 2 benchmarks · avg rank #17.5
Natural Language Processing · 1 benchmark · avg rank #40.7
Agentic AI · 1 benchmark · avg rank #69.0
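The page does not say how the per-area average rank is computed. The sketch below assumes it is the arithmetic mean of the ranks of every row in that area, rounded to one decimal. The (area, rank) pairs are copied from the benchmark table in § 01; the names `ROWS` and `average_rank_by_area` are hypothetical and exist only for this illustration.

```python
from collections import defaultdict

# (area, rank) for each of the 11 rows in the § 01 benchmark table.
ROWS = [
    ("Computer Code", 6),
    ("Natural Language Processing", 11),
    ("Reasoning", 17),
    ("Computer Code", 29),
    ("Natural Language Processing", 39),
    ("Natural Language Processing", 41),
    ("Natural Language Processing", 44),
    ("Natural Language Processing", 48),
    ("Natural Language Processing", 49),
    ("Natural Language Processing", 53),
    ("Agentic AI", 69),
]


def average_rank_by_area(rows):
    """Group ranks by area and return the mean rank per area (1 decimal).

    Assumption: the site averages over rows, not over distinct benchmarks.
    """
    ranks = defaultdict(list)
    for area, rank in rows:
        ranks[area].append(rank)
    return {area: round(sum(r) / len(r), 1) for area, r in ranks.items()}


AVERAGES = average_rank_by_area(ROWS)

# Print areas from best (lowest) to worst average rank.
for area, avg in sorted(AVERAGES.items(), key=lambda kv: kv[1]):
    print(f"{area}: avg rank #{avg:.1f}")
```

Under this assumption the results match the figures listed for all four areas. The Natural Language Processing value averages the seven PLCC rows, even though they count as a single benchmark.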
§ 04 · Related models

Other DeepSeek models scored on Codesota.

DeepSeek R1 · 671B MoE params · 10 results
DeepSeek-V3 · 7 results
DeepSeek-Coder-V2-Instruct · Unknown params · 4 results
DeepSeek-OCR · 3 results
DeepSeek V3.5 · 685B MoE params · 2 results
DeepSeek-V2.5 · 2 results
DeepSeek-V3.1 · 2 results
DeepSeek V3.2 · 1 result
§ 05 · Sources & freshness

Where these numbers come from.

sdadas/PLCC · 7 results
deepseek-model-card · 1 result
llm-stats · 1 result
deepseek-blog · 1 result
editorial · 1 result
9 of 11 rows marked verified.