Codesota · Models · DeepSeek-Coder-V2-InstructDeepSeek9 results · 7 benchmarks
Model card

DeepSeek-Coder-V2-Instruct.

DeepSeekopen-sourceUnknown paramsMoE Transformer

236B MoE (21B active). Trained on 6T tokens. Matches/beats GPT-4o on code. June 2024.

§ 01 · Benchmarks

Every benchmark DeepSeek-Coder-V2-Instruct has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01Bugs2FixComputer Code · Bug Detectionaccuracy75.3%#3/62024-06-17source ↗
02CodeSearchNetComputer Vision · Optical Character Recognitionbleu-422.8%#3/72024-06-17source ↗
03CrossCodeEvalComputer Code · Code Completionexact-match41.3%#3/62024-06-17source ↗
04TransCoder (GeeksForGeeks)Computer Code · Code Translationcomputational-accuracy84.6%#4/72024-06-17source ↗
05MBPPComputer Code · Code Generationpass@189.4%#7/192024-06-17source ↗
06MBPPComputer Code · Code Generationpass@189.4%#7/19source ↗
07HumanEvalComputer Code · Code Generationpass@190.2%#17/422024-06-17source ↗
08HumanEvalComputer Code · Code Generationpass@190.2%#17/422024-06-01source ↗
09LiveCodeBenchComputer Code · Code Generationpass@143.4%#23/302024-03-12source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where DeepSeek-Coder-V2-Instruct actually performs.

Computer Vision
1
benchmark
avg rank #3.0
Computer Code
6
benchmarks
avg rank #10.1
§ 03 · Papers

2 papers with results for DeepSeek-Coder-V2-Instruct.

  1. 2024-06-17· Computer Code· 6 results

    DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

  2. 2024-03-12· Computer Code· 1 result

    LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

§ 04 · Related models

Other DeepSeek models scored on Codesota.

DeepSeek R1
671B MoE params · 10 results
DeepSeek-V3
7 results
DeepSeek-OCR
3 results
DeepSeek-R1-0528
3 results
DeepSeek V3.5
685B MoE params · 2 results
DeepSeek-V2.5
2 results
DeepSeek-V3.1
2 results
DeepSeek V3.2
1 result
§ 05 · Sources & freshness

Where these numbers come from.

arxiv
6
results
arxiv-2409.12186
1
result
shadow-page-humaneval
1
result
official-leaderboard
1
result
9 of 9 rows marked verified. · first result 2024-03-12, latest 2024-06-17.