Codesota · Models · DeepSeek-V3.2-SpecialeDeepSeek11 results · 5 benchmarks
Model card

DeepSeek-V3.2-Speciale.

DeepSeekopen-source
§ 02 · Benchmarks

Every benchmark DeepSeek-V3.2-Speciale has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01AIME 2025Reasoning · Mathematical Reasoningaccuracy96.0%#4/22source ↗
02LiveCodeBenchComputer Code · Code Generationpass-188.7%#4/24source ↗
03PLCCNatural Language Processing · Polish Cultural Competencygrammar84.0%#13/165source ↗
04PLCCNatural Language Processing · Polish Cultural Competencygeography94.0%#14/165source ↗
05HLEReasoning · Multi-step Reasoningaccuracy30.6%#15/74source ↗
06PLCCNatural Language Processing · Polish Cultural Competencyhistory90.0%#16/165source ↗
07GPQA DiamondReasoning · Multi-step Reasoningaccuracy85.7%#18/74source ↗
08PLCCNatural Language Processing · Polish Cultural Competencyaverage81.0%#29/165source ↗
09PLCCNatural Language Processing · Polish Cultural Competencyart-and-entertainment71.0%#38/165source ↗
10PLCCNatural Language Processing · Polish Cultural Competencyvocabulary71.0%#42/165source ↗
11PLCCNatural Language Processing · Polish Cultural Competencyculture-and-tradition76.0%#46/165source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where DeepSeek-V3.2-Speciale actually performs.

Computer Code
1
benchmark
avg rank #4.0
Reasoning
3
benchmarks
avg rank #12.3
Natural Language Processing
1
benchmark
avg rank #28.3
§ 04 · Papers

1 paper with results for DeepSeek-V3.2-Speciale.

  1. 2025-12-02· 4 results

    DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

§ 05 · Related models

Other DeepSeek models scored on Codesota.

DeepSeek-V4-Pro Max
4 results · 1 SOTA
DeepSeek R1
671B MoE params · 10 results
DeepSeek-V3
7 results
DeepSeek-V3.2
6 results
DeepSeek-Coder-V2-Instruct
Unknown params · 4 results
DeepSeek-OCR
4 results
DeepSeek-V4-Flash Max
4 results
DeepSeek V3.5
685B MoE params · 2 results
§ 06 · Sources & freshness

Where these numbers come from.

sdadas/PLCC
7
results
pwc-dump
4
results
7 of 11 rows marked verified.