How many models are tracked on CodeSearchNet?

Codesota tracks 14 models on CodeSearchNet across 2 metrics.

When was the CodeSearchNet leaderboard last updated?

The CodeSearchNet leaderboard on Codesota includes results through 2026, with the earliest tracked result from 2020.

Codesota · Computer Vision · Optical Character Recognition · CodeSearchNetTasks/Computer Vision/Optical Character Recognition

Optical Character Recognition · benchmark dataset · 2020 · EN

CodeSearchNet.

Name: CodeSearchNet Benchmark Results
Creator: Codesota
Published: 2020-01-01
License: https://creativecommons.org/licenses/by/4.0/

Benchmark for code summarization (docstring generation) across 6 programming languages: Python, Java, JavaScript, PHP, Ruby, Go. Over 2M (code, docstring) pairs. Primary metric is BLEU-4.

Submit a result ↵

§ 01 · Leaderboard

Best published scores.

14 results indexed across 2 metrics. Shaded row marks current SOTA; ties broken by submission date.

Primary: accuracy · higher is better
All metrics: bleu-4, smoothed-bleu-4

bleu-4

7 rows

#	Model	Org	Submitted	Paper / code	bleu-4
01	GPT-4oAPI	OpenAI	Mar 2026	arxiv	25.30
02	Qwen2.5-Coder 32BOpen	Alibaba	Sep 2024	Qwen2.5-Coder Technical Report · code	23.40
03	DeepSeek-Coder-V2-InstructOpen	DeepSeek	Jun 2024	DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source… · code	22.80
04	CodeT5+ 2BOpen	Salesforce	May 2023	CodeT5+: Open Code Large Language Models for Code Unders… · code	21.36
05	CodeT5+Open	Salesforce	May 2023	CodeT5+: Open Code Large Language Models for Code Unders… · code	20.01
06	UniXcoderOpen	Microsoft	Mar 2022	UniXcoder: Unified Cross-Modal Pre-Training for Code Rep… · code	19.06
07	CodeBERTOpen	Microsoft	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	17.65

smoothed-bleu-4

7 rows

#	Model	Org	Submitted	Paper / code	smoothed-bleu-4
01	CodeBERT (MLM+RTD)	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	15.99
02	CodeBERT (MLM)	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	15.55
03	pre-train w/ code only	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	15.15
04	CodeBERT (RTD)	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	15.03
05	RoBERTa	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	14.52
06	Transformer	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	14.31
07	seq2seq	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	13.36

Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.

§ 04 · Literature

6 papers
tied to this benchmark.

Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.

Qwen2.5-Coder Technical Report
Sep 2024·Qwen2.5-Coder 32B
arXiv ↗Code
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Jun 2024·DeepSeek-Coder-V2-Instruct
arXiv ↗Code
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
May 2023·CodeT5+ 2B, CodeT5+
arXiv ↗Code
UniXcoder: Unified Cross-Modal Pre-Training for Code Representation
Mar 2022·UniXcoder
arXiv ↗Code
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Feb 2020·CodeBERT
arXiv ↗Code
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Feb 2020·CodeBERT (MLM+RTD), CodeBERT (MLM), pre-train w/ code only +4
arXiv ↗Code

§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result ↵Read submission guide

What a submission needs

01A public checkpoint or API endpoint
02A reproduction script with frozen commit + seed
03Declared evaluation environment (Python, deps)
04One row per metric declared by this dataset
05A contact so we can follow up on discrepancies

CodeSearchNet.

Best published scores.

6 paperstied to this benchmark.

Neighbouring benchmarks.

Have a score that beatsthis table?

6 papers
tied to this benchmark.

Have a score that beats
this table?