codesearchnet---java.

Dataset from Papers With Code


§ 01 · Leaderboard

Best published scores.

14 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.

Primary metric: smoothed-bleu-4 · higher is better

#	Model	Org	Submitted	Paper / code	smoothed-bleu-4
01	StarCoder-LoRA	BigCode / Salesforce	Jun 2024	codexglue-leaderboard	22.61
02	CodeTrans-MT-Large	—	Apr 2021	CodeTrans: Towards Cracking the Language of Silicon's Co… · code	21.87
03	DistillCodeT5	FSOFT AI Lab	Jun 2024	codexglue-leaderboard	20.51
04	PolyglotCodeBERT	UC Davis	Dec 2021	codexglue-leaderboard	20.11
05	ProphetNet-X	USTC / Microsoft Research Asia	Apr 2021	codexglue-leaderboard	19.39
06	CoTexT	Case Western Reserve University	May 2021	codexglue-leaderboard	19.06
07	PLBART	UCLA / Columbia University	Mar 2021	codexglue-leaderboard	18.45
08	CodeBERT (MLM+RTD)	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	14.56
09	CodeBERT (MLM)	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	13.59
10	RoBERTa	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	13.20
11	pre-train w/ code only	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	13.07
12	CodeBERT (RTD)	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	12.72
13	Transformer	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	12.57
14	seq2seq	—	Feb 2020	CodeBERT: A Pre-Trained Model for Programming and Natura… · code	11.42

Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.
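The ordering rule described above (score first, submission date as tie-breaker) can be sketched in a few lines of Python. This is only an illustration, not the site's actual code: the `Row` type and `rank` function are hypothetical names, the day of each date is fixed to 1 because the table gives month precision only, and "earlier submission wins a tie" is an assumption, since the page does not state the direction.

```python
from datetime import date
from typing import List, NamedTuple


class Row(NamedTuple):
    model: str
    submitted: date  # month precision only; day fixed to 1 (assumption)
    score: float     # smoothed-bleu-4, higher is better


def rank(rows: List[Row]) -> List[Row]:
    """Sort by score descending, then by submission date.

    Assumption: on equal scores the earlier submission ranks first.
    """
    return sorted(rows, key=lambda r: (-r.score, r.submitted))


# A few rows copied from the table above, deliberately out of order.
rows = [
    Row("PLBART", date(2021, 3, 1), 18.45),
    Row("StarCoder-LoRA", date(2024, 6, 1), 22.61),
    Row("CodeBERT (MLM+RTD)", date(2020, 2, 1), 14.56),
    Row("CodeTrans-MT-Large", date(2021, 4, 1), 21.87),
]

ranked = rank(rows)
sota = ranked[0]  # the row the leaderboard would shade
for position, row in enumerate(ranked, start=1):
    print(f"{position:02d}\t{row.model}\t{row.score:.2f}")
```

Negating the score inside the sort key keeps the date comparison ascending while the score comparison is descending, which avoids a second sorting pass.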

§ 04 · Literature

2 papers tied to this benchmark.

Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.

CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing
Apr 2021 · CodeTrans-MT-Large
arXiv · Code
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Feb 2020 · CodeBERT (MLM+RTD), CodeBERT (MLM), RoBERTa +4
arXiv · Code

§ 06 · Contribute

Have a score that beats this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and, if it takes the top spot, annotate the step on the progress chart with your name.


What a submission needs

01 A public checkpoint or API endpoint
02 A reproduction script with frozen commit + seed
03 Declared evaluation environment (Python, deps)
04 One row per metric declared by this dataset
05 A contact so we can follow up on discrepancies
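
One way to check a submission against the five items above before sending it is to collect them in a small manifest. The sketch below is a hypothetical structure, not an official schema: the `Submission` class, its field names, and the `missing_items` helper are all assumptions made for illustration, and every value in the example is a placeholder.

```python
from dataclasses import dataclass
from typing import Dict, List

# Metrics declared by this dataset (the leaderboard lists one).
DECLARED_METRICS = ("smoothed-bleu-4",)


@dataclass
class Submission:
    """Hypothetical manifest; field names are illustrative only."""
    checkpoint_url: str           # 01: public checkpoint or API endpoint
    repro_commit: str             # 02: frozen commit of the reproduction script
    seed: int                     # 02: random seed used for the run
    python_version: str           # 03: declared evaluation environment
    dependencies: Dict[str, str]  # 03: package name -> pinned version
    scores: Dict[str, float]      # 04: one entry per declared metric
    contact: str                  # 05: where to follow up on discrepancies


def missing_items(sub: Submission) -> List[str]:
    """Return the checklist items that the manifest leaves unfilled."""
    missing = []
    if not sub.checkpoint_url:
        missing.append("checkpoint or API endpoint")
    if not sub.repro_commit:
        missing.append("frozen commit")
    if not sub.python_version or not sub.dependencies:
        missing.append("evaluation environment")
    for metric in DECLARED_METRICS:
        if metric not in sub.scores:
            missing.append(f"score for {metric}")
    if not sub.contact:
        missing.append("contact")
    return missing


example = Submission(
    checkpoint_url="https://example.org/my-model",  # placeholder URL
    repro_commit="0123abc",                          # placeholder commit hash
    seed=42,
    python_version="3.11",
    dependencies={"torch": "2.3.0"},                 # placeholder pin
    scores={"smoothed-bleu-4": 0.0},                 # placeholder, not a real result
    contact="you@example.org",
)
print(missing_items(example))  # prints []
```

Keeping the declared metrics in one tuple means the "one row per metric" check would extend without code changes if the dataset ever declared a second metric.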
