Computer Code

Developing AI coding assistants? Test code generation, completion, translation, summarization, bug detection, and repair capabilities.

6 tasks · 8 datasets

Tasks in Computer Code

Generating code from natural language descriptions (HumanEval, MBPP).

Predicting the next tokens in code sequences.

Converting code between programming languages.

Generating natural language descriptions of code.

Identifying bugs and vulnerabilities in code.

Automatically fixing bugs in code.
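For the code generation task, benchmarks such as HumanEval and MBPP judge a generated program by running it against unit tests rather than by comparing text. The sketch below illustrates that idea under stated assumptions: `passes_tests` and `pass_at_k` are hypothetical helper names, not part of any benchmark's official harness, and the `exec` call stands in for the sandboxed runner a real evaluation would need. `pass_at_k` follows the unbiased pass@k estimator commonly reported alongside HumanEval.

```python
from math import comb


def passes_tests(candidate_src: str, test_src: str) -> bool:
    """Return True if the candidate code runs and all assert-based tests pass.

    Illustration only: exec() on model output is unsafe outside a sandbox.
    """
    namespace = {}
    try:
        exec(candidate_src, namespace)  # define the candidate function(s)
        exec(test_src, namespace)       # run the asserts against them
    except Exception:
        return False
    return True


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n sampled solutions, c of which passed."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # Chance that a random size-k subset of the n samples has no passing one,
    # subtracted from 1. comb() returns 0 when fewer than k samples failed.
    return 1.0 - comb(n - c, k) / comb(n, k)


if __name__ == "__main__":
    tests = "assert add(2, 3) == 5"
    samples = [
        "def add(a, b):\n    return a + b",
        "def add(a, b):\n    return a - b",
    ]
    passed = sum(passes_tests(s, tests) for s in samples)
    print(passed, pass_at_k(n=len(samples), c=passed, k=1))
```

Scoring by execution means two differently written solutions both count as correct if they pass the tests, which is why these benchmarks ship test cases with every problem.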
