Computer Code
Developing AI coding assistants? Test code generation, completion, translation, bug detection, and repair capabilities.
Tasks in Computer Code
Code Generation
Generating code from natural language descriptions, evaluated on benchmarks such as HumanEval and MBPP.
Code Completion
Predicting the next tokens in code sequences.
Code Translation
Converting code between programming languages.
Code Summarization
Generating natural language descriptions of code.
Bug Detection
Identifying bugs and vulnerabilities in code.
Program Repair
Automatically fixing bugs in code.
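Code generation benchmarks such as HumanEval are commonly scored with pass@k: the probability that at least one of k sampled solutions passes the unit tests. The sketch below is a minimal illustration of the standard unbiased estimator, assuming n samples were drawn per problem and c of them passed. The function name `pass_at_k` and its argument names are chosen here for illustration and are not taken from this page.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem.

    n: total number of samples generated for the problem (assumed input)
    c: number of those samples that passed all unit tests (assumed input)
    k: sample budget being evaluated, with 1 <= k <= n
    """
    if not (0 <= c <= n):
        raise ValueError("c must satisfy 0 <= c <= n")
    if not (1 <= k <= n):
        raise ValueError("k must satisfy 1 <= k <= n")
    # Fewer than k failing samples: every size-k subset contains a pass.
    if n - c < k:
        return 1.0
    # 1 minus the probability that a random size-k subset contains only failures.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems, given (n, c) pairs per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

A benchmark-level score is then the mean over problems, for example `mean_pass_at_k([(10, 3), (10, 0)], k=1)`.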