| 01 | Codex / GPT-5.5 Official Terminal-Bench 2.0 leaderboard rank 1. System couples agent scaffold and underlying model: Codex / GPT-5.5. | verified | 82 | 2026 | Source ↗ |
| 02 | Codex / GPT-5.5 Official Terminal-Bench 2.0 leaderboard rank 1. System couples agent scaffold and underlying model: Codex / GPT-5.5. | verified | 82 | 2026 | Source ↗ |
| 03 | ForgeCode / GPT-5.4 Official Terminal-Bench 2.0 leaderboard rank 2. System couples agent scaffold and underlying model: ForgeCode / GPT-5.4. | verified | 81.8 | 2026 | Source ↗ |
| 04 | ForgeCode / GPT-5.4 Official Terminal-Bench 2.0 leaderboard rank 2. System couples agent scaffold and underlying model: ForgeCode / GPT-5.4. | verified | 81.8 | 2026 | Source ↗ |
| 05 | TongAgents / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 3. System couples agent scaffold and underlying model: TongAgents / Gemini 3.1 Pro. | verified | 80.2 | 2026 | Source ↗ |
| 06 | TongAgents / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 3. System couples agent scaffold and underlying model: TongAgents / Gemini 3.1 Pro. | verified | 80.2 | 2026 | Source ↗ |
| 07 | ForgeCode / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 4. System couples agent scaffold and underlying model: ForgeCode / Claude Opus 4.6. | verified | 79.8 | 2026 | Source ↗ |
| 08 | ForgeCode / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 4. System couples agent scaffold and underlying model: ForgeCode / Claude Opus 4.6. | verified | 79.8 | 2026 | Source ↗ |
| 09 | ForgeCode / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 6. System couples agent scaffold and underlying model: ForgeCode / Gemini 3.1 Pro. | verified | 78.4 | 2026 | Source ↗ |
| 10 | SageAgent / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 5. System couples agent scaffold and underlying model: SageAgent / GPT-5.3-Codex. | verified | 78.4 | 2026 | Source ↗ |
| 11 | SageAgent / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 5. System couples agent scaffold and underlying model: SageAgent / GPT-5.3-Codex. | verified | 78.4 | 2026 | Source ↗ |
| 12 | ForgeCode / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 6. System couples agent scaffold and underlying model: ForgeCode / Gemini 3.1 Pro. | verified | 78.4 | 2026 | Source ↗ |
| 13 | Droid / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 7. System couples agent scaffold and underlying model: Droid / GPT-5.3-Codex. | verified | 77.3 | 2026 | Source ↗ |
| 14 | Droid / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 7. System couples agent scaffold and underlying model: Droid / GPT-5.3-Codex. | verified | 77.3 | 2026 | Source ↗ |
| 15 | Capy / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 8. System couples agent scaffold and underlying model: Capy / Claude Opus 4.6. | verified | 75.3 | 2026 | Source ↗ |
| 16 | Capy / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 8. System couples agent scaffold and underlying model: Capy / Claude Opus 4.6. | verified | 75.3 | 2026 | Source ↗ |
| 17 | Simple Codex / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 9. System couples agent scaffold and underlying model: Simple Codex / GPT-5.3-Codex. | verified | 75.1 | 2026 | Source ↗ |
| 18 | Simple Codex / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 9. System couples agent scaffold and underlying model: Simple Codex / GPT-5.3-Codex. | verified | 75.1 | 2026 | Source ↗ |
| 19 | Terminus-KIRA / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 10. System couples agent scaffold and underlying model: Terminus-KIRA / Gemini 3.1 Pro. | verified | 74.8 | 2026 | Source ↗ |
| 20 | Terminus-KIRA / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 10. System couples agent scaffold and underlying model: Terminus-KIRA / Gemini 3.1 Pro. | verified | 74.8 | 2026 | Source ↗ |
| 21 | Terminus-KIRA / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 11. System couples agent scaffold and underlying model: Terminus-KIRA / Claude Opus 4.6. | verified | 74.7 | 2026 | Source ↗ |
| 22 | Terminus-KIRA / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 11. System couples agent scaffold and underlying model: Terminus-KIRA / Claude Opus 4.6. | verified | 74.7 | 2026 | Source ↗ |
| 23 | Mux / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 12. System couples agent scaffold and underlying model: Mux / GPT-5.3-Codex. | verified | 74.6 | 2026 | Source ↗ |
| 24 | Mux / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 12. System couples agent scaffold and underlying model: Mux / GPT-5.3-Codex. | verified | 74.6 | 2026 | Source ↗ |
| 25 | MAYA-V2 / Claude 4.6 Opus Official Terminal-Bench 2.0 leaderboard rank 13. System couples agent scaffold and underlying model: MAYA-V2 / Claude 4.6 Opus. | verified | 72.1 | 2026 | Source ↗ |
| 26 | MAYA-V2 / Claude 4.6 Opus Official Terminal-Bench 2.0 leaderboard rank 13. System couples agent scaffold and underlying model: MAYA-V2 / Claude 4.6 Opus. | verified | 72.1 | 2026 | Source ↗ |
| 27 | TongAgents / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 14. System couples agent scaffold and underlying model: TongAgents / Claude Opus 4.6. | verified | 71.9 | 2026 | Source ↗ |
| 28 | TongAgents / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 14. System couples agent scaffold and underlying model: TongAgents / Claude Opus 4.6. | verified | 71.9 | 2026 | Source ↗ |
| 29 | Junie CLI / Multiple Official Terminal-Bench 2.0 leaderboard rank 15. System couples agent scaffold and underlying model: Junie CLI / Multiple. | verified | 71 | 2026 | Source ↗ |
| 30 | Junie CLI / Multiple Official Terminal-Bench 2.0 leaderboard rank 15. System couples agent scaffold and underlying model: Junie CLI / Multiple. | verified | 71 | 2026 | Source ↗ |
| 31 | CodeBrain-1 / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 16. System couples agent scaffold and underlying model: CodeBrain-1 / GPT-5.3-Codex. | verified | 70.3 | 2026 | Source ↗ |
| 32 | CodeBrain-1 / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 16. System couples agent scaffold and underlying model: CodeBrain-1 / GPT-5.3-Codex. | verified | 70.3 | 2026 | Source ↗ |
| 33 | Droid / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 17. System couples agent scaffold and underlying model: Droid / Claude Opus 4.6. | verified | 69.9 | 2026 | Source ↗ |
| 34 | Droid / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 17. System couples agent scaffold and underlying model: Droid / Claude Opus 4.6. | verified | 69.9 | 2026 | Source ↗ |
| 35 | Ante / Gemini 3 Pro Official Terminal-Bench 2.0 leaderboard rank 18. System couples agent scaffold and underlying model: Ante / Gemini 3 Pro. | verified | 69.4 | 2026 | Source ↗ |
| 36 | Ante / Gemini 3 Pro Official Terminal-Bench 2.0 leaderboard rank 18. System couples agent scaffold and underlying model: Ante / Gemini 3 Pro. | verified | 69.4 | 2026 | Source ↗ |
| 37 | IndusAGI Coding Agent / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 19. System couples agent scaffold and underlying model: IndusAGI Coding Agent / GPT-5.3-Codex. | verified | 69.1 | 2026 | Source ↗ |
| 38 | IndusAGI Coding Agent / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 19. System couples agent scaffold and underlying model: IndusAGI Coding Agent / GPT-5.3-Codex. | verified | 69.1 | 2026 | Source ↗ |
| 39 | Crux / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 20. System couples agent scaffold and underlying model: Crux / Claude Opus 4.6. | verified | 66.9 | 2026 | Source ↗ |
| 40 | Crux / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 20. System couples agent scaffold and underlying model: Crux / Claude Opus 4.6. | verified | 66.9 | 2026 | Source ↗ |