| 01 | Codex / GPT-5.5 Official Terminal-Bench 2.0 leaderboard rank 1. System couples agent scaffold and underlying model: Codex / GPT-5.5. | verified | 82 | 2026 | Source ↗ | Looks wrong? |
| 02 | ForgeCode / GPT-5.4 Official Terminal-Bench 2.0 leaderboard rank 2. System couples agent scaffold and underlying model: ForgeCode / GPT-5.4. | verified | 81.8 | 2026 | Source ↗ | Looks wrong? |
| 03 | TongAgents / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 3. System couples agent scaffold and underlying model: TongAgents / Gemini 3.1 Pro. | verified | 80.2 | 2026 | Source ↗ | Looks wrong? |
| 04 | ForgeCode / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 4. System couples agent scaffold and underlying model: ForgeCode / Claude Opus 4.6. | verified | 79.8 | 2026 | Source ↗ | Looks wrong? |
| 05 | ForgeCode / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 6. System couples agent scaffold and underlying model: ForgeCode / Gemini 3.1 Pro. | verified | 78.4 | 2026 | Source ↗ | Looks wrong? |
| 06 | SageAgent / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 5. System couples agent scaffold and underlying model: SageAgent / GPT-5.3-Codex. | verified | 78.4 | 2026 | Source ↗ | Looks wrong? |
| 07 | Droid / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 7. System couples agent scaffold and underlying model: Droid / GPT-5.3-Codex. | verified | 77.3 | 2026 | Source ↗ | Looks wrong? |
| 08 | Capy / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 8. System couples agent scaffold and underlying model: Capy / Claude Opus 4.6. | verified | 75.3 | 2026 | Source ↗ | Looks wrong? |
| 09 | Simple Codex / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 9. System couples agent scaffold and underlying model: Simple Codex / GPT-5.3-Codex. | verified | 75.1 | 2026 | Source ↗ | Looks wrong? |
| 10 | Terminus-KIRA / Gemini 3.1 Pro Official Terminal-Bench 2.0 leaderboard rank 10. System couples agent scaffold and underlying model: Terminus-KIRA / Gemini 3.1 Pro. | verified | 74.8 | 2026 | Source ↗ | Looks wrong? |
| 11 | Terminus-KIRA / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 11. System couples agent scaffold and underlying model: Terminus-KIRA / Claude Opus 4.6. | verified | 74.7 | 2026 | Source ↗ | Looks wrong? |
| 12 | Mux / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 12. System couples agent scaffold and underlying model: Mux / GPT-5.3-Codex. | verified | 74.6 | 2026 | Source ↗ | Looks wrong? |
| 13 | MAYA-V2 / Claude 4.6 Opus Official Terminal-Bench 2.0 leaderboard rank 13. System couples agent scaffold and underlying model: MAYA-V2 / Claude 4.6 Opus. | verified | 72.1 | 2026 | Source ↗ | Looks wrong? |
| 14 | TongAgents / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 14. System couples agent scaffold and underlying model: TongAgents / Claude Opus 4.6. | verified | 71.9 | 2026 | Source ↗ | Looks wrong? |
| 15 | Junie CLI / Multiple Official Terminal-Bench 2.0 leaderboard rank 15. System couples agent scaffold and underlying model: Junie CLI / Multiple. | verified | 71 | 2026 | Source ↗ | Looks wrong? |
| 16 | CodeBrain-1 / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 16. System couples agent scaffold and underlying model: CodeBrain-1 / GPT-5.3-Codex. | verified | 70.3 | 2026 | Source ↗ | Looks wrong? |
| 17 | Droid / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 17. System couples agent scaffold and underlying model: Droid / Claude Opus 4.6. | verified | 69.9 | 2026 | Source ↗ | Looks wrong? |
| 18 | Ante / Gemini 3 Pro Official Terminal-Bench 2.0 leaderboard rank 18. System couples agent scaffold and underlying model: Ante / Gemini 3 Pro. | verified | 69.4 | 2026 | Source ↗ | Looks wrong? |
| 19 | IndusAGI Coding Agent / GPT-5.3-Codex Official Terminal-Bench 2.0 leaderboard rank 19. System couples agent scaffold and underlying model: IndusAGI Coding Agent / GPT-5.3-Codex. | verified | 69.1 | 2026 | Source ↗ | Looks wrong? |
| 20 | Crux / Claude Opus 4.6 Official Terminal-Bench 2.0 leaderboard rank 20. System couples agent scaffold and underlying model: Crux / Claude Opus 4.6. | verified | 66.9 | 2026 | Source ↗ | Looks wrong? |