| 01 | Qwen/Qwen3.5-35B-A3B thinking (API) OSS | Qwen | Jul 2025 | SpeakLeash/CPTU-Bench | 4.70 |
| 02 | Qwen/Qwen3.5-27B thinking (API) OSS | Qwen | Jul 2025 | SpeakLeash/CPTU-Bench | 4.61 |
| 03 | Qwen/Qwen3.5-27B non-thinking (API) OSS | Qwen | Jul 2025 | SpeakLeash/CPTU-Bench | 4.43 |
| 04 | deepseek-ai/DeepSeek-V3.2 (API) OSS | deepseek-ai | Jul 2025 | SpeakLeash/CPTU-Bench | 4.20 |
| 05 | Qwen/Qwen3.5-35B-A3B non-thinking (API) OSS | Qwen | Jul 2025 | SpeakLeash/CPTU-Bench | 4.19 |
| 06 | deepseek-ai/DeepSeek-R1 (API) OSS | deepseek-ai | Jan 2025 | SpeakLeash/CPTU-Bench | 4.12 |
| 07 | 🚧 DeepSeek-V3-0324 OSS | deepseek-ai | Mar 2025 | SpeakLeash/CPTU-Bench | 4.02 |
| 08 | deepseek-ai/DeepSeek-V3 (API) OSS | deepseek-ai | Dec 2024 | SpeakLeash/CPTU-Bench | 3.99 |
| 09 | gemini-2.0-flash-001 OSS | Google | Feb 2025 | SpeakLeash/CPTU-Bench | 3.99 |
| 10 | moonshotai/Kimi-K2-Instruct-0905 (API) OSS | moonshotai | Sep 2025 | SpeakLeash/CPTU-Bench | 3.93 |
| 11 | openai/gpt-oss-120b (API) OSS | openai | Jun 2025 | SpeakLeash/CPTU-Bench | 3.89 |
| 12 | deepseek-ai/DeepSeek-V3.1 (API) OSS | deepseek-ai | May 2025 | SpeakLeash/CPTU-Bench | 3.87 |
| 13 | gemini-2.0-flash-lite-001 OSS | Google | Feb 2025 | SpeakLeash/CPTU-Bench | 3.85 |
| 14 | Qwen/Qwen3-235B-A22B non-thinking (API) OSS | Qwen | Apr 2025 | SpeakLeash/CPTU-Bench | 3.84 |
| 15 | Qwen2.5-72B-Instruct OSS | Qwen | Sep 2024 | SpeakLeash/CPTU-Bench | 3.81 |
| 16 | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 (API) OSS | meta-llama | Apr 2025 | SpeakLeash/CPTU-Bench | 3.76 |
| 17 | Mistral-Large-Instruct-2411 OSS | mistralai | Nov 2024 | SpeakLeash/CPTU-Bench | 3.72 |
| 18 | Meta-Llama-3-70B-Instruct OSS | meta-llama | Apr 2024 | SpeakLeash/CPTU-Bench | 3.71 |
| 19 | Qwen2-72B-Instruct OSS | Qwen | Jun 2024 | SpeakLeash/CPTU-Bench | 3.68 |
| 20 | Mistral-Large-Instruct-2407 OSS | mistralai | Jul 2024 | SpeakLeash/CPTU-Bench | 3.65 |
| 21 | Qwen/Qwen3.5-9B non-thinking (API, FP8) OSS | Qwen | Jul 2025 | SpeakLeash/CPTU-Bench | 3.64 |
| 22 | Qwen2.5-32B-Instruct OSS | Qwen | Sep 2024 | SpeakLeash/CPTU-Bench | 3.59 |
| 23 | Qwen/Qwen3-32B non-thinking (API) OSS | Qwen | Apr 2025 | SpeakLeash/CPTU-Bench | 3.56 |
| 24 | Qwen/Qwen3-30B-A3B non-thinking (API) OSS | Qwen | Apr 2025 | SpeakLeash/CPTU-Bench | 3.54 |
| 25 | gemma-3-27b-it OSS | google | Mar 2025 | SpeakLeash/CPTU-Bench | 3.53 |
| 26 | Bielik-11B-v2.1-Instruct OSS | speakleash | Sep 2024 | SpeakLeash/CPTU-Bench | 3.47 |
| 27 | Mistral-Small-24B-Instruct-2501 OSS | mistralai | Jan 2025 | SpeakLeash/CPTU-Bench | 3.45 |
| 28 | NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 OSS | nvidia | Jun 2025 | SpeakLeash/CPTU-Bench | 3.43 |
| 29 | mistralai/Mistral-Small-3.1-24B-Instruct-2503 (API FP8) OSS | mistralai | Mar 2025 | SpeakLeash/CPTU-Bench | 3.42 |
| 30 | Llama-3.3-70B-Instruct OSS | meta-llama | Dec 2024 | SpeakLeash/CPTU-Bench | 3.38 |
| 31 | Qwen2.5-14B-Instruct OSS | Qwen | Sep 2024 | SpeakLeash/CPTU-Bench | 3.34 |
| 32 | Qwen/Qwen3-14B non-thinking (API) OSS | Qwen | Apr 2025 | SpeakLeash/CPTU-Bench | 3.33 |
| 33 | mistralai/Mistral-Small-3.2-24B-Instruct-2506 (API FP8) OSS | mistralai | Jun 2025 | SpeakLeash/CPTU-Bench | 3.30 |
| 34 | Mixtral-8x22B-Instruct-v0.1 OSS | mistralai | Apr 2024 | SpeakLeash/CPTU-Bench | 3.24 |
| 35 | Bielik-11B-v2.3-Instruct OSS | speakleash | Nov 2024 | SpeakLeash/CPTU-Bench | 3.22 |
| 36 | Llama-PLLuM-70B-chat OSS | CYFRAGOVPL | Mar 2025 | SpeakLeash/CPTU-Bench | 3.21 |
| 37 | Llama-4-Scout-17B-16E-Instruct OSS | meta-llama | Apr 2025 | SpeakLeash/CPTU-Bench | 3.19 |
| 38 | Bielik-11B-v3.0-Instruct OSS | speakleash | Jun 2025 | SpeakLeash/CPTU-Bench | 3.19 |
| 39 | Bielik-11B-v2.2-Instruct OSS | speakleash | Oct 2024 | SpeakLeash/CPTU-Bench | 3.12 |
| 40 | Bielik-11B-v2.6-Instruct OSS | speakleash | Feb 2025 | SpeakLeash/CPTU-Bench | 3.10 |
| 41 | WizardLM-2-8x22B OSS | alpindale | Apr 2024 | SpeakLeash/CPTU-Bench | 3.06 |
| 42 | Meta-Llama-3.1-70B-Instruct OSS | meta-llama | Jul 2024 | SpeakLeash/CPTU-Bench | 3.01 |
| 43 | Bielik-11B-v2.5-Instruct OSS | speakleash | Jan 2025 | SpeakLeash/CPTU-Bench | 2.91 |
| 44 | pllum-12b-nc-chat-250715 OSS | CYFRAGOVPL | Jul 2025 | SpeakLeash/CPTU-Bench | 2.90 |
| 45 | Qwen/Qwen3-8B non-thinking (API) OSS | Qwen | Apr 2025 | SpeakLeash/CPTU-Bench | 2.76 |
| 46 | EuroLLM-9B-Instruct OSS | utter-project | Mar 2025 | SpeakLeash/CPTU-Bench | 2.75 |
| 47 | speakleash/Bielik-Minitron-7B-v3.0-Instruct OSS | speakleash | Jul 2025 | SpeakLeash/CPTU-Bench | 2.74 |
| 48 | phi-4 OSS | microsoft | Jan 2025 | SpeakLeash/CPTU-Bench | 2.72 |
| 49 | Qwen1.5-72B-Chat OSS | Qwen | Feb 2024 | SpeakLeash/CPTU-Bench | 2.67 |
| 50 | Llama-PLLuM-70B-instruct OSS | CYFRAGOVPL | Mar 2025 | SpeakLeash/CPTU-Bench | 2.63 |
| 51 | CYFRAGOVPL/PLLuM-12B-nc-chat OSS | CYFRAGOVPL | Apr 2025 | SpeakLeash/CPTU-Bench | 2.62 |
| 52 | PLLuM-12B-chat OSS | CYFRAGOVPL | Apr 2025 | SpeakLeash/CPTU-Bench | 2.59 |
| 53 | Qwen2.5-7B-Instruct OSS | Qwen | Sep 2024 | SpeakLeash/CPTU-Bench | 2.58 |
| 54 | Meta-Llama-3-8B-Instruct OSS | meta-llama | Apr 2024 | SpeakLeash/CPTU-Bench | 2.48 |
| 55 | Bielik-4.5B-v3.0-Instruct OSS | speakleash | Jun 2025 | SpeakLeash/CPTU-Bench | 2.46 |
| 56 | CYFRAGOVPL/pllum-12b-nc-instruct-250715 OSS | CYFRAGOVPL | Jul 2025 | SpeakLeash/CPTU-Bench | 2.37 |
| 57 | Llama-PLLuM-8B-chat OSS | CYFRAGOVPL | Mar 2025 | SpeakLeash/CPTU-Bench | 2.25 |
| 58 | gemma-2-2b-it OSS | google | Jun 2024 | SpeakLeash/CPTU-Bench | 2.21 |
| 59 | Bielik-11B-v2.0-Instruct OSS | speakleash | Aug 2024 | SpeakLeash/CPTU-Bench | 2.20 |
| 60 | Bielik-7B-Instruct-v0.1 OSS | speakleash | Apr 2024 | SpeakLeash/CPTU-Bench | 2.16 |
| 61 | SOLAR-10.7B-Instruct-v1.0 OSS | upstage | Dec 2023 | SpeakLeash/CPTU-Bench | 2.12 |
| 62 | Meta-Llama-3.1-8B-Instruct OSS | meta-llama | Jul 2024 | SpeakLeash/CPTU-Bench | 2.11 |
| 63 | Mistral-Nemo-Instruct-2407 OSS | mistralai | Jul 2024 | SpeakLeash/CPTU-Bench | 2.09 |
| 64 | Mistral-7B-Instruct-v0.3 OSS | mistralai | May 2024 | SpeakLeash/CPTU-Bench | 1.99 |
| 65 | glm-4-9b-chat OSS | THUDM | Jun 2024 | SpeakLeash/CPTU-Bench | 1.98 |
| 66 | CYFRAGOVPL/PLLuM-12B-nc-instruct OSS | CYFRAGOVPL | Apr 2025 | SpeakLeash/CPTU-Bench | 1.98 |
| 67 | openchat-3.5-0106 OSS | openchat | Dec 2023 | SpeakLeash/CPTU-Bench | 1.96 |
| 68 | PLLuM-12B-instruct OSS | CYFRAGOVPL | Apr 2025 | SpeakLeash/CPTU-Bench | 1.90 |
| 69 | Qwen2.5-3B-Instruct OSS | Qwen | Sep 2024 | SpeakLeash/CPTU-Bench | 1.81 |
| 70 | PLLuM-8x7B-nc-chat OSS | CYFRAGOVPL | Feb 2025 | SpeakLeash/CPTU-Bench | 1.80 |
| 71 | Mixtral-8x7B-Instruct-v0.1 OSS | mistralai | Dec 2023 | SpeakLeash/CPTU-Bench | 1.80 |
| 72 | PLLuM-8x7B-chat OSS | CYFRAGOVPL | Feb 2025 | SpeakLeash/CPTU-Bench | 1.78 |
| 73 | PLLuM-8x7B-nc-instruct OSS | CYFRAGOVPL | Feb 2025 | SpeakLeash/CPTU-Bench | 1.76 |
| 74 | openchat-3.5-0106-gemma OSS | openchat | Dec 2023 | SpeakLeash/CPTU-Bench | 1.68 |
| 75 | Starling-LM-7B-alpha OSS | berkeley-nest | Nov 2023 | SpeakLeash/CPTU-Bench | 1.68 |
| 76 | CYFRAGOVPL/Llama-PLLuM-8B-instruct OSS | CYFRAGOVPL | Mar 2025 | SpeakLeash/CPTU-Bench | 1.66 |
| 77 | PLLuM-8x7B-instruct OSS | CYFRAGOVPL | Feb 2025 | SpeakLeash/CPTU-Bench | 1.51 |
| 78 | Phi-4-mini-instruct OSS | microsoft | Apr 2025 | SpeakLeash/CPTU-Bench | 1.30 |
| 79 | Bielik-1.5B-v3.0-Instruct OSS | speakleash | Jun 2025 | SpeakLeash/CPTU-Bench | 1.22 |
| 80 | Llama-3.2-3B-Instruct OSS | meta-llama | Sep 2024 | SpeakLeash/CPTU-Bench | 1.22 |
| 81 | NousResearch/Hermes-3-Llama-3.2-3B OSS | NousResearch | Oct 2024 | SpeakLeash/CPTU-Bench | 1.14 |
| 82 | Phi-3.5-mini-instruct OSS | microsoft | Aug 2024 | SpeakLeash/CPTU-Bench | 1.04 |
| 83 | trurl-2-13b-academic OSS | Voicelab | Jan 2024 | SpeakLeash/CPTU-Bench | 1.02 |
| 84 | Yi-1.5-34B-Chat OSS | 01-ai | May 2024 | SpeakLeash/CPTU-Bench | 1.00 |
| 85 | EuroLLM-1.7B-Instruct OSS | utter-project | Jan 2025 | SpeakLeash/CPTU-Bench | 0.758 |
| 86 | Qwen2.5-1.5B-Instruct OSS | Qwen | Sep 2024 | SpeakLeash/CPTU-Bench | 0.663 |
| 87 | granite-3.1-2b-instruct OSS | ibm-granite | Jan 2025 | SpeakLeash/CPTU-Bench | 0.590 |
| 88 | Llama-3.2-1B-Instruct OSS | meta-llama | Sep 2024 | SpeakLeash/CPTU-Bench | 0.522 |
| 89 | LGAI-EXAONE/EXAONE-3.5-2.4B-Instruct OSS | LGAI-EXAONE | Jan 2025 | SpeakLeash/CPTU-Bench | 0.489 |
| 90 | SmolLM2-1.7B-Instruct OSS | HuggingFaceTB | Feb 2025 | SpeakLeash/CPTU-Bench | 0.253 |
| 91 | Qwen/Qwen2.5-0.5B-Instruct OSS | Qwen | Sep 2024 | SpeakLeash/CPTU-Bench | 0.219 |
| 92 | h2oai/h2o-danube2-1.8b-chat OSS | h2oai | Apr 2024 | SpeakLeash/CPTU-Bench | 0.129 |
| 93 | internlm2-chat-20b OSS | internlm | Jan 2024 | SpeakLeash/CPTU-Bench | 0.124 |