Recommended default
Qwen3.6-35B-A3B Q4
Use the Q4 GGUF build with a modest context window. This is the highest-scoring current open-weight model that fits this card cleanly. Candidates are ranked by benchmark score, then fit, then freshness, not by parameter count.
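The selection order described here (benchmark, then fit, then freshness) can be read as a filter followed by a tuple-keyed comparison. The sketch below is one possible reading, not the tool's actual logic: the `Candidate` fields, the `pick_default` function, the `reserve_gb` headroom, and every number in `CANDIDATES` are hypothetical and exist only to show the ordering.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Candidate:
    name: str
    score: float        # headline benchmark score, higher is better
    weights_gb: float   # size of the quantized weights on disk
    released: date      # release date, newer is fresher
    params_b: float     # parameter count in billions, deliberately unused below


def pick_default(candidates, vram_gb, reserve_gb=2.0):
    """Return the best candidate that fits in VRAM, or None if nothing fits.

    Assumption: "fit" is a hard requirement first, and leftover headroom is
    then used as the tie-break between equally scored candidates.
    """
    budget = vram_gb - reserve_gb  # memory left for weights after a fixed reserve
    fitting = [c for c in candidates if c.weights_gb <= budget]
    if not fitting:
        return None
    # Tuple key: benchmark score first, then headroom, then release date.
    return max(fitting, key=lambda c: (c.score, budget - c.weights_gb, c.released))


# Made-up entries for illustration only; none of these numbers are measurements.
CANDIDATES = [
    Candidate("dense-70b-q4", 70.0, 40.0, date(2024, 1, 1), 70.0),
    Candidate("moe-35b-q4", 80.0, 20.0, date(2026, 1, 1), 35.0),
    Candidate("small-8b-q8", 60.0, 9.0, date(2025, 6, 1), 8.0),
]

best = pick_default(CANDIDATES, vram_gb=24.0)
print(best.name)
```

Because `params_b` never appears in the key, a larger model wins only if it also scores higher and fits, which matches the "not by parameter count" rule.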
Benchmark anchor
BF16 / NVFP4: MMLU-Pro 85.6 / 85.0 · GPQA Diamond 84.9 / 84.8 · SciCode 40.8 / 40.6 · AIME 2025 89.2 / 88.8 (source: NVIDIA Qwen3.6-35B-A3B-NVFP4 model card).
Evidence
Qwen3.6-35B-A3B has stronger 2026 benchmark evidence than older 70B compatibility models, and NVFP4 loses at most 0.6 points against BF16 on the four benchmarks in NVIDIA's published table.
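The size of the quantization loss can be checked directly from the benchmark anchor. The short sketch below subtracts each NVFP4 score from its BF16 counterpart; the score pairs are copied from the anchor line, while the names `SCORES` and `quantization_drops` are illustrative. It covers NVFP4 only and measures nothing about the Q4 GGUF build.

```python
# (BF16, NVFP4) score pairs copied from the benchmark anchor.
SCORES = {
    "MMLU-Pro": (85.6, 85.0),
    "GPQA Diamond": (84.9, 84.8),
    "SciCode": (40.8, 40.6),
    "AIME 2025": (89.2, 88.8),
}


def quantization_drops(scores):
    """Return BF16 minus NVFP4 per benchmark, rounded to one decimal place."""
    return {name: round(bf16 - nvfp4, 1) for name, (bf16, nvfp4) in scores.items()}


drops = quantization_drops(SCORES)
worst = max(drops, key=drops.get)  # benchmark with the largest drop

for name, drop in drops.items():
    print(f"{name}: -{drop:.1f}")
print(f"largest drop: {worst} ({drops[worst]:.1f} points)")
```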