Recommended default
Qwen3.6-35B-A3B (higher quant)
Use a Q5-class quant, or FP4 where the hardware supports it, with a practical context window of 32k-64k tokens. This is the highest-scoring current open-weight model that fits this card cleanly. Selection order is benchmark score, then fit, then freshness, not parameter count.
Benchmark anchor
Same Qwen3.6-35B-A3B score profile (MMLU-Pro 85.6/85.0, GPQA Diamond 84.9/84.8). NVFP4 loses little relative to BF16, which matters for Blackwell-era deployment.
Evidence
The NVFP4 numbers show little degradation relative to BF16. On a 32GB card, the levers are quant quality and context length, not parameter count.
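
The fit argument above can be checked with rough arithmetic. The sketch below is a back-of-envelope estimate, not a measurement. Everything except the 35B parameter count, the 32GB card, and the 32k-64k context range is an assumption: the bits-per-weight figures (about 5.5 for a Q5-class quant, about 4.5 for FP4 with block scales), the 1.5 GiB runtime overhead, and the attention shape (48 layers, 4 KV heads, head dimension 128), which is a hypothetical placeholder and not the model's published config.

```python
GIB = 1024 ** 3


def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight footprint in GiB at an average bits-per-weight."""
    return params_billion * 1e9 * bits_per_weight / 8 / GIB


def kv_cache_gib(tokens: int, layers: int, kv_heads: int, head_dim: int,
                 bytes_per_elem: int = 2) -> float:
    """Approximate KV cache size in GiB (factor 2 covers keys and values)."""
    return 2 * layers * kv_heads * head_dim * bytes_per_elem * tokens / GIB


def budget(card_gib: float, params_billion: float, bits_per_weight: float,
           tokens: int, layers: int, kv_heads: int, head_dim: int,
           overhead_gib: float = 1.5) -> tuple[float, bool]:
    """Return (estimated total GiB, whether it fits on the card)."""
    total = (weight_gib(params_billion, bits_per_weight)
             + kv_cache_gib(tokens, layers, kv_heads, head_dim)
             + overhead_gib)
    return total, total <= card_gib


# Hypothetical attention shape, used only to make the arithmetic concrete.
ASSUMED_SHAPE = dict(layers=48, kv_heads=4, head_dim=128)

if __name__ == "__main__":
    for label, bpw in [("Q5-class (assumed 5.5 bpw)", 5.5),
                       ("FP4 (assumed 4.5 bpw)", 4.5),
                       ("BF16 (16 bpw)", 16.0)]:
        for tokens in (32_768, 65_536):
            total, ok = budget(32, 35, bpw, tokens, **ASSUMED_SHAPE)
            print(f"{label:28s} ctx={tokens:6d}  total={total:5.1f} GiB  fits={ok}")
```

Under these assumptions the Q5-class weights take roughly 22 GiB and a 64k-token KV cache roughly 6 GiB, so the higher quant and the longer context both fit in 32 GiB, while BF16 weights alone do not. Swapping in the real layer count and KV head layout changes the numbers but not the shape of the trade-off.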