OpenRouter's rankings page ships a full year of weekly token data per vendor. We analysed it. While benchmark leaderboards argue about Claude, GPT-5 and Gemini, the actual inference market has shifted somewhere else entirely — Chinese open-weight labs went from a rounding error to more than half of the tracked flow in twelve months, and the total market is up 11.1×.
Stacked share of total OpenRouter top-9 token volume. The “how many tokens” view — it undercounts premium vendors whose tokens cost 10–30× more. The dollar view is § 02.
Same weeks, same vendors — each vendor's token count is multiplied by its current average blended $/M, computed from the live model catalog weighted by real usage. The “how much money” view. Anthropic's token share is 12.3% but its dollar share is 46.3% — that gap is the whole premium-lane thesis.
Last 4 weeks. Positive gap = premium pricing, below-weight on tokens. Negative gap = volume play, below-weight on dollars.
| Vendor | Tokens | Dollars | Gap (pts) | Avg $/M |
|---|---|---|---|---|
| 13.3% | 7.0% | -6.3 | $1.12 | |
| xiaomi | 13.0% | 9.0% | -4.0 | $1.47 |
| qwen | 12.7% | 4.6% | -8.1 | $0.77 |
| anthropic | 12.3% | 46.3% | +34.0 | $7.95 |
| openai | 9.8% | 24.2% | +14.4 | $5.25 |
| minimax | 9.5% | 2.1% | -7.3 | $0.48 |
| deepseek | 6.3% | 0.9% | -5.4 | $0.30 |
| z-ai | 6.0% | 5.2% | -0.8 | $1.82 |
| stepfun | 4.8% | 0.4% | -4.4 | $0.16 |
| x-ai | 1.0% | 0.3% | -0.7 | $0.55 |
| nvidia | 0.3% | 0.0% | -0.3 | $0.19 |
| meta-llama | 0.0% | 0.0% | 0.0 | — |
| mistralai | 0.0% | 0.0% | 0.0 | $0.03 |
| microsoft | 0.0% | 0.0% | 0.0 | $0.62 |
| nousresearch | 0.0% | 0.0% | 0.0 | $0.20 |
Combined share of xiaomi, qwen, minimax, deepseek, z-ai, stepfun.
First 4 weeks of the series vs the last 4. Growth ratio is absolute tokens, not share.
| Vendor | Year-ago | Now | Δ pts | Past tok | Now tok | Growth |
|---|---|---|---|---|---|---|
| 37.0% | 13.3% | -23.7 | 2.56T | 10.15T | 4.0× | |
| xiaomiCN | 0.0% | 13.0% | +13.0 | 0 | 9.90T | new |
| qwenCN | 2.2% | 12.7% | +10.5 | 154.1B | 9.71T | 62× |
| anthropic | 24.7% | 12.3% | -12.3 | 1.71T | 9.44T | 5.5× |
| openai | 11.4% | 9.8% | -1.6 | 788.1B | 7.47T | 9.5× |
| minimaxCN | 0.0% | 9.5% | +9.5 | 0 | 7.24T | new |
| deepseekCN | 12.5% | 6.3% | -6.2 | 864.8B | 4.84T | 5.6× |
| z-aiCN | 0.0% | 6.0% | +6.0 | 0 | 4.58T | new |
| stepfunCN | 0.0% | 4.8% | +4.8 | 0 | 3.66T | new |
| x-ai | 0.4% | 1.0% | +0.6 | 25.2B | 735.7B | 29× |
| nvidia | 0.0% | 0.3% | +0.3 | 0 | 231.2B | new |
| meta-llama | 5.6% | 0.0% | -5.6 | 385.3B | 0 | -100% |
| mistralai | 2.9% | 0.0% | -2.9 | 197.7B | 0 | -100% |
| microsoft | 0.8% | 0.0% | -0.8 | 53.0B | 0 | -100% |
| nousresearch | 0.3% | 0.0% | -0.3 | 17.8B | 0 | -100% |
The short answer is no, and the longer answer has three parts.
1 · The market is stratifying, not choosing. Claude Opus 4.6 is the #1 model on our inverted leaderboard by dollar spend — $25M/month across 24 apps — because it is priced at $5/$25 per million. But it is only ~4th by token volume (2.4T), and Anthropic as a vendor is ~12% of total tokens, down from ~25% last year. Both things are true at once: the premium lane still pays Anthropic real money, and the commodity lane has moved decisively elsewhere.
2 · It is following price, not quality. The vendors gaining share (Qwen, Xiaomi, MiniMax, DeepSeek, Z.ai, StepFun) share one property: aggressive open-weight pricing, often sub-$1/M blended. On the inverted model leaderboard, Qwen3.6 Plus is in 27 of 30 apps and MiMo-V2-Pro handles 5.5T tokens at ~$1.50/M blended — a tiny fraction of what a comparable Claude Opus run would cost.
3 · Benchmark-to-spend correlation is weak, and inverted at the top. Among models with both token volume and a published benchmark score, the relationship between benchmark rank and market share is closer to anti-correlated. Premium models capture most of the dollar spend because they're priced 10–30× higher; they do not capture tokens. Agents — when given a free choice through a router — route most tokens to “good enough and 20× cheaper” rather than “best and 20× more expensive”.
This cuts two ways. It says benchmark leaderboards overstate real-world adoption for frontier models. It also says the market may be under-weighting quality in agentic workflows where a wrong answer is expensive — we do not yet have the data to distinguish “efficient routers” from “cheap routers paid for by downstream users”.
openrouter.ai/rankings RSC stream. 53 weeks of vendor-level tokens, 77 weeks of model-level. Captured 2026-04-14.Seeing a model, vendor, or app shift that isn't reflected here? Tell us — we reply within 48 hours and update the analysis.