Blog · February 15, 2026
Every Major AI Price Cut (February 2026)
AI model pricing changes constantly. Vendors cut prices, launch cheaper alternatives, and shift their lineups — often without announcement. What some are calling an AI pricing war is playing out across every major provider in 2026. If you're running models in production, you need to know when the math changes.
This page tracks every major AI API pricing change in February 2026 across OpenAI, Anthropic, Google, DeepSeek, and others. We pull from our database of 412 models across 43 vendors, synced daily. All per-request costs assume 5,000 input tokens and 1,000 output tokens — a typical API call.
February 2026
The month the Opus price ceiling dropped and new budget frontiers arrived.
| Model | Vendor | IQ | $/1M in | $/1M out | What changed |
|---|---|---|---|---|---|
| Claude Opus 4.5 | Anthropic | 49.7 | $5.00 | $25.00 | 67% price cut from prior Opus ($15/$75). Highest-IQ Opus at the time. |
| Gemini 3.1 Pro Preview | 57.2 | $2.00 | $12.00 | New model. Highest IQ of any model in our database at launch. $2/$12 undercuts both Opus and GPT-5. | |
| MiMo-V2-Flash | Xiaomi | 41.5 | $0.10 | $0.30 | New model. Cheapest frontier-tier model (IQ 40+) at $0.0008/request. |
February was the biggest shake-up of 2026 so far. Anthropic's Opus went from being the most expensive frontier model (at $15/$75) to a mid-tier price point. Google dropped the highest-IQ model in our database at a price that undercuts both Opus and GPT-5. And Xiaomi set a new floor for frontier intelligence at a tenth of a cent per request.
The cumulative effect
Each vendor's best value frontier model as of February 2026, ranked by cost per intelligence point. This table shows the result of all pricing moves combined.
| Vendor | Best value model | IQ | $/1M in | $/1M out | $/IQ point |
|---|---|---|---|---|---|
| Xiaomi | MiMo-V2-Flash | 41.5 | $0.10 | $0.30 | $0.000019 |
| DeepSeek | DeepSeek V3.2 (Reasoning) | 41.7 | $0.28 | $0.42 | $0.000044 |
| MiniMax | MiniMax-M2.5 | 41.9 | $0.30 | $1.20 | $0.000064 |
| OpenAI | GPT-5 mini (high) | 41.2 | $0.25 | $2.00 | $0.000079 |
| Alibaba | Qwen3.5 27B (Reasoning) | 42.1 | $0.30 | $2.40 | $0.000093 |
| Z AI | GLM-4.7 (Reasoning) | 42.1 | $0.55 | $2.15 | $0.000116 |
| Gemini 3 Flash Preview | 46.4 | $0.50 | $3.00 | $0.000119 | |
| Kimi | Kimi K2.5 (Reasoning) | 46.8 | $0.60 | $3.00 | $0.000128 |
| Anthropic | Claude Sonnet 4.6 (Adaptive) | 51.7 | $3.00 | $15.00 | $0.000580 |
| xAI | Grok 4 | 41.5 | $3.00 | $15.00 | $0.000723 |
$/IQ point = cost per request (5K in + 1K out) / Intelligence Index. Only frontier models (IQ 40+). One model per vendor, best value selected.
The top three value leaders are all Chinese companies: Xiaomi, DeepSeek, and MiniMax. The gap between the best (Xiaomi at $0.000019/IQ) and worst (xAI at $0.000723/IQ) is 38x. Even within Western vendors, there's a 7x gap between OpenAI's best value ($0.000079) and Anthropic's ($0.000580).
What the Opus cut means in dollars
The Opus 67% price reduction was the single largest price cut from a Western vendor in 2026. Here's what it looks like at production scale.
| Old Opus ($15/$75) | New Opus ($5/$25) | Savings | |
|---|---|---|---|
| Cost per request | $0.1500 | $0.0500 | $0.1000 |
| 1,000 requests/day | $4,500/mo | $1,500/mo | $3,000/mo |
| 10,000 requests/day | $45,000/mo | $15,000/mo | $30,000/mo |
| 100,000 requests/day | $450,000/mo | $150,000/mo | $300,000/mo |
5K input + 1K output tokens per request. 30-day month. "Old Opus" = Claude 3/4/4.1 Opus at $15/$75 per 1M. "New Opus" = Claude 4.5/4.6 Opus at $5/$25 per 1M.
At 10,000 requests per day, the Opus generation upgrade saves $30,000 per month. That's $360,000 per year — just from being on the latest version of the same vendor's model.
But the savings get more dramatic if you compare across vendors. MiMo-V2-Flash (IQ 41.5) at the same 10K daily volume costs $240/month. That's $44,760 less than new Opus and $14,760 less than even MiMo's cost is a fraction of any Western frontier model.
What this means in production
Price cuts only save you money if you actually switch. And switching has costs:
- Testing. You need to validate that the new model handles your specific workload without quality regressions. Benchmarks measure general capability, not your use case.
- Migration effort. Different models have different API quirks, function calling behaviors, and output formats. Budget engineering time for the transition.
- Monitoring. After switching, you need to track quality metrics to catch regressions that benchmarks didn't predict.
The teams that benefit most from price cuts are the ones that already track their costs per model and per customer. When a new pricing option appears, they can calculate the savings against their actual usage — not a hypothetical. The MarginDash cost simulator does exactly this: reprice your real token usage against any model in the database. Sign up free to try it with your own data.
The 2026 AI pricing war in context
The AI pricing war in 2026 isn't a single event — it's a pattern of compounding pressure from three directions at once.
Chinese vendors set the floor. Xiaomi, DeepSeek, and MiniMax now offer frontier-tier intelligence (IQ 40+) for under $0.003 per request. These prices aren't promotional — they reflect structural cost advantages from efficient architectures (mixture-of-experts, distillation) and aggressive go-to-market strategies. Western vendors can't ignore a 30–225x price gap on comparable benchmarks.
Incumbents respond with generational leaps. Instead of cutting list prices on existing models, Anthropic shipped Claude 4.5 Opus at $5/$25 — a 67% cut from the previous generation. This is both a direct price cut and a quality upgrade: you get more intelligence for fewer dollars.
Google pushes from the top. Gemini 3.1 Pro at IQ 57.2 and $2/$12 is both the smartest and one of the cheaper frontier models. It puts pressure on Anthropic and OpenAI from above: why pay more for lower IQ when $2/$12 gets you IQ 57.2?
The net effect: the cost of frontier intelligence is dropping fast, but it's happening through new model launches rather than retroactive price cuts. If you're not evaluating new models as they launch, you're overpaying by default. We track every model launch and pricing change in our database of 412 models — see The Cheapest AI Models That Are Actually Good for the full ranking across all quality tiers.
Methodology
All pricing from the MarginDash model database: 412 models across 43 vendors, synced daily. Intelligence Index scores from Artificial Analysis. Cost per request calculated at 5,000 input tokens + 1,000 output tokens. List prices only — batch, cached-input, and volume discounts not included.
See our methodology page for the full breakdown of data sources and calculation methods.
Last updated: February 15, 2026