What AI model price cuts happened in 2026?

Major February 2026 price cuts include: Anthropic cut Opus pricing 67% (from $15/$75 to $5/$25 per 1M tokens) with the 4.5 generation. Google launched Gemini 3.1 Pro at $2/$12 with the highest IQ score (57.2) of any model. Xiaomi released MiMo-V2-Flash at $0.10/$0.30 per 1M — the cheapest frontier-tier model at $0.0008 per request.

Which AI provider has cut prices the most in 2026?

Anthropic made the largest single cut in 2026, reducing Opus pricing by 67% from $15/$75 to $5/$25 per 1M tokens with the Claude 4.5 generation. However, new entrants like Xiaomi (MiMo-V2-Flash at $0.10/$0.30) and continued DeepSeek pricing ($0.28/$0.42) set new baselines that make all Western vendors look expensive by comparison.

How much can I save by switching AI models after a price cut?

Savings depend on your volume and current model. At 10,000 requests per day, switching from old Opus pricing ($15/$75) to new Opus pricing ($5/$25) saves $30,000 per month. Switching from Opus to MiMo-V2-Flash at the same volume saves $44,760 per month with comparable frontier-tier benchmark scores. Use our cost simulator to calculate savings for your specific usage.

Blog · February 15, 2026

Every Major AI Price Cut (February 2026)

AI model pricing changes constantly. Vendors cut prices, launch cheaper alternatives, and shift their lineups — often without announcement. What some are calling an AI pricing war is playing out across every major provider in 2026. If you're running models in production, you need to know when the math changes.

This page tracks every major AI API pricing change in February 2026 across OpenAI, Anthropic, Google, DeepSeek, and others. We pull from our database of 412 models across 43 vendors, synced daily. All per-request costs assume 5,000 input tokens and 1,000 output tokens — a typical API call.

February 2026

The month the Opus price ceiling dropped and new budget frontiers arrived.

Model	Vendor	IQ	$/1M in	$/1M out	What changed
Claude Opus 4.5	Anthropic	49.7	$5.00	$25.00	67% price cut from prior Opus ($15/$75). Highest-IQ Opus at the time.
Gemini 3.1 Pro Preview	Google	57.2	$2.00	$12.00	New model. Highest IQ of any model in our database at launch. $2/$12 undercuts both Opus and GPT-5.
MiMo-V2-Flash	Xiaomi	41.5	$0.10	$0.30	New model. Cheapest frontier-tier model (IQ 40+) at $0.0008/request.

February was the biggest shake-up of 2026 so far. Anthropic's Opus went from being the most expensive frontier model (at $15/$75) to a mid-tier price point. Google dropped the highest-IQ model in our database at a price that undercuts both Opus and GPT-5. And Xiaomi set a new floor for frontier intelligence at a tenth of a cent per request.

The cumulative effect

Each vendor's best value frontier model as of February 2026, ranked by cost per intelligence point. This table shows the result of all pricing moves combined.

Vendor	Best value model	IQ	$/1M in	$/1M out	$/IQ point
Xiaomi	MiMo-V2-Flash	41.5	$0.10	$0.30	$0.000019
DeepSeek	DeepSeek V3.2 (Reasoning)	41.7	$0.28	$0.42	$0.000044
MiniMax	MiniMax-M2.5	41.9	$0.30	$1.20	$0.000064
OpenAI	GPT-5 mini (high)	41.2	$0.25	$2.00	$0.000079
Alibaba	Qwen3.5 27B (Reasoning)	42.1	$0.30	$2.40	$0.000093
Z AI	GLM-4.7 (Reasoning)	42.1	$0.55	$2.15	$0.000116
Google	Gemini 3 Flash Preview	46.4	$0.50	$3.00	$0.000119
Kimi	Kimi K2.5 (Reasoning)	46.8	$0.60	$3.00	$0.000128
Anthropic	Claude Sonnet 4.6 (Adaptive)	51.7	$3.00	$15.00	$0.000580
xAI	Grok 4	41.5	$3.00	$15.00	$0.000723

$/IQ point = cost per request (5K in + 1K out) / Intelligence Index. Only frontier models (IQ 40+). One model per vendor, best value selected.

The top three value leaders are all Chinese companies: Xiaomi, DeepSeek, and MiniMax. The gap between the best (Xiaomi at $0.000019/IQ) and worst (xAI at $0.000723/IQ) is 38x. Even within Western vendors, there's a 7x gap between OpenAI's best value ($0.000079) and Anthropic's ($0.000580).

What the Opus cut means in dollars

The Opus 67% price reduction was the single largest price cut from a Western vendor in 2026. Here's what it looks like at production scale.

	Old Opus ($15/$75)	New Opus ($5/$25)	Savings
Cost per request	$0.1500	$0.0500	$0.1000
1,000 requests/day	$4,500/mo	$1,500/mo	$3,000/mo
10,000 requests/day	$45,000/mo	$15,000/mo	$30,000/mo
100,000 requests/day	$450,000/mo	$150,000/mo	$300,000/mo

5K input + 1K output tokens per request. 30-day month. "Old Opus" = Claude 3/4/4.1 Opus at $15/$75 per 1M. "New Opus" = Claude 4.5/4.6 Opus at $5/$25 per 1M.

At 10,000 requests per day, the Opus generation upgrade saves $30,000 per month. That's $360,000 per year — just from being on the latest version of the same vendor's model.

But the savings get more dramatic if you compare across vendors. MiMo-V2-Flash (IQ 41.5) at the same 10K daily volume costs $240/month. That's $44,760 less than new Opus and $14,760 less than even MiMo's cost is a fraction of any Western frontier model.

What this means in production

Price cuts only save you money if you actually switch. And switching has costs:

Testing. You need to validate that the new model handles your specific workload without quality regressions. Benchmarks measure general capability, not your use case.
Migration effort. Different models have different API quirks, function calling behaviors, and output formats. Budget engineering time for the transition.
Monitoring. After switching, you need to track quality metrics to catch regressions that benchmarks didn't predict.

The teams that benefit most from price cuts are the ones that already track their costs per model and per customer. When a new pricing option appears, they can calculate the savings against their actual usage — not a hypothetical. The MarginDash cost simulator does exactly this: reprice your real token usage against any model in the database. Sign up free to try it with your own data.

The 2026 AI pricing war in context

The AI pricing war in 2026 isn't a single event — it's a pattern of compounding pressure from three directions at once.

Chinese vendors set the floor. Xiaomi, DeepSeek, and MiniMax now offer frontier-tier intelligence (IQ 40+) for under $0.003 per request. These prices aren't promotional — they reflect structural cost advantages from efficient architectures (mixture-of-experts, distillation) and aggressive go-to-market strategies. Western vendors can't ignore a 30–225x price gap on comparable benchmarks.

Incumbents respond with generational leaps. Instead of cutting list prices on existing models, Anthropic shipped Claude 4.5 Opus at $5/$25 — a 67% cut from the previous generation. This is both a direct price cut and a quality upgrade: you get more intelligence for fewer dollars.

Google pushes from the top. Gemini 3.1 Pro at IQ 57.2 and $2/$12 is both the smartest and one of the cheaper frontier models. It puts pressure on Anthropic and OpenAI from above: why pay more for lower IQ when $2/$12 gets you IQ 57.2?

The net effect: the cost of frontier intelligence is dropping fast, but it's happening through new model launches rather than retroactive price cuts. If you're not evaluating new models as they launch, you're overpaying by default. We track every model launch and pricing change in our database of 412 models — see The Cheapest AI Models That Are Actually Good for the full ranking across all quality tiers.

Methodology

All pricing from the MarginDash model database: 412 models across 43 vendors, synced daily. Intelligence Index scores from Artificial Analysis. Cost per request calculated at 5,000 input tokens + 1,000 output tokens. List prices only — batch, cached-input, and volume discounts not included.

See our methodology page for the full breakdown of data sources and calculation methods.

Last updated: February 15, 2026