Usage-based pricing is the default recommendation for AI products. The default is wrong.
Cursor switched its Pro plan to token-based billing in June 2025 and apologized within three weeks. Users who budgeted $20/month saw invoices hit $60 to $100. One five-person team spent $4,600 in six weeks. Snowflake customers routinely report hidden consumption growth from cross-region data sharing, spending far more than forecasted. Notion AI launched at $10/member/month as a flat add-on, hit 65% workspace adoption within a year, and reduced churn enough to save an estimated $30M annually.
Three companies. Three pricing structures. The two that charged per unit of consumption created billing crises. The one that charged a flat rate kept its customers.
Last Updated: April 2026
The Budget Problem Nobody Talks About
Three years ago, a typical company's software budget had 14 fixed line items and one variable cloud bill. The CFO could forecast Q3 spend in an afternoon.
Today, half those tools have shifted to consumption pricing.
| Budget Line | 2023 (Fixed) | 2026 (Variable) |
|---|---|---|
| Code editor | $20/seat/mo | Per completion (Cursor) |
| Support tool | $50/seat/mo | $0.99/resolution (Intercom Fin) |
| Analytics | $500/mo flat | Per query |
| Data warehouse | $2,000/mo flat | Per compute credit (Snowflake) |
| AI assistant | $30/seat/mo | Per message |
One variable line item is manageable. Five variable line items compounding against each other is unplannable.
78% of IT leaders report unexpected charges from consumption-based AI pricing. 90% of CIOs cite cost forecasting as their top challenge in AI deployment. These are not complaints about any single vendor. They are complaints about what happens when an entire software stack becomes variable at the same time.
The company that offers predictability earns a structural advantage at renewal. Not because the product is better. Because the invoice is forecastable.
Two Ways Renewals Break
Matt Green, who studies pricing across 500+ SaaS companies at Growth Unhinged, captures the core tension: usage-based pricing is easiest to close and hardest to renew.
The sales motion is simple. "Only pay for what you use" removes objections. No shelfware risk. Low commitment. Easy first signature.
The renewal is where it breaks. It breaks in two opposite directions.
Bill shock: the product worked too well. The customer adopted enthusiastically. Usage spiked. The invoice arrived at 3x expectations. This is the Cursor pattern. Users did not blame themselves for using the product. They blamed Cursor for the surprise.
Perceived waste: the product did not get adopted. The team used it lightly. The invoice was low. That sounds fine. It is not. A low invoice tells the CFO the product is not essential. It becomes the first line item cut in the next budget review. Low usage signals low value, even when the product delivered exactly what was needed.
Green reports an executive revenue leader whose gross revenue retention improved after switching from consumption back to per-seat pricing. Not a theoretical argument. An observed result.
The Quiet Part
Usage-based pricing punishes your best customers.
The customer who uses the product most gets the highest bill. The team that drives adoption across every department gets the largest invoice. The power user who builds workflows around your tool gets charged more than the person who logs in once a month.
That is the opposite of a loyalty reward. Enterprise buyers especially hate this, because they cannot forecast costs for a product they plan to use more over time. A successful deployment means a bigger bill. A growing team means a growing invoice. The incentive structure pushes customers to limit adoption, not expand it.
The companies moving from pure usage to hybrid are not retreating from value-based pricing. They are responding to what customers actually want: predictability with value alignment. The customer wants to know what they will pay. They also want to feel that what they pay reflects what they get. Hybrid pricing gives them both. Pure usage gives them neither.
Usage Pricing Frames You as a Commodity
Vin Vashishta makes the point most pricing discussions skip entirely.
When you price per token, per API call, or per credit, you frame your product as infrastructure. Infrastructure gets compared on price per unit. The customer shops for cheaper tokens the way they shop for cheaper cloud compute. A cheaper alternative appears. The switching calculus is simple arithmetic.
An outcome gets compared on results. The customer evaluates whether the problem got solved, not how many tokens it consumed. Switching means retraining, reintegrating, and accepting the risk of worse results.
| Pricing Frame | Customer Evaluates | Switching Trigger |
|---|---|---|
| Per token / per credit | Cost per unit | Cheaper alternative appears |
| Per resolution / per outcome | Problem solved or not | Results decline |
| Plan with guardrails | Total value vs. total cost | Budget review, competitive feature gap |
The commoditization curve accelerates this. Model costs drop roughly 10x every 18 months. When you price per token, every cost reduction puts direct downward pressure on your revenue. Plan-based pricing decouples your revenue from the model cost curve. When inference gets cheaper, your margins improve. The customer sees the same bill. You capture the efficiency gain instead of passing it through.
Where the Market Actually Landed
Hybrid pricing adoption surged from 27% to 41% in 2025 and is projected to reach 61% by end of 2026. Pure seat-based pricing dropped from 21% to 15% over the same period. The market is not choosing between seats and usage. It is choosing both.
HubSpot moved its Breeze AI agents from $1.00 per conversation to $0.50 per resolved conversation in April 2026. Activity to outcomes. Intercom charges $0.99 per resolution but bundles it with per-seat helpdesk pricing. Predictable base, variable outcome layer. Notion bundled AI into its Business tier entirely in May 2025, eliminating the per-member add-on and absorbing AI cost into a single forecastable line item.
Each moved toward more predictability, not less.
What Actually Works: Plans With Usage Guardrails
The companies reporting the best retention numbers share a structure.
| Component | Purpose |
|---|---|
| Base plan ($X/month) | Predictable cost the CFO can budget |
| Included usage allowance | Covers 80-90% of customers without overage |
| Published overage rate | Clear per-unit cost above the allowance |
| Hard spending cap | Maximum monthly bill, no surprises |
Notion AI at $10/user/month with included responses achieved 90% renewal rates and 91% monthly active retention. Jasper and Fireflies.ai use flat-rate add-ons or seat-based AI plans to avoid billing anxiety. Anthropic offers team seats from $25/seat/month with usage limits per tier, keeping API-rate billing separate from the workspace product.
Cursor's backlash came from removing the plan, not from adding usage. Users accepted usage limits. They rejected unpredictable bills. The plan creates the anchor. The guardrails handle edge cases. The cap prevents bill shock.
The Cost Visibility Gap Underneath
Every argument about seats versus usage versus outcomes traces back to the same gap: most companies do not know their per-customer cost-to-serve.
Without that number, the pricing model debate is abstract. You cannot set a confident flat price without knowing your cost floor. You cannot set a confident usage rate without knowing your cost variance across customers. You cannot price on outcomes without knowing what outcomes cost to produce.
OpenAI, Anthropic, and Vercel all surface granular usage dashboards for teams to model ROI. They do this because cost visibility is the prerequisite to any pricing structure. The 78% of IT leaders reporting unexpected AI charges are not facing a pricing model problem. They are facing a cost visibility problem.
Some customer segments are wildly profitable under flat pricing. Others need guardrails. A few might justify outcome-based pricing where the outcome is measurable and consistent. You cannot know which until you see the numbers customer by customer.
The Pricing Model You Choose Determines Customer Behavior
The standard framing treats pricing as a revenue capture mechanism. Pick the model that extracts the most value. This framing is incomplete.
Pricing determines behavior. Usage-based pricing teaches customers to limit consumption. Plan-based pricing teaches customers to maximize adoption within their tier. Outcome-based pricing teaches customers to demand measurable results.
The Copilot story demonstrated what happens when a company cannot see its own cost distribution. The Cursor story demonstrated what happens when customers cannot see theirs. Both problems have the same root: insufficient visibility into what each customer costs and what each customer gets.
The companies that build cost attribution early get to design pricing around observed customer behavior. The rest get to react to churn signals. The model you choose is not just a billing decision. It is a product decision that shapes how customers use what you built.
Visibility into cost-to-serve is what lets you design the right hybrid, not guess at one.
If you are building with AI, Bear Lumen gives you the per-customer cost attribution layer that makes hybrid pricing possible. See how it works.