Skip to main content
Back to Blog
guides14 min read

How to Price AI Products: A Data-Driven Framework

A practical framework for pricing AI products using real cost-to-serve data. Covers unit economics, pricing models, margin analysis, and iteration strategies for AI startups.

BLT

Bear Lumen Team

Research

#ai-pricing#pricing-strategy#unit-economics#cost-to-serve#margins

Most pricing frameworks start with the wrong question.

They ask what customers will pay, then survey competitors, model willingness-to-pay, and anchor to whatever the market seems to tolerate. The price goes live, and months later the margin report arrives. Some customers are profitable, some cost more to serve than they pay, and nobody knows which are which until the aggregate numbers turn red.

The better starting question is simpler: what does it cost to serve each customer?

Intercom knew this when it priced Fin at $0.99 per resolved ticket. That number wasn't pulled from competitor analysis. It came from inference cost data, $0.15 to $0.85 per resolution depending on complexity, with enough volume for the law of large numbers to stabilize margin. Cognition's Devin charges $20/month plus per-compute-unit billing, not because $20 is what the market will bear, but because the cost of autonomous coding sessions varies by 100x depending on complexity, and flat pricing would bleed on power users.

GitHub Copilot did the opposite. Microsoft set $10/month based on market positioning, the cost to serve turned out to be roughly $30/month, and that gap went unmeasured for two years across 4.7 million subscribers.

The difference between these outcomes isn't pricing talent. It's whether cost-to-serve data existed before the price was set.


Why AI Pricing Is Structurally Different

Traditional SaaS has near-zero marginal cost per user. Add a customer and the infrastructure bill barely moves, which made it possible to price on value, competitive positioning, or gut feel and still hold 70-90% gross margins. The buffer forgave almost any pricing mistake.

AI products break that assumption. Every interaction triggers inference, every inference costs money, and two customers on the same plan can have 10-100x cost differences depending on usage patterns. A chat feature and a document analysis feature have entirely different cost profiles. A "free" feature nobody asked for might be quietly burning 40% of your compute.

ICONIQ Growth reports that scaling AI B2B companies averaged 52% gross margins in January 2026, up from 41% in 2024. Bessemer Venture Partners documents AI gross margins at 50-60%, against 70-90% for mature SaaS. For every $1M in AI product revenue, roughly $230,000 goes to inference alone, before engineering, sales, or support see a dollar.

DimensionTraditional SaaSAI-First SaaS
Gross margin70-90%20-60% (avg 52%)
Marginal cost per userNear zeroVariable, per-request
Cost predictabilityHigh (fixed infra)Low (model-dependent)
Cost variance across customersMinimal10-100x range
Pricing model convergenceSettled (per-seat)Unsettled (7+ models)

The implication is blunt: pricing an AI product requires cost-to-serve data at the customer level. Without it, margin calculations are fiction.


Step 1: Know Your Costs Before Setting a Price

Most AI companies track the monthly OpenAI or Anthropic bill. Fewer track cost per customer. Almost none track cost per feature, per workflow, or per outcome.

The unit of AI cost accounting is not the token. It's the trace: a complete workflow execution from user request to final response. A single customer interaction might embed a query, retrieve context from a vector database, call a planner model, execute tool calls, generate a response, and run safety checks. That's six cost centers, often across multiple providers, in one interaction. Token math captures maybe two of them.

Current API prices show the range. GPT-4o costs $2.50/$10 per million input/output tokens, Claude Sonnet 4.6 costs $3/$15, and Claude Opus 4.6 costs $5/$25. A workflow that chains models, uses retrieval, and includes tool calls accumulates costs across all of them. Prompt caching (90% savings) and batch processing (50% off) can pull these down dramatically, but only if the architecture was designed for them.

Before setting any price, four questions need answers.

What does it cost to serve each customer per month? Not on average. Per customer, with variance, because averages hide the customers who cost more than they pay.

Which features drive the most cost? A chat feature and a code generation feature have very different cost profiles, and knowing which costs what determines which features belong in which tier.

What does a single workflow execution cost? Not the LLM call alone, but the full trace: retrieval, tool calls, retries, safety checks.

And how does cost correlate with value delivered? Some high-cost interactions produce high value and some don't. That correlation is what determines whether outcome-based pricing is viable for your product.

If you can answer these four, you can price with confidence. If you can't, any pricing framework is built on assumptions that may be off by an order of magnitude.


Step 2: Build Unit Economics From Real Data

With cost data in hand, the core financial model becomes straightforward. Gross margin per customer equals revenue minus variable cost to serve, where variable cost includes LLM API costs across all models, embedding and retrieval infrastructure, tool call and external API costs, compute for self-hosted models, and a proportional share of vector databases and caching layers.

Bessemer sets the benchmark tiers. Fast-ramping AI "Supernovas" average about 25% gross margin early on, steadier "Shooting Stars" trend closer to 60%, and LLM-native companies holding around 65% while growing 400% year-over-year represent the current ceiling.

A healthy AI product targets 60-80% gross margins. Below 50%, pricing or cost structure needs attention. The structural floor for AI companies will likely settle at 60-65%, below the 80%+ traditional SaaS achieves.

MetricTargetWhy
Gross margin60-80%Covers fixed costs and profit
LTV/CAC3x minimumValidates acquisition economics
Payback periodUnder 12 monthsRunway sustainability
Cost varianceMeasured, not averagedTiers should reflect reality
Inference as % of revenueUnder 23%ICONIQ benchmark for scaling AI

A practical example: a customer pays $100/month, costs $10/month to serve, and cost $300 to acquire. Payback is 3.3 months, three-year gross LTV is roughly $3,240, and LTV/CAC lands at 10.8x. Healthy.

But if your heaviest users cost $80/month to serve on the same $100 plan, payback stretches to 15 months and LTV/CAC drops to 2.4x. Same product, same plan, radically different economics depending on which customer you examine. This is why per-customer cost data is essential rather than optional.


Step 3: Choose a Pricing Model That Fits Your Cost Structure

The market has not converged. A 2025 industry report found 92% of AI software companies now use mixed pricing models, and at least seven distinct approaches are in production. The right choice depends on your cost structure and how specific your use case is.

Input-Based Pricing

Tokens, API calls, compute time, storage. OpenAI and Anthropic use this model because they serve an entire market and cannot predict what gets built on their APIs.

It works when you're a horizontal platform, your customers are technical, and cost scales predictably with usage. The risk is behavioral: Clay discovered that when it introduced per-action pricing, users during onboarding chose to enrich 10 emails instead of 1,000. Not because 10 was enough, but because they were nervous about spending credits they couldn't predict. The onboarding trained users to be conservative instead of discovering value.

Outcome-Based Pricing

Resolved tickets, completed reports, qualified leads. Intercom Fin charges $0.99 per resolved conversation, 11x bills roughly $5 per qualified lead, and Chargeflow takes 25% of recovered chargeback value.

This works when the outcome is measurable and the problem is well-defined. The risk is cost variance per outcome, which can be enormous: some support tickets take 500 tokens to resolve and some take 100,000. If Intercom's average inference cost per resolution is $0.30, they keep $0.69 of gross margin. If a harder ticket requires multiple model calls and costs $0.85, margin drops to $0.14.

Sierra AI serves 40% of the Fortune 50 with outcome-based pricing and no public rates, with year-one costs reportedly reaching $200K-$350K+. Outcome pricing at enterprise scale works when volume smooths the variance. For startups without that volume, a single high-cost outlier can wreck a month's margins.

Hybrid: Subscription Base With Usage Components

Most AI products land here. A base subscription provides revenue predictability while usage components capture value as customers scale.

GitHub Copilot charges $10/month Individual, $19/user/month Business, and $39/user/month Enterprise. Microsoft Copilot for Security bills $4/hour of compute. Notion bundles AI into its $20/user/month Business tier.

Three tiers cover most products: a starter tier with low price and limited usage for low-friction entry, a growth tier with higher limits that's the obvious choice for most customers, and an enterprise tier with custom limits priced on negotiation. The design principle is to hide pricing complexity for 90% of users and let heavy users buy additional capacity after hitting their plan limit, rather than making everyone a meticulous gas-meter tracker.

ModelExampleBilling UnitWorks When
Input-basedOpenAI, AnthropicPer token/callHorizontal platform, technical buyers
Per-resolutionIntercom Fin ($0.99)Resolved ticketBinary outcomes, high volume
Per-lead11x ($5,000/mo)Qualified leadMeasurable pipeline value
HourlyCopilot for Security ($4/hr)Compute hourBursty, unpredictable workloads
Outcome %Chargeflow (25%)% of value recoveredHigh-value, measurable outcomes
Hybrid seat+usageGitHub Copilot ($19-$39/seat)Seat + overagesMost B2B SaaS products
BundledNotion ($20/seat)Feature-gated tierAI as feature, not core product

Match the Model to Your Product's Maturity

Before committing, be honest about where the product sits. Commodity products compete on price, with pure usage at the lowest unit cost; most wrapper and thin-layer AI products live here whether they admit it or not. Differentiated products compete on quality, with premium per-unit pricing justified by measurable quality differences; specialized vertical AI and products with proprietary data live here. Indispensable products compete on outcomes and can make delivery promises because their output variance is narrow enough to keep them.

Most companies believe they're differentiated. Most are commodity. Commodity products doing outcome pricing make promises they can't keep, and indispensable products on pure usage pricing leave money on the table. If you don't know which level you occupy, you probably lack the cost and outcome data to tell, and that's itself the answer: start with usage-based or hybrid pricing and measure your way toward outcome pricing as your output variance narrows.


Step 4: Price to Value, Bounded by Alternatives

Cost data is the floor. Value is the target. But the ceiling is set by alternatives, and that ceiling is moving.

If your AI replaces a $50,000/year employee, $10,000 annually is a strong value proposition regardless of delivery cost. If you prevent a $100,000 compliance violation, $10,000 is rational risk reduction. A useful rule of thumb: aim for 5-10x ROI for the customer. At 10x the purchase decision is obvious; below 3x it becomes a negotiation.

Customers compare your price to four alternatives: hiring someone to do the work manually, using a competitor, building it themselves, or doing nothing.

That third option is changing faster than most vendors have noticed. AI coding tools have dropped the cost of building software in-house significantly. Retool's 2026 report found that 35% of their customers have already replaced at least one SaaS tool with a custom build, and 78% expect to build more of their own tools in 2026.

The practical test: could your customer's team member, armed with Cursor and a weekend, replicate 80% of your core workflow? When that build option was $500K and 12 months, paying $50K/year for SaaS was obvious. When it costs $5,000-$15,000 and three months with AI-assisted development, the calculus changes.

This doesn't mean underprice. Price signals quality, and in B2B a $29/month AI product competing against $500/month solutions raises suspicion rather than excitement. But the ceiling on what you can charge is a moving target, and it's moving down. Anchor pricing to alternatives, not to what the market would theoretically pay.


Step 5: Handle Free Tiers With Cost Awareness

Free trials and freemium reduce the first purchase decision to zero. For AI products, that comes with a specific catch: you pay real, per-interaction costs for users who haven't committed to paying.

Free trials grant full access for 14-30 days and create urgency. They work best when users experience value within minutes, and one design principle helps a lot: start the trial clock when the user takes their first meaningful action, not when they create an account.

Freemium offers limited features indefinitely and works when usage reveals value over time and the cost to serve free users stays low. For AI products, "low cost to serve" is the binding constraint, so set clear boundaries: limit AI interactions per month, restrict which models or features are available, cap compute time or output length.

The math that matters: monitor cost to serve free users and free-to-paid conversion together. If you spend $5/month per free user with a 2% monthly conversion rate, you're spending $250 in free-tier costs per converted customer. That's your effective CAC from the free tier. If paid ads acquire customers at $150 and the free tier acquires them at $250 with higher engagement, the free tier may still win on LTV. If it acquires at $250 with lower engagement, it's a cash furnace dressed up as a growth engine.


Step 6: Iterate With Data, Not Intuition

Pricing isn't a launch decision. It's a continuous process. In the first year, revisit quarterly; as the product matures, semi-annual reviews are sufficient unless model costs shift sharply, which they often do.

Four signals suggest your price is too low: customers accept without negotiation, sales cycles are unusually fast, buyers describe it as "a steal," and churn is high among low-engagement users who never valued what they barely paid for.

Four signals suggest it's too high: frequent discount requests, high abandonment at the payment step, sales cycles stuck at procurement, and declining win rates despite strong product fit.

When adjusting, raise prices for new customers first and measure conversion impact before applying broadly. Grandfather existing customers or give 90+ days notice with clear communication about added value. And test packaging changes before testing price changes, because packaging often moves the needle more than the number itself.


The Foundation

Every step in this framework requires one thing: actual cost-to-serve data at the customer level. Without it, unit economics are estimates, tier boundaries are guesses, and margin calculations are fiction.

The AI pricing discourse is full of confident, mutually exclusive claims. Seat-based pricing is dead; seat-based pricing improved retention. Outcome-based pricing is the future; outcome-based pricing is a buzzword. Usage-based pricing is necessary; usage-based pricing destroys retention. Credits are a growth lever; credits are customer-hostile.

Lay those arguments side by side and the contradictions aren't subtle. Almost every disagreement traces back to the same gap: per-request, per-customer cost-to-serve data barely exists yet, so the arguments run on intuition, anecdotes, and small samples.

Truthfully, the market may be converging on hybrid pricing not because it's the best model, but because it's the model you can implement without knowing your true cost-to-serve. Companies that do have per-request cost data might arrive at entirely different, more profitable answers. They might discover that certain customer segments are wildly profitable under flat pricing while others need aggressive usage gating, or that output variance is narrow enough in specific use cases to justify outcome pricing for those segments alone.

Cost-to-serve data is what turns pricing from guesswork into architecture, and every AI decision you make shifts it. The teams that measure it get to design their pricing. The teams that don't get to react to it.

If you'd rather not assemble that dataset by hand, Bear Lumen builds it for you: cost-to-serve per customer, per feature, and per outcome, with no tagging scheme to invent and no spreadsheet reconciliation at month-end.

Share this article