Unit Economics

Agent product LTV / CAC / gross margin calculation — formula adjustments and typical pitfalls when marginal cost is non-zero.

Unit economics describes the profitability structure per “unit” (user, task, contract). Traditional SaaS unit economics rests on the assumption of near-zero marginal cost — serving one more user has negligible added expense, gross margin stays at 80%+ year-round, and LTV simplifies to “monthly fee × retained months”.

Agent products break this assumption. Every inference is billed; gross margin is not a constant — it fluctuates with user behavior (task complexity, model selection, cache hit rate). Continuing to apply the SaaS formula produces severely inflated LTV.

Key formula adjustments

Traditional SaaS formula

LTV = ARPU × Gross Margin × Avg Retained Months

Gross Margin is typically taken as the company-wide constant (80%+), assumed independent of the user.

Agent product formula

LTV(per user) = Σ (revenue per task − cost per task) × avg monthly task count × avg retained months
              = gross margin per task × monthly task count × retained months

Key differences:

  • Per-task gross margin replaces fixed Gross Margin — each task has a different margin
  • Monthly task count replaces fixed ARPU — heavy and light users have vastly different unit economics
  • Heavy users are the revenue engine in traditional SaaS (essentially zero cost), but in agent products may be a loss source (more tasks → higher cost → margin compressed)

Decomposing per-task gross margin

Per-task gross margin = revenue per task − Σ per-task cost
Per-task cost = token cost + sandbox / storage cost + HITL labor cost + refund / redo cost × failure rate

Dependencies:

  • Token cost — see economics/cost-model; typical medium task $0.10-$0.30
  • Sandbox / storage — typically < 1% of total per-task cost; negligible at unit economics level
  • HITL labor cost — if tasks require human review, $0.5-$1 per minute (at support hourly rate); for workflows with high HITL intervention rate, this may exceed token cost
  • Failure redo — failure rate × (tokens already consumed + human redo time)

Unit economics across three user types

Per-user gross margin by usage segment Subscription $40/month · synthetic example · heavy users cross into negative margin 0% 98% 75% −88% Light Medium Heavy 5% of users · 5 tasks/mo 70% of users · 50 tasks/mo 25% of users · 300 tasks/mo

The same product yields drastically different gross margin structures across three user types:

User typeMonthly tasksPer-task cost (incl. HITL)Monthly costGross margin (subscription $40 / month)
Light (5%)5$0.15$0.7598%
Medium (70%)50$0.20$1075%
Heavy (25%)300$0.25$75−88%

(Synthetic example illustrating structural differences.)

Heavy users are loss-making under subscription — a pitfall unique to agent products. In traditional SaaS heavy users are “the most valuable customers”; in agent products they may be “the customers most needing to be guided to downgrade or upgrade”.

Response strategies

Four common approaches:

  1. Usage caps — hard token limits under subscription, with throttling or suspension on breach. See the tier design in economics/controls-and-roi
  2. Hybrid pricing — base subscription + per-token charge above limit. Heavy users automatically bear overage cost
  3. Price discrimination — tier by usage profile (Lite / Pro / Ultra); heavy users guided toward higher tiers
  4. Accept loss for growth — allow heavy-user losses in early stage, betting on cross-sell or network effects later

The trade-offs between strategies are the central topic of pricing.

Difference from traditional SaaS valuation

Investors typically value SaaS at an ARR multiple (10x-30x). Agent product ARR cannot be applied directly — at the same $10M ARR:

  • If gross margin is 80% (mostly light users), valuation approaches traditional SaaS
  • If gross margin is 40% (heavy users dominate), valuation moves toward “consulting / services company” levels (2-5x ARR)

Investors will drill down to unit economics to see the substance. Reporting ARR with gross-margin distribution split by user segment is far more credible than reporting a single number.

Cross-section connections

Was this page helpful?