Forecast AI Usage With Customer Cohorts, Not One Average
A practical cohort method for forecasting AI usage as customers activate, mature, automate workflows and change behavior over time.
A forecast built from total customers multiplied by average usage is attractive because it fits on one line. It is also fragile. New customers explore, established customers settle into routines, and a few successful accounts connect automations that run far more often than any person clicks.
Cohorts make those changes visible without turning the forecast into a complicated simulation.
Group customers by behavior you can observe
Start with a small set: trial, newly paid, established and high-volume. The labels matter less than having groups with meaningfully different activity. Measure actions per account, tokens per action, model mix and failure rate for each.
Do not create so many segments that every account becomes its own category. Three to five groups are usually enough to expose the economics and remain understandable to the people making pricing and capacity decisions.
Let customers move through the model
A June signup should not be treated like an account that has used the product for a year. Estimate how many customers move from trial to paid, how many become established and how many leave. Then apply the usage profile for the stage they occupy that month.
This makes growth costs arrive at the right time. A promotion may create many low-cost trials now and a smaller wave of heavier paid usage later. One average tends to either overstate the first month or understate the later ones.
Keep automation as its own behavior
API access, scheduled jobs and integrations change the shape of usage. A person has natural pauses; an automation can send work overnight and on weekends. Split automated accounts from interactive ones even if they pay for the same plan.
Look for changes in requests per day and concurrency as well as monthly totals. The invoice may be manageable while burst traffic creates capacity or rate-limit problems.
Forecast ranges, not a ceremonial number
Build low, expected and high cases by changing a few important assumptions: conversion, retention, actions per active account and share of high-volume customers. Keep token size and model routing assumptions visible too. A product change that adds richer context can increase cost even when customer behavior stays flat.
A range is useful only when it leads to decisions. Note which infrastructure or spending threshold each case reaches and what the team would change.
Reconcile every month
Compare forecast and actual usage by cohort. If the total was wrong, identify whether customer count, behavior, model mix or cost per action caused the miss. Update that assumption rather than applying a vague correction to the entire spreadsheet.
Over time, the forecast becomes a compact story of how customers adopt the product. It can show whether people are finding more value, whether a free tier attracts the wrong workload and whether the pricing model still matches the cost it creates. That is much more useful than being able to say last month's average was technically correct.