The hidden cost of running AI for every client

Why the license is the cheap part, and how the per-client rebuild quietly eats the margin you advertise.

Technology

By Mark Choudhari · Jun 7, 2026 · 5 min read

The license is the cheap part. The rebuild is the bill.

Made with Works

TL;DR

The agency AI tax is the per-client configuration, integration, and maintenance work that scales with every account you add and quietly consumes the reseller margin vendors advertise. The license is the cheap part. Ongoing maintenance runs 15 to 30 percent of build cost a year, and hidden run costs often match or exceed the platform fee. Pay the tax once on a configured layer, not N times across accounts.

What are the hidden and ongoing costs of running AI agents at scale
What is the agency AI tax, the real cost of running AI across every client and not just the license
What does it cost per year to maintain AI deployments across all your client accounts
Does the per-client config work scale linearly and eat your margin
Who maintains the configs, fixes breakage, and handles model updates across your accounts
What is the opportunity cost of running the AI plumbing instead of advising clients
How to stop paying the tax per account
Common questions

What are the hidden and ongoing costs of running AI agents at scale

The costs that hurt are the ones that never appear on the subscription line. Across a real agency dataset, the run cost alone is large: a 250-agency survey put median token spend at about $1,800 a month per active agent and agency-level AI spend at a median near $7,400 a month, with the bottom quartile of agencies sitting below break-even at 0.7x ROI and evaluation and testing named the top blocker at 49 percent. That is the part of the bill that scales with usage, not with the seat price.

Underneath the run cost sits the build-and-keep-alive cost, and it is bigger than the quote. One hidden-cost breakdown found that integration, monitoring, governance, and delay can roughly double the initial budget of an AI agent, with model drift running 15 to 20 percent of build a year, and a separate total-cost-of-ownership guide put the real figure at close to twice the vendor quote over the first 12 to 18 months. The license told you one number. The work around it told you another.

The true cost of an AI deployment runs to roughly twice the vendor quote over the first 12 to 18 months, once integration, monitoring, and governance are counted.
HyperSense, TCO Guide, 2026

What is the agency AI tax, the real cost of running AI across every client and not just the license

The agency AI tax is the per-client configuration and maintenance burden that scales with account count and consumes the reseller margin a vendor advertises. It is the agency instance of the AI Tax, the seven hidden costs of running AI yourself, seen across many accounts rather than inside one business. The taxonomy already exists; this is the same shape, multiplied by how many clients you serve.

The critique here is not aimed at the agency. The agency is the one absorbing the cost. The target is the per-client bespoke rebuild, the model where every engagement starts the tooling over from scratch, because that model is what turns a one-time setup into a recurring tax. An agency that assembles fresh AI tooling for each client pays discovery, integration, training, and maintenance again every time. The license is the cheap part. The tax is everything you rebuild, maintain, and re-glue for every client, every model update, every account you add.

What does it cost per year to maintain AI deployments across all your client accounts

Plan on ongoing maintenance running 15 to 30 percent of the original build cost, every year, per deployment. Independent breakdowns converge there: one development-cost analysis puts maintenance and monitoring at 15 to 30 percent of build cost annually, and a TCO guide adds a 30 to 40 percent hidden-cost buffer on top of any vendor quote for the integration and governance work that the quote leaves out.

Now multiply. A single deployment at 15 to 30 percent maintenance a year is a manageable line. Ten client deployments, each on its own bespoke setup, is ten of those lines, each with its own breakage surface and its own model-update treadmill. The per-account math is what makes the tax expensive. The work is not exotic. It is just paid again for every account, which is precisely the part the per-engagement model never priced.

Does the per-client config work scale linearly and eat your margin

It scales worse than linearly, and yes, it eats the margin. Each new account adds its own configuration, its own integrations into the apps that client already runs, and its own surface for things to break, so the admin load grows with every logo you add rather than flattening out. The reseller margin is quoted as if the cost were the license. The real cost is the per-client rebuild sitting underneath it.

This piece owns the cost. The margin number it erodes belongs to the breakdown of agency margins on Works, which traces what the advertised spread actually nets once the rebuild work is counted. The two sit together: the cost is what you carry per account, and the margin is what is left after you carry it. The decision that follows, build it per client or standardize it once, is the build, buy, or rent question for agency clients.

Who maintains the configs, fixes breakage, and handles model updates across your accounts

In most agencies, the answer is a person you cannot spare, often the principal or the most senior operator. Someone has to fix the integration that broke when an API changed, re-tune the prompts that drifted after a model update, and reconcile the config differences between Account A and Account B. That work does not stop, because the models keep changing underneath the build, and each change ripples across every account that was wired by hand.

That is the maintenance tax in its agency form. It is not a one-time project. It is a standing job nobody was hired for, distributed across the very people whose time is worth the most. The more accounts you add on the per-client model, the more of that standing job you create.

What is the opportunity cost of running the AI plumbing instead of advising clients

The opportunity cost is the advisory work that does not happen while you are maintaining infrastructure. An agency’s value is judgment, the strategy and the expertise the client is actually paying for, and every hour the principal spends keeping bespoke tooling alive is an hour not spent on that judgment. The bespoke build does not just cost money. It costs the highest-value time in the business.

The market context makes this sharper. AI prices are not falling the way buyers hoped: hyperscaler capital spending is projected to climb from roughly $410 billion in 2025 toward $650 billion in 2026, with one analyst noting that seat-based licensing is going away in favor of usage-based pricing that tracks consumption. So the run cost rises while the maintenance load stays, and the agency carrying both per account is squeezed from two directions at once.

How to stop paying the tax per account

Any honest answer to the agency AI tax has to clear one bar: it has to be paid once, not per account. A second bespoke build for the next client is just the tax again with a new logo on it. The real fix is one configured layer that runs across every account, absorbs model changes without a rebuild, and lets you add a client without re-assembling the stack.

That is what Works is built to do for an agency. The apps each client already runs stay connected through 3,000-plus integrations, and existing Make and n8n automations import in rather than getting rebuilt, so onboarding an account is configuration, not construction. New models land inside the setup you already have, auto-selected per step, so a model update is not another integration project across ten accounts, it just happens underneath. The expertise you configure once carries to the next client instead of being rebuilt for them. And at $49 a month for the Pro tier, the capability sits at a price that makes the per-client rebuild look like the cost it always was, one you were absorbing because the per-engagement model gave you no other option. Machintel runs Works across six teams without rebuilding tooling per team; an agency avoids the per-client rebuild the same way.

Stop paying client AI Tax. Get early access, or DM us for the agency economics breakdown that shows where the per-account cost actually lands.

The license was always the cheap part. The tax is everything you rebuild, maintain, and re-glue for every client, every model update, every account you add. Pay it once, not N times.

Common Questions

What is the agency AI tax?

The agency AI tax is the per-client setup and maintenance burden that every new account re-triggers on a bespoke build model: discovery, integration, training, and ongoing upkeep, paid again for each logo rather than absorbed once across all of them. The result is that per-client margin shrinks as the roster grows, because the cost that looks fixed per account is actually recurring. It is the agency version of the AI Tax, the seven hidden costs of running AI yourself, multiplied by client count.

Is the true cost of an AI deployment larger than the platform fee, and by how much?

Yes. Independent total-cost-of-ownership work puts the real figure at close to twice the vendor quote over the first 12 to 18 months once integration, monitoring, and governance are counted, with ongoing maintenance at 15 to 30 percent of build cost a year. The platform fee is the visible line; the rebuild and upkeep around it are the larger, unbilled one.

Does the per-client work scale linearly?

It scales worse than linearly. Each new account adds its own configuration surface, its own integrations, and its own breakage exposure, so the admin load compounds rather than flattening as the roster grows. The bottom quartile of agencies in a 250-agency survey were already below break-even at 0.7x ROI, and per-client maintenance is a primary reason: the tax accelerates with scale on the bespoke model.

How much of my margin does this consume?

Enough to push a bespoke-rebuild agency toward the lower end of the 13 percent average net, because maintenance runs 15 to 30 percent of build cost per deployment per year and the per-client model stacks those lines without limit. The exact margin left after the tax is accounted for lives in the agency margins on Works, which owns the net number; this article owns the cost that erodes it.

How do I stop paying it?

Standardize on one configured layer that runs across every account, absorbs model updates without a rebuild, and imports the apps the client already uses instead of reconstructing the stack per engagement. The decision between building per client and standardizing once is the build, buy, or rent question.

Sources

Keep reading:

Are You Ready to Make AI Work for You?

Simplify your AI journey with solutions that integrate seamlessly, empower your teams, and deliver real results. Jyn turns complexity into a clear path to success.

See AI for Real Business Impact in Action →

How Early AI Adoption Will Give Businesses a Strategic Edge in the Future

How Early AI Adoption Will Give Businesses a Strategic Edge in the Future