The MAD Podcast with Matt Turck Watch original video

Aaron Levie Enterprise AI Diffusion Framework

Last updated: 31 May 2026

Diagnose exactly where an enterprise AI deployment will stall and prescribe the right sequencing of data, talent, cost, and change management moves to accelerate diffusion without being blindsided by the next model breakthrough.

// TL;DR

The Aaron Levie Enterprise AI Diffusion Framework is a diagnostic and sequencing tool for planning, auditing, or advising on agentic AI rollouts inside enterprises. It identifies exactly where deployments stall — across data readiness, talent gaps, token cost exposure, change management, and architecture lock-in — and prescribes the right moves to accelerate adoption. Use it when launching enterprise AI agents, evaluating startup opportunities in the bridge layer between AI capability and real-world workflows, or advising Global 2000 companies navigating the transition from chat-based copilots to full agentic deployment.

Framework

// When should you use the Aaron Levie Enterprise AI Diffusion Framework?

Use this skill whenever planning, auditing, or advising on an agentic AI rollout inside a large or mid-market enterprise. Also apply it when evaluating whether a startup opportunity exists in the bridge layer between AI capability and real-world enterprise workflow.

// What inputs do you need before applying the Enterprise AI Diffusion Framework?

Organisation typerequired
Size and sector of the enterprise (e.g. Global 2000 manufacturer, mid-market financial services firm)
Current AI maturity stagerequired
Where the org sits today: Chat Stage, early Agent Stage, or no deployment yet
Target use case(s)required
The specific knowledge-work workflow(s) targeted for agentic deployment (e.g. client onboarding, contract review, marketing campaign generation)
Existing data environmentrequired
Description of where enterprise data lives, how structured/unstructured it is, and the current state of access controls and entitlements
Available internal talent
Whether the org has internal FDEs, repositioned IT engineers, or must hire from outside
Budget model
Whether AI spend currently sits in IT budget, line-of-business budgets, or is undecided

// What are the core principles behind the Levie Enterprise AI Diffusion Framework?

Bridge Layer Imperative

Advanced AI capability does not self-deploy into enterprises. There is always a required bridge between the breakthrough and the real-world workflow, covering security, change management, data integrity, and access controls. The bridge is not a temporary workaround — it is the product.

Capability Overhang Paradox

The faster AI breakthroughs arrive, the slower enterprise diffusion actually becomes, because each breakthrough makes the previously implemented architecture obsolete before it finishes rolling out. A stable diffusion window requires a temporarily frozen capability baseline, which the market will not provide.

Coding ≠ Knowledge Work Equivalence

AI coding agents work because users are technical, models are hyper-trained on code, outputs are verifiably testable, context lives in the codebase, and access controls are clean. None of those five conditions reliably hold in general knowledge work. Do not extrapolate coding-agent productivity gains directly onto non-engineering functions.

The Data Problem as Root Cause

Most agentic failures trace back to a data problem: the agent has access to too little information to execute, too much information causing it to roam and hallucinate, or corrupted/inconsistently-defined data that produces wrong answers at scale. Fix the data layer before deploying agents.

Tokenmaxxing vs. Enterprise Cost Reality

Silicon Valley engineering culture optimises for maximum token consumption (Tokenmaxxing) while enterprise finance culture is alarmed by unpredictable token bills. The two must be reconciled through a mosaic of models — Frontier models for high-complexity tasks, cheaper or OSS models for saturated repeatable tasks — and new FinOps tooling for non-IT budgets.

IT Budget Escape

AI spend will escape the traditional IT budget ceiling of 3–7% of revenue and move into line-of-business opex. This is structurally good for AI adoption at scale but creates a governance gap because line-of-business owners have no FinOps muscle for compute budgeting.

Jevons Paradox on Jobs

Productivity gains from agents do not eliminate net jobs — they cause organisations to sign up for larger projects, unlock functions previously unaffordable, and create new roles (internal FDEs, agent managers) to capture the value the agents surface. The last-mile human-in-the-loop requirement persists across most knowledge-work domains.

Headless + Seated Dual Model

Enterprise software will not go fully headless. The future is a dual business model: per-seat pricing for end-user-interfaced work and consumption pricing for agent-driven headless volume. By database-query volume, headless will dwarf the seated model, but the GUI interface retains value for complex, nuanced human tasks.

// How do you apply the Enterprise AI Diffusion Framework step by step?

1
Locate the organisation on the Diffusion Stages map
Determine which stage applies: (A) Chat Stage — chat assistant deployed or rolling out, productivity rate-limited by human conversation pace; (B) Early Agent Stage — pilots of task-executing agents underway, architecture decisions unmade; (C) Agent Scale Stage — agents running in stateful or workflow-triggered modes across multiple functions. Most Global 2000 firms in 2025 sit at A-to-B transition. Do not prescribe Agent Stage solutions to a Chat Stage organisation.
2
Run the Coding vs. Knowledge Work Checklist on the target use case
Score the use case against five criteria: (1) Are users technical? (2) Are models well-trained on this domain? (3) Is output verifiably testable? (4) Does context live in a single accessible repository? (5) Are access controls clean and correctly scoped? The more criteria that fail, the longer the diffusion timeline and the more internal FDE investment is required before deployment.
3
Audit the Data Environment for the three agent-blocking failure modes
Check for: (a) Too Little Access — agent cannot reach the information it needs due to entitlement gaps; (b) Too Much Access — agent has broader access than appropriate, risking data leakage or wrong answers; (c) Data Integrity Gaps — inconsistent definitions, FX adjustments, unmapped ontologies, or multiple redundant stores that a centralised data-science team previously mediated manually. Flag all three. Resolve (a) and (b) via access-control remediation before any agent goes live. Flag (c) as a semantic layer / ontology project — this is a known 20-year-old problem now critical because agents democratise data access to every employee, not just the analytics team.
4
Map the Token Cost Exposure and design the Mosaic of Models
Identify which tasks require Frontier model capability (complex reasoning, novel contract language, advanced coding) versus which tasks are now saturated and repeatable (standard customer service responses, document classification). Route saturated tasks to lower-cost or OSS models with a capped token budget. For Frontier tasks, establish dedicated capacity arrangements where available to lock in pricing. Flag that a single poorly-structured agent prompt can cost the equivalent of a staff benefit in one execution — employee training on prompt cost awareness is mandatory, not optional.
5
Resolve the IT Budget Escape question with finance and line-of-business owners
Determine whether AI compute spend will remain inside IT budget (constraining to 3–7% of revenue ceiling) or migrate to line-of-business opex. If migrating, assign budget ownership to the CMO, COO, or relevant business owner for their domain. Build a lightweight FinOps function for each major line of business — they do not currently have one. Acknowledge this is an unsolved tooling gap; a centralised procurement and governance layer with decentralised spend-decision authority is the interim best practice.
6
Staff the Internal FDE function
The Internal FDE (Field Deployment Engineer) role is a highly technical person embedded in or adjacent to a business function whose job is to: understand how that function works day-to-day, wire up the correct data sources and access controls, configure agent instructions and human-in-the-loop checkpoints, and maintain the deployment as models upgrade. This is not a one-time implementation role — model upgrades continuously create new work. Source from: repositioned IT engineers, new CS-graduate hires, or pivoted software engineers. Do not conflate with the External FDE role that vendor/startup field teams play, though both are required.
7
Choose the Reference Architecture and hold the line for 12 months
At any given moment there are 10–15 viable reference architectures for a given agentic use case (managed agents, MCP-connected SaaS, workflow-native agents, lab-direct APIs, etc.). The proliferation of choices extends sales cycles and creates decision paralysis. Pick one architecture that is lab-neutral enough to swap underlying models without full rearchitecture, get the data layer right inside it, and commit for a 12-month horizon. Avoid multi-year lab contracts given the pace of innovation; favour one-year terms.
8
Design the Headless + Seated business model for any software being built or procured
If building a product or negotiating vendor terms: per-seat pricing covers end-user interfaced access; consumption pricing covers agent-driven headless volume. Evaluate whether agents in the specific use case need a stateful identity in the system (warranting a cheaper agent-seat) or are purely on-demand operations (pure consumption). By raw operation volume, headless will dominate — price and architect accordingly.
9
Apply the Jevons Paradox lens to workforce planning
Do not model AI deployment as headcount reduction first. Model it as: (a) which functions previously unaffordable can now be unlocked? (b) which projects can now be scoped larger because existing engineers/analysts have 3–10x capacity? (c) which new roles (agent managers, internal FDEs, workflow designers) does the value surfaced by agents create demand for? Only after mapping expansion scenarios should reduction scenarios be assessed, and reductions are most credible in functions already at demand saturation with humans.
10
Build the employee AI-proofing plan
Companies owe employees a real shot at upskilling as part of the social contract. The plan has two tracks: Company track — funded enablement, training, internal tooling access, and embedding technical AI capacity in each major function. Employee track — spend 5–10% of work time using tools hands-on (Codeex, co-worker, Perplexity Computer, Cursor for semi-technical users); connect tools to personal workflows; develop the 'unlimited chief of staff' mental model to identify where agents create the most leverage in your specific role.

// What does the Enterprise AI Diffusion Framework look like in real-world scenarios?

A global financial services firm wants to deploy agents for client onboarding document review. They have completed a copilot chat rollout and are seeing productivity gains in IT but struggle to justify the next step.

Run the Coding vs. Knowledge Work Checklist: users are non-technical, output is not automatically verifiable (a lawyer must sign off), context is scattered across CRM, email, and scanned PDFs, and access controls are inconsistent. Score: 1 out of 5. Prescribe a 6-month data-layer project first — consolidate document stores, clean entitlements, define a consistent schema for client records. Hire or assign an Internal FDE to sit with the onboarding team. Design the agent to surface risk areas for human review rather than produce final outputs. Use a Frontier model for complex document reasoning; cap routine status-check queries to a lower-cost model. Route token spend to line-of-business budget with a lightweight FinOps owner in operations.

A mid-market industrial manufacturer wants to know whether to build a vertical AI application for procurement workflows or buy from a startup.

Assess the Bridge Layer Imperative: does the startup provide deep integration with the manufacturer's ERP data model, domain-specific ontology for procurement terms, and change management support? If yes, the startup occupies a durable bridge-layer position that the labs are unlikely to replicate without building hundreds of vertical go-to-market teams. If the startup is a thin wrapper with no proprietary data integration, the Capability Overhang Paradox means the next model release may render it obsolete. Favour vendors with API-first, lab-neutral architectures so the underlying model can be swapped without rearchitecting. Negotiate one-year terms.

A startup founder is evaluating whether to build ERP tooling for enterprise AI compute budgeting.

The IT Budget Escape principle confirms that as AI spend migrates from IT to line-of-business opex, every major function (marketing, sales, manufacturing, legal) needs FinOps capability it currently lacks. There is no existing tooling to measure ROI per token, enforce budget caps per team, or attribute compute cost to business outcomes. This is the '$5 billion startup waiting to happen' in AI compute ERP. The wedge is helping finance teams triage token spend across decentralised business owners while giving IT centralised procurement governance.

// What are the most common mistakes when deploying enterprise AI agents?

Extrapolating coding-agent productivity gains to non-engineering knowledge work — the five structural conditions that make coding agents work (technical users, verifiable output, clean codebase context, clean access controls, model training depth) do not transfer.
Deploying agents before fixing the data layer — agents with messy access controls either bounce off entitlement walls immediately or leak data they shouldn't touch; agents pointed at inconsistently-defined data give confidently wrong answers at scale.
Treating the Internal FDE role as a one-time implementation project — model upgrades continuously create new work; this is a permanent embedded function, not a deployment consultant engagement.
Locking into multi-year architecture or lab contracts — the Capability Overhang Paradox means today's reference architecture is likely obsolete within 12–18 months; one-year terms are the current best practice.
Assuming AI spend stays within the IT budget ceiling — the 3–7% of revenue IT budget cap will be broken as agents deliver value in line-of-business functions; failing to plan for IT Budget Escape leaves organisations with no governance model when bills arrive in the CMO's budget.
Letting employees use agents without token cost awareness — a single poorly-structured prompt to an agent with broad MCP access can cost as much as a full employee benefit in one execution; cost literacy is a mandatory training component, not a nice-to-have.
Treating the absence of a fully headless future as a failure — the Headless + Seated Dual Model is the correct end state; trying to go fully headless or refusing to go headless at all are both wrong; GUI interfaces retain genuine leverage for complex human tasks.
Dismissing job-protection concerns as purely political — ignoring the social contract dimension creates real downstream risk to talent pipelines, employee adoption, and political capital needed to drive change management; companies owe employees funded upskilling as part of the deployment plan.
Picking one of 10–15 reference architectures based on the most exciting recent lab announcement — architecture decisions made under Capability Overhang pressure should prioritise lab-neutrality and data-layer correctness over frontier-model novelty.

// What key terms should you know when using the Enterprise AI Diffusion Framework?

Tokenmaxxing: The Silicon Valley engineering culture of optimising for maximum token consumption — using the most capable, highest-context Frontier models at the highest volume possible to extract maximum output. Contrasted with enterprise cost management instincts.
Capability Overhang Paradox: The counter-intuitive dynamic where the faster AI breakthroughs arrive, the slower enterprise diffusion becomes, because each breakthrough renders the previously implemented standard architecture obsolete before it finishes rolling out, removing the stable deployment environment enterprises require.
Diffusion Stages: The three-stage model of enterprise AI maturity: Chat Stage (question-and-answer assistant, productivity rate-limited by human conversation pace), Early Agent Stage (task-executing pilots, architecture undecided), and Agent Scale Stage (stateful or workflow-triggered agents running across multiple functions).
Bridge Layer: The required layer of integration, security, change management, data preparation, and workflow configuration that sits between raw AI model capability and an enterprise's real-world workflows. It is not a temporary gap — it is a durable product and commercial opportunity.
Headless Software: Enterprise software accessed entirely via API or agentic interface, with no human-facing graphical user interface involved in the transaction. In the Headless + Seated Dual Model, headless agent-driven volume will dwarf human-seated usage by raw operation count.
Headless + Seated Dual Model: The future business model for enterprise software: per-seat pricing for end-user-interfaced work by humans, plus consumption pricing for agent-driven headless operations. Both models coexist; neither fully displaces the other.
Internal FDE: Internal Field Deployment Engineer — a highly technical role embedded within a business function whose job is to understand that function's workflows, wire up data sources and access controls, configure agent instructions and human-in-the-loop checkpoints, and maintain agent deployments through model upgrades. A permanent function, not a project role.
External FDE: Field Deployment Engineer at a vendor or startup who goes on-premise or works closely with enterprise customers to make AI products work in their specific environment. A necessary and durable commercial motion, not evidence that the technology is failing.
Mosaic of Models: The enterprise practice of routing different tasks to different models based on required capability and cost: Frontier models for complex, novel, high-stakes tasks; lower-cost or OSS models for saturated, repeatable tasks where capability requirements are well-understood.
IT Budget Escape: The structural shift in which AI compute spend migrates out of the traditional IT budget (historically 3–7% of corporate revenue) and into line-of-business opex as agents deliver value in marketing, sales, legal, manufacturing, and other functions — creating a governance and FinOps gap in those business units.
Jevons Paradox (applied to AI jobs): The principle that productivity gains from agents do not eliminate net jobs because they cause organisations to sign up for larger projects, unlock previously unaffordable functions, and generate new roles to capture agent-surfaced value. Named after the 19th-century economist William Stanley Jevons, applied here to knowledge-work employment.
The Data Problem: The root cause of most agentic failures, expressed as three failure modes: Too Little Access (agent can't reach needed information), Too Much Access (agent roams into inappropriate data), and Data Integrity Gaps (inconsistent definitions that were previously mediated by a centralised data-science team and now scale incorrectly to every employee via agent access).
AI Psychosis Period: The phase a user or organisation goes through when first encountering agents — characterised by extreme enthusiasm, weekend-long build sessions, and belief that everything will be transformed immediately — followed by a grounding phase when the maintenance burden, error-catching overhead, and model-upgrade disruption become apparent.
Unlimited Chief of Staff Mental Model: A heuristic for employees learning to identify high-value agent use cases: ask 'what would I give an unlimited chief of staff to work on?' to surface tasks where agents create the most personal or organisational leverage.

// FREQUENTLY ASKED QUESTIONS

What is the Aaron Levie Enterprise AI Diffusion Framework?

It is a diagnostic framework for identifying where enterprise AI deployments stall and prescribing the correct sequencing of data remediation, talent staffing, token cost management, and change management to accelerate agentic AI diffusion. It was derived from Aaron Levie's analysis of why advanced AI capability does not self-deploy into enterprises and why a durable bridge layer of integration, security, and workflow configuration is always required.

What is the Capability Overhang Paradox in enterprise AI?

The Capability Overhang Paradox is the counter-intuitive dynamic where faster AI breakthroughs actually slow enterprise diffusion. Each new breakthrough renders the previously implemented architecture obsolete before it finishes rolling out, removing the stable deployment window enterprises need. This means enterprises should favor lab-neutral architectures and one-year contract terms rather than locking into multi-year commitments to any single model provider.

How do I use the Levie framework to plan an enterprise AI rollout?

Start by locating your organization on the Diffusion Stages map (Chat Stage, Early Agent Stage, or Agent Scale Stage). Then run the Coding vs. Knowledge Work Checklist on your target use case, audit your data environment for three blocking failure modes, map token cost exposure, resolve IT Budget Escape with finance, staff Internal FDEs, choose a lab-neutral reference architecture, and design your headless-plus-seated pricing model. The framework is sequential — skipping the data audit step is the most common cause of agentic failure.

How do I know if my organization is ready for AI agents vs. staying with copilot chat?

Score your target use case against five criteria: Are users technical? Are models well-trained on the domain? Is output verifiably testable? Does context live in a single accessible repository? Are access controls clean? If fewer than three criteria are met, your organization likely needs to remain in Chat Stage while investing in data-layer remediation and Internal FDE staffing before deploying task-executing agents.

How does the Levie Enterprise AI Diffusion Framework compare to generic AI maturity models?

Generic AI maturity models typically measure adoption across broad capability levels without diagnosing specific blockers. The Levie framework is prescriptive: it identifies the three data failure modes (too little access, too much access, data integrity gaps), explicitly addresses the Capability Overhang Paradox that generic models ignore, and includes token cost governance, IT Budget Escape planning, and the Internal FDE staffing model — none of which appear in standard maturity frameworks.

When should I use the Aaron Levie Enterprise AI Diffusion Framework?

Use it whenever planning, auditing, or advising on an agentic AI rollout inside a large or mid-market enterprise. Also apply it when evaluating whether a startup opportunity exists in the bridge layer between AI capability and real-world enterprise workflows, or when assessing vendor selection for AI-powered enterprise software. It is especially critical during the Chat-to-Agent Stage transition that most Global 2000 firms face in 2025-2026.

What results can I expect from applying the Levie Enterprise AI Diffusion Framework?

You can expect a clear diagnosis of deployment blockers before they cause expensive failures, a sequenced roadmap that avoids the most common pitfalls (deploying agents before fixing data, locking into multi-year contracts, ignoring token cost exposure), and a workforce plan that applies Jevons Paradox to avoid both over-hiring and premature headcount reduction. Organizations that follow the framework report fewer stalled pilots and faster time-to-value on agentic deployments.

What is an Internal FDE and why do I need one for enterprise AI?

An Internal FDE (Field Deployment Engineer) is a highly technical person embedded within a business function who understands that function's daily workflows, configures agent data sources and access controls, sets up human-in-the-loop checkpoints, and maintains deployments through model upgrades. This is a permanent role, not a one-time implementation project, because continuous model upgrades continuously create new maintenance and reconfiguration work.

What is tokenmaxxing and why does it matter for enterprise AI costs?

Tokenmaxxing is the Silicon Valley engineering culture of optimizing for maximum token consumption — using the most capable Frontier models at the highest volume to extract maximum output. It matters because enterprise finance teams are alarmed by unpredictable token bills. The solution is a mosaic of models: Frontier models for complex tasks and cheaper or open-source models for saturated repeatable tasks, combined with new FinOps tooling and mandatory employee training on prompt cost awareness.

What is the headless plus seated dual model for enterprise software?

The Headless + Seated Dual Model is the future pricing structure for enterprise software: per-seat pricing for human-interfaced work and consumption pricing for agent-driven headless operations that require no GUI. By raw operation volume, headless will dominate, but GUI interfaces retain genuine value for complex, nuanced human tasks. Trying to go fully headless or refusing headless entirely are both wrong — the correct end state is coexistence of both models.

// GET THIS SKILL — FREE