Emit Jane Luma Foundation Lab Method

Last updated: 28 May 2026

Apply Luma's foundation lab methodology to design AI companies, products, and research loops that are jointly optimized end-to-end — so that product and research compound each other rather than compete.

// TL;DR

The Emit Jane Luma Foundation Lab Method is a framework for designing AI companies where product and research are one unified system — not separate functions. It teaches you to jointly optimize model training with real customer usage, build the thinnest possible product on top of base model capability, capture process data (not just artifacts), target professions instead of verticals, and evaluate scaling bets logarithmically. Use it when designing an AI company's research-product strategy, deciding between narrow vertical tools vs. generalist systems, or planning how to couple model training with real-world deployment data to create compounding flywheel effects.

Framework

// When should I use the Luma Foundation Lab Method?

Use this skill when designing or evaluating an AI company's research-product strategy, deciding whether to build a narrow vertical product vs. a generalist system, or planning how to couple model training with real customer usage data.

// What inputs do I need to apply the Foundation Lab Method?

Company or project descriptionrequired
What the company or project is building — the core modality, problem domain, or product area (e.g., visual AI, coding agents, robotics).
Current stagerequired
Where the team is today: pre-product, early product, scaling, or enterprise deployment.
Target professions or customer segmentsrequired
Which professions or types of people the product serves — framed as professions, not verticals (e.g., filmmakers, marketers, game artists, not 'entertainment industry').
Known model capability gaps
What the current model cannot do that the product needs — the delta between model capability and product promise.
Data availability
What proprietary or scarce data exists or could be collected that the internet cannot supply (e.g., process data, how artifacts were made, not just the artifacts themselves).

// What are the core principles of the Foundation Lab Method?

Foundation Lab: No Product or Research — Only One Thing

In a foundation lab, there is no product or research as separate functions. Research produces the product and the product works in research. The secret to building a great company in this space is to treat them as one unified system. Foundation labs are the blueprint of companies of the future.

End-to-End Optimization as Prime Directive

The only way AI systems will do meaningful things in the world is through joint end-to-end optimization — top to bottom. Never optimize a narrow sub-problem in isolation for months; if the model doesn't do it today, that is a data collection job for the next training run, not an engineering harness problem.

Promise of AI Is Not Spot Work

The promise of AI is not doing a little bit of spot work. 'I can make copy' or 'I can make a little image for you' is not the goal. The goal is the full end-to-end solution: the book, the campaign, the film production — not a fragment of it. Customers want end-to-end solutions to their problems, not promises of solutions.

Think in Professions, Not Verticals

Do not think about verticals when designing products. Think about professions — which kinds of people you can help. Verticals are abstractions; professions are humans with specific workflows and end-to-end problems that generalist systems must solve.

Thin Stack on Top of Base Model Capability

Build the thinnest possible product on top of base model capability. If the product ends up being 'a little bit fat,' the next model's job is to reduce that fatness. Avoid spaghetti harnesses and complex workaround systems — they are six-month dead ends that the next model iteration makes irrelevant.

Data Flywheel via Deployed Agents

The internet gives you artifacts but not the process of how artifacts were made. To train agents that do end-to-end work, you need process data — the actions, iterations, and decisions that produced the final output. Deploy agents to real customers, observe the best creatives using them, and feed that intelligence directly back into model training.

Multimodal AGI as North Star

The guiding principle is one multimodal AGI. Every product decision, every research bet, and every data collection strategy should be evaluated against whether it moves you toward a single tower that jointly models language, audio, video, images, and physical context — not separate towers per modality.

Distribution and Data Before Model

If you don't think about distribution you are dead in ML. If you don't think about data you are dead. Before training the target model, ask: where does the scarce data come from? If there is no YouTube of your modality, the first product must be something people love to use for free that generates that data at scale.

Think in Logarithms

When evaluating scaling bets, think in logarithms. The right question is: if the next model is 10x larger in compute and parameters, would it be a categorically different thing — not just incrementally better? If the answer is not an obvious yes, the constraint is architectural or data quality, not scale.

// How do you apply the Foundation Lab Method step by step?

1
Define the North Star as Multimodal AGI and Joint End-to-End Optimization
Before any product or research decision, state the prime directive explicitly: (1) multimodal AGI as the destination and (2) joint end-to-end optimization as the method. Every subsequent decision is evaluated against these two poles. If a product decision does not feed back into the model and if a model improvement does not make a product better, it is misaligned.
2
Identify the scarce data problem for your modality
Ask: is there a YouTube of this modality? A Wikipedia of it? If no, the first product must generate that data. Do not wait to know the exact scale needed — scaling laws for new modalities are unknown early. Release something people love to use for free that produces data at scale. Expect to not know if you need 1 million or 1 trillion examples.
3
Map current base model capability honestly against the end-to-end product promise
List what the model can and cannot do today. Flag every gap where the product currently requires an engineering harness or workaround. Each gap is not an engineering project — it is a data collection and training job. Categorize each gap: does it require a fine-tuning run, a new training run, or a full pre-training investment in compute?
4
Build the thinnest possible product stack on top of current model capability
Resist the urge to build complex orchestration systems to paper over model gaps. Build the thinnest product that delivers real value to real professions today. Fatness in the product stack is technical debt that the next model iteration must pay down. The product's job is also to generate the training signal for the next model.
5
Target professions, not verticals
Reframe every market conversation from 'which vertical' to 'which profession.' Professions have specific workflows, specific failure modes, and specific magic moments. Ask: what does end-to-end look like for this profession? A filmmaker's end-to-end is concept → shoot → edit → set changes → final output — not 'make a clip.' A marketer's end-to-end is understanding the environment → resonant message → localized assets at scale.
6
Deploy Forward Deployed Creatives (FDCs) to enterprise customers
FDCs (Forward Deployed Creatives) are not sales engineers — they serve two jobs simultaneously: (1) help customers actually deploy powerful systems into their complex organizational workflows, and (2) pipe the resulting intelligence — what works, what breaks, what data is needed — directly back to research and model training. Treat every enterprise deployment as an optimization loop, not a support ticket.
7
Capture process data, not just artifact data
The internet supplies artifacts (movies, images, code). It does not supply how those artifacts were made — the actions, iterations, and decisions. End-to-end agents require process data. Every interaction in your deployed product is a training signal. Build the product so that the path to the artifact, not just the artifact, is logged and usable for training.
8
Apply the 10x logarithmic scaling test at each model iteration
Before each major training run, ask: if this model were 10x larger in compute and parameters, would it be categorically different — or just incrementally better? If the answer is not obvious yes, the bottleneck is not scale. Diagnose whether the constraint is: (a) insufficient modality coverage (e.g., missing audio, missing language tower), (b) data quality/process data gaps, or (c) architectural limitations. Fix the real constraint before scaling.
9
Unify modalities into a single tower progressively
The shape of a world model is a single tower that jointly models language, audio, video, images, and physical context as one single signal stream. Do not build separate towers per modality — fuse them. Prioritize: language + video + audio covers approximately 90% of the path to a world model. Start with the highest-leverage fusion (language + image or language + video) and expand. Measure whether each fusion enables things that were categorically impossible before.
10
Evaluate consumer vs. enterprise deployment using the intelligence threshold test
Consumers consume; creators create. A generative product aimed at consumers is premature until the models are intelligent enough to understand context, humor, and the local state of the user. Apply the test: does the model understand why this content would be interesting to this specific person in this specific context? If no, enterprise deployment is the correct focus — businesses are responsible for 99% of pixels on screens every day and have clear end-to-end problems the model can solve now.

// What does the Foundation Lab Method look like in practice?

A startup building AI tools for architecture firms has good 3D rendering models but struggles to grow beyond individual tool usage into full workflow adoption.

Apply the 'Promise of AI is Not Spot Work' principle: the product is currently solving a spot work problem (make a render faster) rather than the end-to-end problem (go from brief → concept → full permit-ready design package). Reframe the product around the architect profession's end-to-end workflow. Identify the process data gap — the internet has finished buildings but not the decision path from brief to building. Deploy FDCs to architecture firms to capture that process data and pipe it back to model training. Build the thinnest stack that covers the full workflow loop, let the next model iteration reduce the harness.

An AI company has a strong language model and is debating whether to build a separate vision model or attempt a unified model.

Apply the Single Tower / Unified Model principle. A separate vision model creates two towers that do not jointly optimize. The unified model approach — one backbone fusing language and image tokens — enables things categorically impossible with separate towers (e.g., understanding who a character is across a long production, reasoning about visual states in code). Apply the 10x logarithmic test: would scaling the language-only model 10x produce categorically better visual reasoning? Almost certainly not — the constraint is architectural, not scale. Invest in the unified model architecture even though it is 'ridiculously hard to train' because it is the only path to end-to-end optimization.

A team is considering launching a consumer social network built around AI-generated video content.

Apply the intelligence threshold test before launch. Ask: do the models understand context, humor, and the local state of each user well enough that the generated content would be interesting to a specific person? If no, the product will have a strong day-one spike (novelty of generation) followed by rapid retention collapse — users scroll for a few days and ask 'now what?' because a generated video is not interesting because it is generated; it is interesting because of what is happening in it. Defer consumer launch until the unified model has sufficient intelligence. In the interim, focus on enterprise and professional creator deployments where end-to-end workflow value does not depend on contextual entertainment intelligence.

// What mistakes should I avoid when using the Foundation Lab Method?

Treating product and research as separate teams with separate roadmaps — in a foundation lab they are one unified system and must be jointly optimized.
Building complex engineering harnesses to paper over model capability gaps — this is a 6-to-8-month dead end; the correct response is to treat the gap as a 2-to-3-week data collection job for the next training run.
Thinking in verticals instead of professions — verticals are abstractions that obscure the actual end-to-end workflow a human needs solved.
Chasing consumer deployment before the models are intelligent enough to understand context and local user state — this produces novelty spikes followed by retention collapse, as the content is not interesting because it is generated.
Collecting only artifact data (finished outputs) and not process data (how the artifact was made) — agents that do end-to-end work require process data that the internet cannot supply.
Assuming scale alone (10x parameters, 10x compute) will fix a categorical capability gap — if the answer to 'would 10x scale make this categorically different' is not an obvious yes, the constraint is architectural or data quality, not scale.
Building separate modality towers instead of a unified single tower — separate towers cannot jointly optimize and prevent the model from developing true physical world understanding.
Solving spot work problems and calling it AI transformation — the promise of AI is the full end-to-end solution (the book, the campaign, the film), not a faster fragment of the workflow.

// What are the key terms and concepts in the Foundation Lab Method?

Foundation Lab: A company architecture in which product and research are not separate functions but one unified system. Research produces the product; the product works in research. Foundation labs are described as 'the blueprint of companies of the future' because their economics are driven by compute and research, not by individual software products, enabling new products to be launched at approximately 1% of the balance sheet.
End-to-End Optimization: The prime directive and guiding methodology: joint top-to-bottom optimization across the full stack — from base model training through product deployment and back. The only way AI systems will do meaningful things in the world. The opposite of optimizing a narrow sub-problem in isolation.
World Model: A model that has understanding of the physical world and is able to simulate it. Not defined by real-time speed or autoregressive architecture. Defined by understanding laws of physics, causality, time, and human language — all as one single signal stream. The shape of a world model is a single tower jointly modeling language, audio, video, images, and physical context.
Unified Model: A single-backbone model with a language tower and one or more modality towers (image, video, audio) fused into one single thing, jointly trained on both language tokens and continuous signal tokens. The unified model is the architectural path to a world model. Unified models enable things categorically impossible with separate modality towers.
Single Tower: The architectural ideal for a world model: one model that processes language, audio, video, images, and physical context as one single signal stream without separate towers per modality. Analogous to the human brain operating across all modalities without separate systems.
FDC (Forward Deployed Creative): A Luma-invented role analogous to Palantir's forward deployed engineers, but for creative and visual domains. FDCs serve two simultaneous functions: (1) help enterprise customers deploy powerful AI systems into their complex organizational workflows, and (2) pipe intelligence from real customer usage directly back to model research and training pipelines.
Process Data: Training data that captures how an artifact was made — the actions, iterations, and decisions in the path to a final output — as opposed to artifact data (the finished output itself). Process data is what the internet cannot supply and what end-to-end agents require to learn to do end-to-end work.
Promise of AI Is Not Spot Work: A core principle stating that AI's value is not in doing fragments of workflows faster ('I can make copy,' 'I can make a little image') but in delivering full end-to-end solutions — the book, the campaign, the film production. Spot work solutions will be commoditized; end-to-end solutions are the durable value.
Think in Logarithms: A scaling evaluation heuristic: when assessing the next major model investment, ask whether a 10x increase in compute or parameters would produce a categorically different model — not just an incremental improvement. If the answer is not obvious yes, the constraint is not scale but architecture, data quality, or missing modality coverage.
Thin Stack: The product architecture principle of building the thinnest possible product layer on top of base model capability. Fatness in the stack represents problems the model cannot yet solve natively. The next model's job is to reduce that fatness. Thick stacks built around model gaps become irrelevant with each new training run.
Intelligence Threshold Test: The test for whether a consumer generative product is viable: does the model understand context, humor, and the local state of the specific user well enough that the output would be genuinely interesting to that person? Below this threshold, consumer generative networks produce novelty spikes followed by retention collapse.

// FREQUENTLY ASKED QUESTIONS

What is the Luma Foundation Lab Method?

The Luma Foundation Lab Method is a framework for building AI companies where product and research are not separate teams but one unified system. Research produces the product, and the product feeds data back into research. It emphasizes end-to-end optimization, targeting professions rather than verticals, building thin product stacks on top of base model capability, and capturing process data — how artifacts are made, not just the finished outputs — to train next-generation models.

What is a foundation lab in AI?

A foundation lab is a company architecture where product and research are fused into a single function. Research directly produces the product, and the product serves as the research platform by generating training data and optimization signals. Foundation labs are described as the blueprint for future AI companies because their economics are driven by compute and research rather than individual software products, enabling new products to be launched at roughly 1% of the balance sheet.

How do I apply the foundation lab method to my AI startup?

Start by defining your north star as multimodal AGI with joint end-to-end optimization. Identify the scarce data problem for your modality — if there is no YouTube-scale dataset, your first product must generate that data. Map your model's current capability gaps honestly, then build the thinnest possible product stack on top of what the model can do today. Deploy to real professionals, capture process data from their usage, and feed it directly back into model training.

How do I decide between building a narrow AI tool vs. a generalist AI system?

Apply the 'Promise of AI Is Not Spot Work' principle. Narrow tools that do fragments of a workflow — making a quick image, generating copy — will be commoditized. The durable value is in full end-to-end solutions for specific professions. Build a generalist system that covers the complete workflow (e.g., brief to final deliverable for an architect) rather than optimizing one isolated subtask. The thin-stack approach lets you start narrow but architect for generalist expansion.

How does the Luma Foundation Lab Method compare to traditional AI product development?

Traditional AI product development treats research and product as separate teams with separate roadmaps — researchers build models, product teams build features on top. The Foundation Lab Method eliminates this separation entirely. Product decisions must feed back into model training, and model improvements must directly improve the product. Traditional approaches also tend to build thick engineering harnesses around model gaps, which the Foundation Lab Method treats as six-month dead ends that the next model iteration makes irrelevant.

When should I use the foundation lab method instead of standard product-market fit approaches?

Use it when your core product value is generated by an AI model rather than by traditional software engineering. If your competitive advantage depends on model capability improving over time, the foundation lab method ensures your product deployment compounds your research advantage. Standard product-market fit approaches work for static software; the foundation lab method is designed for products where the model is the product and customer usage data is the moat.

What is process data and why does it matter for AI training?

Process data captures how an artifact was made — the actions, iterations, decisions, and revisions in the path to a final output — as opposed to artifact data, which is just the finished product. The internet supplies vast amounts of artifact data (images, films, code) but almost no process data. End-to-end AI agents require process data to learn workflows. Deploying products to real professionals and logging their creation paths is the primary way to collect it.

What results can I expect from applying the foundation lab method?

Product and research begin compounding each other instead of competing for resources. Each customer deployment generates training data that improves the next model, which in turn makes the product better, attracting more customers. You avoid six-month dead-end engineering harnesses. Your product roadmap and research roadmap become one document. Over multiple training cycles, the model absorbs product complexity, making the stack thinner and the product more capable with less engineering overhead.

What does 'think in logarithms' mean for AI scaling decisions?

It means evaluating each scaling bet by asking: if the next model were 10x larger in compute and parameters, would it be a categorically different thing — not just incrementally better? If the answer is not an obvious yes, the bottleneck is not scale. It is likely architectural limitations, missing modality coverage, or data quality gaps. Fix the real constraint before spending on scale. This prevents wasting massive compute budgets on problems that scale alone cannot solve.

What is a Forward Deployed Creative and how is it different from a sales engineer?

A Forward Deployed Creative (FDC) is a Luma-invented role that serves two simultaneous functions: helping enterprise customers deploy AI systems into their complex workflows, and piping intelligence from real customer usage directly back to research and model training. Unlike sales engineers who focus on implementation and support, FDCs treat every enterprise deployment as an optimization loop. They observe how the best professionals use the product and feed that signal into the next training run.

// GET THIS SKILL — FREE