📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent Google whitepaper reveals that in AI-assisted development, the model accounts for only 10% of system behavior. The focus should be on harness design and context engineering, which drive performance and costs.

A new whitepaper from Google, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that in AI-assisted software development, the model accounts for only about 10% of system behavior. This challenges common assumptions that upgrading models alone will significantly improve performance. Instead, the paper emphasizes that the harness and context engineering are the primary determinants of success and cost-efficiency in AI development, making this a crucial shift in industry focus.

The whitepaper introduces a spectrum of AI coding workflows, from casual vibe coding to disciplined agentic engineering, which involves rigorous verification, testing, and oversight. It notes that most failures in AI agents are configuration errors—missing tools, vague rules, or noisy context—rather than model deficiencies. For example, experiments with the same model showed that changing the harness—prompts, tools, and middleware—can dramatically improve performance, with some teams moving from outside the top 30 to the top 5 in benchmark tests.

The authors argue that the economic and strategic value lies in the harness and context management, not the model itself. They highlight that the cost of AI development is driven by token economy, where ad-hoc prompting appears cheap but incurs high operating and maintenance costs over time. Conversely, investing in structured schemas, evaluation, and context engineering can reduce long-term costs and improve reliability, shifting the industry’s focus from model upgrades to system design.

At a glance

reportWhen: published March 2026

The developmentThe Google whitepaper highlights that the core of AI-driven software development is not the model itself but the surrounding harness and context management, shifting industry focus.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.

The idea worth building your strategy around

Agent = Model + Harness

~10%

HARNESS — prompts · tools · context · hooks · sandboxes · observability

MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.

85%

of devs use AI coding agents (51% daily)

41%

of all new code is AI-generated

~90%

of agent behavior is the harness, not the model

+19%

longer on some tasks (METR) — verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Implications for AI Development Strategies

This shift suggests that organizations should prioritize building robust harnesses and effective context management rather than solely chasing the latest model improvements. Since the harness accounts for roughly 90% of system behavior, mastery in configuration, context engineering, and verification becomes the competitive advantage. This approach can lead to lower costs, higher reliability, and faster iteration cycles, fundamentally changing how AI projects are planned and executed.

WordPress Power Toolkit: Harness AI to build next-level websites

As an affiliate, we earn on qualifying purchases.

Background on AI Coding and Industry Trends

As of early 2026, AI coding agents are widely adopted, with 85% of developers using them regularly and over 41% generating most new code with AI. Previous focus centered on acquiring the most advanced models, but recent experiments and benchmarks reveal that performance improvements are often achieved through better system configuration rather than model upgrades. The whitepaper builds on this trend, emphasizing the importance of system design over raw model power.

“The biggest shift in software engineering isn’t a new language or framework; it’s moving from writing code to expressing intent and trusting machines to execute it.”
— Addy Osmani

Context Engineering for Claude: A Practitioner's Guide to CLAUDE.md, Memory Tools, and Three-Layer Workflows for Solopreneurs, Freelancers, and Product Managers

As an affiliate, we earn on qualifying purchases.

Unclear Aspects of Model Versus Harness Impact

It remains unclear how the relative importance of harness and context engineering will evolve as models continue to improve. The exact cost-benefit balance between investing in model upgrades versus system configuration is still being studied, and industry practices may vary based on application and scale.

AI Engineering: Building Applications with Foundation Models

As an affiliate, we earn on qualifying purchases.

Next Steps for AI Development and Industry Adoption

Organizations are expected to reevaluate their AI development strategies, investing more in system design, context engineering, and verification processes. Future research and benchmarking will likely focus on quantifying the impact of harness improvements and establishing best practices for scalable, cost-effective AI system deployment.

Supply Chain Software Security: AI, IoT, and Application Security

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system behavior?

The whitepaper’s experiments show that most of the AI system’s behavior depends on how the model is integrated, configured, and guided through prompts, tools, and context management—collectively called the harness.

Should I stop upgrading models and focus on system design?

While model improvements are valuable, the whitepaper suggests that investing in harness design, context engineering, and verification yields greater performance and cost benefits in the long run.

What is the main economic implication of this shift?

Ad-hoc prompting appears cheap initially but incurs high ongoing costs, whereas disciplined system design reduces token waste, maintenance, and security risks, leading to more sustainable AI development.

How does this change AI project management?

Teams should prioritize building robust, configurable systems with clear context and guardrails, rather than relying solely on the latest models or quick prompts.

Source: ThorstenMeyerAI.com

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

The Local-First Agentic Operator

Author

EarnQA Team

The model is only 10%

Implications for AI Development Strategies

WordPress Power Toolkit: Harness AI to build next-level websites

Background on AI Coding and Industry Trends

Context Engineering for Claude: A Practitioner's Guide to CLAUDE.md, Memory Tools, and Three-Layer Workflows for Solopreneurs, Freelancers, and Product Managers

Unclear Aspects of Model Versus Harness Impact

AI Engineering: Building Applications with Foundation Models

Next Steps for AI Development and Industry Adoption

Supply Chain Software Security: AI, IoT, and Application Security

Key Questions

Why is the model only 10% of the system behavior?

Should I stop upgrading models and focus on system design?

What is the main economic implication of this shift?

How does this change AI project management?

The Six Chokepoints: How AI Stopped Being a Utility and Became a Lever

The European Union: Rules First, Cushion Always

Cloud’s Hidden Memory Bill

World Model Readiness: Are You Ready for AI That Acts?

Discover The Potential Of Minimal C++ In Tech Signal Monitoring

6 Best Video Conferencing Bar for Small Conference Rooms in 2026

Europe’s AI Sovereignty Journey: The $14 Billion Mistral Initiative

Who Were The Traditional Document Handlers Before Automation?

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Author

EarnQA Team

The model is only 10%

Implications for AI Development Strategies

WordPress Power Toolkit: Harness AI to build next-level websites

Background on AI Coding and Industry Trends

Context Engineering for Claude: A Practitioner's Guide to CLAUDE.md, Memory Tools, and Three-Layer Workflows for Solopreneurs, Freelancers, and Product Managers

Unclear Aspects of Model Versus Harness Impact

AI Engineering: Building Applications with Foundation Models

Next Steps for AI Development and Industry Adoption

Supply Chain Software Security: AI, IoT, and Application Security

Key Questions

Why is the model only 10% of the system behavior?

Should I stop upgrading models and focus on system design?

What is the main economic implication of this shift?

How does this change AI project management?

You May Also Like