The Model Is Only 10%: The Real Lesson of the New SDLC

📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent whitepaper from Google emphasizes that in AI-assisted software development, the model itself is only 10% of the system. The majority of performance depends on how the AI is configured and integrated, shifting focus toward verification and context engineering.

A new Google whitepaper titled The New SDLC With Vibe Coding emphasizes that the model used in AI coding agents constitutes only about 10% of the overall system behavior. The report argues that the real leverage comes from how developers configure, verify, and manage the surrounding harness, making the system’s design and context engineering the primary focus for effective AI integration.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that 85% of professional developers now use AI coding agents regularly, with 51% using them daily. It highlights that 41% of all new code is generated by AI, but stresses that the model itself is only a small part of the equation. The authors argue that the majority of failures and inefficiencies stem from misconfiguration, missing tools, and poor context management. They provide concrete examples showing that tweaking the harness—the prompts, tools, rules, and observability—can dramatically improve AI performance without changing the model. This shifts the focus from constantly chasing the latest model to optimizing the surrounding infrastructure.

The paper also introduces the concept of agentic engineering, where AI is embedded within structured specifications, automated tests, and oversight, contrasting with the more casual vibe coding approach. It emphasizes that costs are driven more by configuration and context than by the model itself, with disciplined engineering offering long-term efficiencies.

At a glance
reportWhen: published March 2026
The developmentThe publication of a Google whitepaper highlights that the core of effective AI-driven development lies in harness design and context management, not just the AI model.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Why Configuration and Context Are the New Focus

This shift in understanding matters because it redefines where development teams should invest their resources. Instead of chasing ever-smaller, more powerful models, organizations can gain more control and efficiency by improving how they set up and manage AI systems. This approach can lead to significant cost savings and more reliable software, especially as AI becomes integral to more workflows. It also underscores that the future of AI development depends more on human-led engineering than on the AI models themselves.

YAML Made Simple: A Beginner’s Guide to Configuration and Data Structuring

YAML Made Simple: A Beginner’s Guide to Configuration and Data Structuring

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background on the Shift Toward Model Configuration

The whitepaper builds on recent trends where AI-generated code now constitutes a large portion of new software, with 85% of developers using AI tools regularly. Prior to this, the focus was primarily on model capabilities and improvements. However, as models have become more advanced and accessible, the bottleneck has shifted toward how these models are integrated, controlled, and verified. The authors argue that much of the current hype around new models overlooks the importance of system configuration, verification, and context management. This perspective aligns with ongoing industry observations that most AI failures are configuration issues rather than model deficiencies.

“The model is only 10% of what determines behavior; the harness is 90%. Focus on configuration, verification, and context management.”

— Addy Osmani

LEAN PROGRAMMING FOR FORMAL SOFTWARE VERIFICATION: Mathematical proof systems and logical frameworks for verified computation

LEAN PROGRAMMING FOR FORMAL SOFTWARE VERIFICATION: Mathematical proof systems and logical frameworks for verified computation

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Uncertainties About Practical Implementation

While the whitepaper makes a compelling case, it remains unclear how quickly organizations will adopt this shift in focus. Specific best practices for harness design, context engineering, and verification are still being developed, and the relative effectiveness across different domains or project sizes has yet to be validated at scale. Additionally, the long-term impact on AI model development priorities is still uncertain, as the industry continues to invest heavily in model innovation.

AI-Powered Software Testing: Volume 1: Foundational Patterns and Principles for Architects and Technical Leads

AI-Powered Software Testing: Volume 1: Foundational Patterns and Principles for Architects and Technical Leads

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for AI Development and Adoption

Organizations are expected to begin reevaluating their AI integration strategies, emphasizing system configuration and verification. Industry leaders may develop new tools and frameworks to facilitate harness design and context engineering. Further research and case studies will likely emerge to quantify the benefits of this approach, and standards may be established to guide best practices. Monitoring how these insights influence AI tool development and operational workflows will be critical in the coming months.

AI Engineering: Building Applications with Foundation Models

AI Engineering: Building Applications with Foundation Models

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system behavior?

The whitepaper states that the behavior depends mostly on how the AI is configured, verified, and integrated. The model provides the raw capability, but the surrounding harness, prompts, tools, and oversight determine the actual output and reliability.

How can organizations improve AI performance without changing the model?

By focusing on harness design, context management, and verification processes. Tweaking prompts, adding tools, setting guardrails, and structuring the context can significantly enhance performance and reduce failures.

What are the economic implications of this shift?

Cost savings come from reducing token burn, minimizing maintenance, and avoiding vulnerabilities. Investing in configuration and verification can be more cost-effective than constantly upgrading models.

Does this mean model development is less important?

Not necessarily, but the whitepaper suggests that system design and configuration will become the primary focus for effective AI deployment, with model improvements playing a supporting role.

What practical steps should teams take now?

Teams should evaluate their current AI workflows, invest in harness development, context engineering, and verification tools, and adopt disciplined engineering practices to maximize reliability and efficiency.

Source: ThorstenMeyerAI.com

You May Also Like

CarPlay Is Additive

Automakers confirm CarPlay enhances vehicle features but is additive, raising questions about its influence on driving safety and vehicle design.

OpenEuroLLM. The third path.

European consortium OpenEuroLLM faces compute bottlenecks amid progress, highlighting limits of pan-European pooled resources in developing multilingual LLMs.

The $60 Billion Bargain: Why Cursor Could Be a Steal for SpaceX

SpaceX’s recent all-stock $60 billion purchase of AI coding firm Cursor is a strategic move, offering growth and competitive advantages amid soaring valuations.

Pentagon AI Goes Explicit: The Frontier Labs Move Inside the Classified Stack

The Pentagon announces agreements with major AI firms to embed advanced AI models into classified networks, signaling a shift towards AI-first military operations.