The Model Is Only 10%: The Real Lesson of the New SDLC

📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent whitepaper from Google highlights that in AI development, the model itself accounts for only 10% of system behavior. The focus should shift to harnessing and configuring the surrounding infrastructure, which represents 90%. This insight could reshape how organizations approach AI deployment and management.

A new Google whitepaper, ‘The New SDLC With Vibe Coding,’ asserts that the most impactful part of AI systems is not the model itself, but the surrounding harness and configuration, which constitute approximately 90% of system behavior. This challenges the common focus on model improvements and suggests a strategic shift for AI practitioners and organizations.The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, emphasizes that AI development should prioritize harnessing and configuring tools, prompts, and context rather than solely relying on the latest model advancements. It presents evidence from benchmarks showing that minor adjustments to the harness can significantly improve performance, even with the same underlying model. The authors introduce the concept of ‘agentic engineering,’ where AI is treated as a system of components—model plus harness—rather than a monolithic entity. They also highlight that the cost and complexity of AI systems are driven more by configuration, context management, and verification processes than by the models themselves. This perspective advocates for a disciplined approach, focusing on verification, testing, and context engineering to reduce long-term costs and security risks associated with AI deployment.
At a glance
reportWhen: published March 2026
The developmentGoogle’s new whitepaper reveals that the core of effective AI systems lies in configuration and verification, not just the AI model itself, marking a significant shift in AI development strategy.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Why Focusing on Harness and Configuration Changes the AI Strategy

This shift in perspective matters because it redirects organizational efforts from chasing ever-larger models to optimizing the surrounding infrastructure. By understanding that 90% of behavior is determined by how the AI system is configured and managed, companies can achieve better performance, lower costs, and improved security without constantly upgrading to new models. This approach enables more sustainable and controllable AI deployment, which is critical as AI becomes embedded in core business processes.
AI Model Risk Blueprint: Model Validation Testing | Ethical Considerations in AI Models | Integrating AI with Business Risk Plans | Real-World AI Model ... Strategies | AI Governance Tools & Resource

AI Model Risk Blueprint: Model Validation Testing | Ethical Considerations in AI Models | Integrating AI with Business Risk Plans | Real-World AI Model … Strategies | AI Governance Tools & Resource

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Reevaluating the Focus in AI Development Strategies

Historically, AI advancements have centered on developing larger, more capable models, with improvements often attributed directly to the model’s architecture. However, recent experiments and benchmarks, including those cited in the whitepaper, show that tuning the harness—prompts, tools, context, and guardrails—can outperform raw model improvements. The whitepaper situates this insight within ongoing debates about AI cost, security, and reliability, emphasizing that configuration and verification are now central to effective AI systems. This represents a paradigm shift from the ‘model-centric’ view to a ‘system-centric’ view, where the surrounding infrastructure is recognized as the primary driver of AI behavior.

“The behavior you experience in AI systems is dominated by scaffolding you can build, own, and improve—it’s not just about the model.”

— Addy Osmani

YAML Made Simple: A Beginner’s Guide to Configuration and Data Structuring

YAML Made Simple: A Beginner’s Guide to Configuration and Data Structuring

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Remaining Questions About Implementation and Cost

It is not yet clear how organizations will systematically adopt this approach at scale, or how quickly the industry will shift focus from models to harness and configuration. The specific cost-benefit dynamics across different sectors and use cases remain to be validated through real-world deployment and long-term studies.
FNIRSI LCR-ST1 10KHz LCR Meter Tweezer, Mini Smart SMD Tester, ESR Meter, Multimeter, Capacitance Inductance Continuity Diode Tester, Auto Component Recognition Electronic Component Analyze

FNIRSI LCR-ST1 10KHz LCR Meter Tweezer, Mini Smart SMD Tester, ESR Meter, Multimeter, Capacitance Inductance Continuity Diode Tester, Auto Component Recognition Electronic Component Analyze

【LCR Tweezer Tester】The FNIRSI LCR-ST1 is a multifunctional and portable testing instrument that supports precise measurement of resistance,…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for AI Development and Industry Adoption

Organizations are likely to begin investing more heavily in tools, frameworks, and best practices for harnessing AI systems. Further research and case studies will clarify how configuration and verification can be optimized at scale, potentially leading to new standards and training in system engineering for AI. Monitoring industry shifts and benchmarking improvements will be key to assessing the impact of this paradigm change.
The AI Testing Framework: Quality Assurance for Machine Learning Systems

The AI Testing Framework: Quality Assurance for Machine Learning Systems

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system’s behavior?

According to the whitepaper, the majority of an AI system’s behavior is determined by how the model is configured, including prompts, tools, and context management, which together account for about 90%.

What is ‘agentic engineering’?

It’s an approach where AI is treated as a system of components—model plus harness—focusing on configuration, verification, and context to improve performance and reliability.

How does this shift affect AI development costs?

While initial setup costs for configuration and testing might be higher, long-term operational costs are lower because tuning and managing the system is cheaper and more predictable than constantly upgrading models.

Will this change the way AI tools are built and sold?

Yes, it suggests that organizations should focus on building and owning their harnesses and verification tools, making these the primary source of competitive advantage rather than just relying on model providers.

Source: ThorstenMeyerAI.com

You May Also Like

The 29th International Obfuscated C Code Contest (IOCCC) 2025 Winners

The 29th International Obfuscated C Code Contest announced its winners in 2025, showcasing innovative and complex code entries. Read the details here.

Amazon’s AI Shopping Overhaul

Amazon introduces a significant overhaul of its shopping platform, integrating advanced AI features to personalize and streamline customer experiences.

Microsoft degrades functionality of perpetually-licensed offline products

Microsoft plans to restrict offline functionality of Office 2019 for Mac starting July 13, 2026, due to expired licensing certificates, affecting users’ ability to edit files.

WWDC 2026: Apple is Folding

Apple announced a foldable iPhone, the iPhone Ultra, at WWDC 2026, signaling a major hardware shift and new developer guidelines for adaptable apps.