📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent whitepaper from Google highlights that in AI-assisted software engineering, the model accounts for only about 10% of system behavior. The focus should be on harness design and context engineering, which account for the majority of performance and reliability.

A new Google whitepaper published in early 2026 states that the AI model constitutes only about 10% of the factors determining system behavior in AI-assisted software development. This shifts the focus from model improvements to harness and context engineering, which hold the majority of the system’s effectiveness and reliability.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, emphasizes that the most significant shift in software engineering is moving from writing code to expressing intent and trusting machines to interpret that intent. According to the authors, 85% of professional developers now use AI coding agents regularly, with 51% using them daily, and approximately 41% of new code generated by AI.

The core insight is that the model itself is only a small part of the system. Instead, the behavior and quality of AI systems depend heavily on the harness — the prompts, tools, rules, and observability layers surrounding the model — which the paper claims accounts for about 90% of the system’s performance. Concrete experiments cited show that changing only the harness can significantly improve an agent’s performance, even with the same model.

The paper advocates a shift toward agentic engineering, where structured verification, testing, and context management are prioritized over chasing the latest model improvements. It also highlights that effective context engineering — loading relevant instructions, knowledge, and tools dynamically — is crucial for scaling AI systems efficiently and cost-effectively.

At a glance

reportWhen: published early 2026

The developmentThe Google whitepaper reveals that in AI coding workflows, the model is just 10% of the system, with harness and context engineering making up 90%.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.

The idea worth building your strategy around

Agent = Model + Harness

~10%

HARNESS — prompts · tools · context · hooks · sandboxes · observability

MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.

85%

of devs use AI coding agents (51% daily)

41%

of all new code is AI-generated

~90%

of agent behavior is the harness, not the model

+19%

longer on some tasks (METR) — verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Implications for AI Development Strategies

This shift in understanding challenges the common focus on acquiring the latest and most powerful AI models. Instead, organizations can achieve better results by investing in harness design and context engineering. This approach reduces costs, enhances reliability, and creates durable competitive advantages, as the harness is within the organization’s control and can be continuously optimized.

For engineering leaders, the message is clear: model improvements are only part of the story. The real leverage lies in how AI systems are structured, verified, and maintained, which has profound implications for cost management and security in AI deployment.

Building AI-Powered Products: The Essential Guide to AI and GenAI Product Management

As an affiliate, we earn on qualifying purchases.

Background on AI-Assisted Software Development

Since early 2026, the adoption of AI coding agents has accelerated, with 85% of developers using them regularly, and 41% of new code being AI-generated, according to industry surveys. Previous focus centered on model capabilities and rapid iteration of AI models. However, the new Google whitepaper shifts the narrative, emphasizing that the behavior of AI systems depends more on how they are configured and managed than on the models themselves. Experiments cited in the paper demonstrate that changing the harness can significantly outperform simply upgrading the model, marking a fundamental change in AI development philosophy.

“The model is only 10% of what determines behavior; the harness is 90%. Focus on configuration and context.”
— Addy Osmani

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

As an affiliate, we earn on qualifying purchases.

Unanswered Questions About Implementation and Scaling

While the whitepaper provides compelling evidence that harness and context engineering are critical, it remains unclear how organizations will best standardize these practices at scale. The precise methods for measuring harness effectiveness and integrating dynamic context management into existing workflows are still being developed. Additionally, the long-term impact on model innovation and how this shift might influence AI model development cycles is not yet fully understood.

Observability in the AI-Native Era: Leveraging AIOps to build, observe, and operate resilient systems

As an affiliate, we earn on qualifying purchases.

Next Steps for Organizations Adopting the New SDLC

Organizations should prioritize investing in harness development, context management, and verification frameworks. Pilot projects testing structured engineering approaches can validate cost savings and reliability improvements. Industry groups and standards bodies may soon develop guidelines to formalize best practices in harness and context engineering, helping organizations adopt this paradigm shift effectively.

Amazon

AI testing and verification tools

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of system behavior?

The whitepaper shows that the surrounding infrastructure — prompts, tools, rules, and observability — has a much larger impact on how AI systems perform than the models themselves.

How can organizations improve AI system reliability?

By focusing on harness design, including structured prompts, verification, and dynamic context loading, organizations can significantly enhance reliability and reduce costs.

Does this mean AI models are less important?

Not less important, but the whitepaper suggests that the value of models is amplified when combined with well-engineered harnesses and context management.

What are the economic implications of this shift?

Investing in harness and context engineering can lower long-term costs by reducing token waste, improving security, and decreasing maintenance efforts, despite higher initial development expenses.

What should AI teams focus on first?

Teams should start by optimizing their harnesses — prompts, tools, and verification layers — and developing dynamic context management practices.

Source: ThorstenMeyerAI.com

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Cutrova: Edit the Words, Not the Timeline

Author

Curious Minds Team

Share article

The model is only 10%

Implications for AI Development Strategies

Building AI-Powered Products: The Essential Guide to AI and GenAI Product Management

Background on AI-Assisted Software Development

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

Unanswered Questions About Implementation and Scaling

Observability in the AI-Native Era: Leveraging AIOps to build, observe, and operate resilient systems

Next Steps for Organizations Adopting the New SDLC

AI testing and verification tools

Key Questions

Why is the model only 10% of system behavior?

How can organizations improve AI system reliability?

Does this mean AI models are less important?

What are the economic implications of this shift?

What should AI teams focus on first?

Next-Generation Lithium-Air Battery Could Dramatically Extend EV Range

The Power Bottleneck: AI Data Centers and the Grid Cliff Approaching 2027-2028

Mistral. The fourth path.

Elevate Your Aerial Video With These 9 AI Camera Drones In 2026

What Makes a Home Infrared Sauna Worth the Space?

How Rewilding Changes More Than Just Animal Numbers

What AI Will Look Like In 2026: 6 Predictions

9 AI-Enhanced Projectors Perfect For Big-Screen Nights In 2026

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Author

Curious Minds Team

Share article

The model is only 10%

Implications for AI Development Strategies

Building AI-Powered Products: The Essential Guide to AI and GenAI Product Management

Background on AI-Assisted Software Development

The AI Prompt Playbook: Master AI Prompt Engineering with 140 Ready-to-Use Templates for ChatGPT, Claude, Gemini & Copilot

Unanswered Questions About Implementation and Scaling

Observability in the AI-Native Era: Leveraging AIOps to build, observe, and operate resilient systems

Next Steps for Organizations Adopting the New SDLC

AI testing and verification tools

Key Questions

Why is the model only 10% of system behavior?

How can organizations improve AI system reliability?

Does this mean AI models are less important?

What are the economic implications of this shift?

What should AI teams focus on first?

You May Also Like