I’ve been tracking something troubling across our engineering org, and new research from Thoughtworks just validated what I’ve been seeing.
The headline: AI coding assistants made our developers roughly 30% faster at writing code. That’s huge, right?
The reality: Our net delivery improvement? About 8%.
Let me break down what’s happening. When we rolled out AI coding tools last year, individual velocity metrics skyrocketed. Developers were cranking out features faster than ever. PRs were flying. The team felt productive. Leadership was celebrating the AI investment.
But when we looked at actual feature delivery—time from conception to production—we barely moved the needle.
Why the Gap?
Coding is only about half of our total cycle time. The other half? Testing, code reviews, waiting for environments, managing dependencies between teams, dealing with deployment pipelines.
AI helps developers write code faster, but it doesn’t:
- Speed up security reviews in our fintech environment
- Resolve architectural decisions faster
- Reduce waiting time for QA environments
- Eliminate cross-team dependencies
- Make our CI/CD pipeline any faster
- Reduce the cognitive load of reviewing larger, AI-generated PRs
We optimized one part of the system and celebrated it. But the system didn’t get faster.
The Leadership Blind Spot
Here’s what worries me: we’re measuring—and rewarding—the wrong thing.
We track “lines of code written” and “PRs submitted” because those numbers look great in board decks. But customers don’t care about code velocity. They care about features in production.
The Thoughtworks research calls this out explicitly: the 8% delivery improvement is what actually matters to business outcomes, forecasting accuracy, and customer impact. Not the 30% coding gain.
The Real Question
Should we shift our focus from developer productivity to delivery system productivity?
What if instead of celebrating faster coding, we invested in:
- Automating code reviews and testing pipelines
- Reducing handoffs and wait times
- Simplifying deployment processes
- Resolving architectural bottlenecks
- Improving cross-team coordination
I’m curious what other engineering leaders are seeing. Are you measuring delivery outcomes or just coding throughput? What bottlenecks are you tackling?
And honestly—are we solving the right problem with AI tools, or just making one part of a broken system incrementally better?
Sources: Thoughtworks AI Productivity Research, Software Development Bottlenecks 2026