We’ve all felt it—that rush when AI autocompletes a function, the satisfaction when Copilot nails the boilerplate. It feels faster. Our team went all-in on AI coding assistants 8 months ago, and developers swear they’re saving hours every week.
But here’s what’s bothering me: When I actually measure our velocity, the numbers don’t match the hype.
The Numbers That Don’t Add Up
I just read some eye-opening research that made me question everything:
- 85% of developers now use AI coding tools daily (source)
- 26.9% of production code is now AI-authored (up from 22% last quarter) (source)
- But measured productivity gains? Only 10-20% (source)
The most shocking part? In a controlled study, experienced developers using AI took 19% LONGER to complete tasks—yet they still believed AI made them 20% faster. That’s a 39-percentage-point perception gap between feeling productive and being productive. (source)
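For anyone checking the arithmetic, here is a minimal sketch of where the 39 points come from. It assumes both figures can be read as signed changes in task completion time (negative means faster), which is my reading of the study, not something the summary above spells out.

```python
# Signed change in task completion time, in percent (negative = faster).
# Assumption: both study figures are expressed on this same scale.
perceived_change = -20.0  # developers believed AI cut their time by 20%
measured_change = 19.0    # the study measured tasks taking 19% longer

# The gap is the distance between what was felt and what was measured.
perception_gap = measured_change - perceived_change
print(f"Perception gap: {perception_gap:.0f} percentage points")
```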
What I’m Seeing on My Team
Our data tells a similar story:
- Pull requests are up 60% (sounds great!)
- But PR review time increased 91% (bottleneck alert)
- Code quality issues are 1.7x higher in AI-heavy PRs (source)
- Time from feature request to production hasn’t budged
It’s like we’re coding faster but shipping at the same pace—or slower.
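A quick back-of-the-envelope sketch of why review could become the bottleneck. It assumes the 91% increase is review time per PR, which the numbers above don't actually specify. If it is total review time instead, the multiplier below overstates the load.

```python
pr_growth = 0.60           # PR count up 60% (from the list above)
review_time_growth = 0.91  # review time up 91%; ASSUMED to be per PR

# If both grow independently, total reviewer hours scale by the product.
review_load_multiplier = (1 + pr_growth) * (1 + review_time_growth)
print(f"Total review load: {review_load_multiplier:.2f}x the old baseline")
```

Under that assumption, the same reviewers would be absorbing roughly three times the work, which would be consistent with delivery time not moving.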
The Productivity Placebo Effect
I think what’s happening is a productivity placebo. AI code generation triggers dopamine—instant feedback, instant “progress.” It feels like achievement, even when the downstream costs (review time, debugging, tech debt) eat those gains alive.
The AI “feels faster” trap is real. One study called it hijacking our brain’s reward system—giving the feeling of achievement without the heavy lifting. (source)
Where the Gains Actually Show Up
Not everything is disappointing though. We ARE seeing wins:
- Onboarding time cut in half (time to 10th merged PR) (source)
- 3.6 hours saved per week per dev on routine tasks (source)
- 46% reduction in time on routine coding (McKinsey study) (source)
So the tools work, just not the way we thought. They're great for routine work, but they don't help much when the bottleneck is review, testing, or deployment.
The Question That Keeps Me Up
Are we measuring the wrong things?
Maybe “lines of code per hour” was always a vanity metric. Maybe AI is exposing that our real constraints are:
- Review bottlenecks (humans can’t keep up)
- Brittle test suites (can't validate code as fast as it's written)
- Slow release pipelines (infrastructure can’t match velocity)
- Unclear requirements (AI can’t fix this)
Or maybe we’re in the awkward middle phase—using AI like a faster typewriter instead of rethinking how we build entirely.
What I’m Trying Next
- Stop measuring just “coding time”—measure end-to-end delivery
- Track AI code separately (% of each PR that’s AI-generated) and correlate with quality metrics
- Invest in the bottlenecks (review automation, test quality, faster deploys)
- Set realistic expectations with the team—it’s okay if AI doesn’t 10x us overnight
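To make the first two items concrete, here is a rough sketch of what that tracking could look like. Everything in it is hypothetical: the `PullRequest` fields, the idea that you can get an AI-generated share per PR, and the choice of a plain Pearson correlation are all assumptions for illustration, not a description of any particular tool.

```python
from dataclasses import dataclass
from datetime import datetime
from math import sqrt
from statistics import median


@dataclass
class PullRequest:
    # All fields are hypothetical; map them to whatever your tooling records.
    requested_at: datetime  # when the feature request was filed
    deployed_at: datetime   # when the change reached production
    ai_share: float         # fraction of changed lines that were AI-generated (0.0 to 1.0)
    defects: int            # quality issues later traced back to this PR


def lead_time_days(pr: PullRequest) -> float:
    """End-to-end delivery time: feature request to production, in days."""
    return (pr.deployed_at - pr.requested_at).total_seconds() / 86400


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation coefficient; assumes neither input is constant."""
    mean_x, mean_y = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def summarize(prs: list[PullRequest]) -> dict[str, float]:
    """Median lead time plus the AI-share vs. defects correlation."""
    return {
        "median_lead_time_days": median(lead_time_days(pr) for pr in prs),
        "ai_share_vs_defects": pearson(
            [pr.ai_share for pr in prs],
            [float(pr.defects) for pr in prs],
        ),
    }
```

Median lead time is used rather than the mean so a single stuck feature doesn't dominate the number. A correlation on its own won't show that AI-generated code causes defects, but it should be enough to tell whether the question deserves a closer look.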
Questions for You
- Are you seeing similar gaps between perceived and measured productivity?
- What metrics actually matter when AI changes the game?
- How do you prevent the “productivity placebo” from derailing real progress?
- Should we be rethinking our entire software development process instead of just adding AI to the old one?
I want to believe the hype. I really do. But right now, the data is telling me we’re mistaken about AI’s impact—or at least, we haven’t figured out how to capture it yet.
What am I missing?