After three years as CTO watching the AI productivity conversation evolve, I need to call out what I think is the fundamental problem:
Everyone’s tracking PR count. But PR count is not productivity.
Metrics Drive Behavior—Choose Carefully
The metrics we track determine what our teams optimize for. And right now, we’re optimizing for the wrong things.
I keep seeing orgs celebrate:
- “AI users merge 60% more PRs!”
- “Code output increased 98%!”
- “Developers report feeling 20% more productive!”
Then I ask: “What happened to your deployment frequency? Change failure rate? Customer satisfaction?”
Usually: silence, or declining trends.
DORA Still Matters—Maybe More Than Ever
The DORA metrics remain the best sanity check because they tie delivery speed directly to system stability:
- Deployment frequency - How often do we actually ship to production?
- Lead time for changes - Commit to production (not PR merge)
- Change failure rate - What percentage of deployments cause incidents?
- Time to restore service - When things break, how fast do we recover?
These metrics force you to care about the entire value stream, not just the part AI makes faster.
The New Reality: Separating Code Output from Value Delivery
Here’s the framework I’m using in 2026:
Vanity Metrics (Stop Tracking These)
- Pull requests per week
- Lines of code written
- “Percentage of code written by AI”
- Individual developer velocity
- Time to PR merge
Business Impact Metrics (Start Tracking These)
- Time to customer value (idea → production → adoption)
- Deployment success rate (deployments without incidents)
- Customer satisfaction (NPS, support tickets, adoption)
- Production stability (MTTR, incident frequency)
- Team health (engagement, knowledge distribution, collaboration)
What the Research Actually Shows
The data is stark: Research from 2026 shows that correlation between AI adoption and company-level KPIs completely evaporates.
Teams merge more PRs. Companies don’t ship more value.
Why? Because we’re measuring intermediates, not outcomes.
Writing code is easy. Validating it, integrating it, deploying it safely, ensuring customers actually want it—that’s the hard part. And that’s what actually creates business value.
My Recommendations for 2026
1. Stop celebrating velocity
Start celebrating: successful deployments, resolved customer issues, reduced incidents.
2. Track end-to-end, not intermediates
Measure from customer need → delivered solution, not from code start → PR merge.
3. Make quality visible
Track defect rates, rework rates, production incidents alongside velocity.
4. Include team health
Knowledge distribution, collaboration time, psychological safety matter.
5. Align engineering and business metrics
If engineering hits all its numbers but the business isn’t succeeding, your metrics are wrong.
The Question That Matters
Not: “How many PRs did we merge this quarter?”
But: “How much customer value did we deliver, and how sustainably did we do it?”
For engineering leaders: What are you actually measuring in 2026? Have you stopped celebrating velocity? What metrics tie your team’s work to business outcomes?
I’d love to hear what frameworks are working for others.
Sources: AI Productivity Paradox Research | Rethinking Productivity Metrics | Developer Productivity Measurement 2026