Last Tuesday, I sat in a leadership meeting staring at our engineering dashboard. Every metric was green. DORA deployment frequency: up 40%. AI code acceptance rate: 47%. PR velocity: record high. Our VP of Product smiled and said, “Engineering is crushing it.”
Two hours later, three of my senior engineers came to my office. They were burned out. One was considering leaving. The metrics said we were winning, but my team was drowning.
Welcome to 2026, where we measure everything and understand nothing.
The Measurement Explosion
In the past year, engineering organizations have gone metrics-crazy:
- DORA metrics (deployment frequency, lead time, change failure rate, MTTR, now rework rate)
- AI acceptance rates (how much AI-generated code we merge)
- PR velocity (how fast we close pull requests)
- Cycle time (idea to production)
- Code review throughput
- Developer productivity scores
We track it all. We dashboard it. We present it to leadership. We benchmark against “elite performers.”
But here’s what I’m seeing on the ground: the more we measure, the less we seem to understand about actual productivity.
The Gaming Has Begun
Let me share what’s actually happening on my teams:
PR Velocity Gaming: One team started splitting logical commits into tiny PRs to hit velocity targets. Their merge rate went up 60%. Their feature delivery slowed down 30%. We were measuring speed, they optimized for speed. Wrong speed.
AI Acceptance Rate Theater: Our “47% AI acceptance rate” sounds impressive until you realize it’s dominated by trivial autocompletes—imports, boilerplate, simple variable names. The complex business logic? Still 95% human-written. But the metric doesn’t distinguish.
DORA Deployment Frequency Mirage: Deployment frequency is up, and so are our bug reports. We’re shipping faster, but we’re also shipping broken. The metric rewards frequency, not quality or impact.
The Paradox Nobody Talks About
Here’s the thing that keeps me up at night: We have more data than ever about engineering activity, but less clarity than ever about engineering impact.
- Does a 40% increase in deployment frequency matter if customer satisfaction didn’t move?
- Does AI writing 41% of our code matter if productivity gains are still stuck at 10%?
- Does PR velocity mean anything if the right features aren’t getting built?
I grew up in El Paso, first in my family to go to college. My mentors taught me early: data tells the story you ask it to tell. If you ask the wrong questions, you get precise answers to meaningless questions.
What Should We Actually Measure?
I’m genuinely asking this community: Are we measuring engineers, or just measuring noise?
Some questions I’m wrestling with:
-
Outcome vs. Activity: Should we even track DORA metrics, or should we only track customer impact and business outcomes?
-
The AI Metrics Trap: Is “AI acceptance rate” a useful metric, or does it incentivize accepting mediocre AI suggestions just to hit a number?
-
Speed vs. Impact: PR velocity and deployment frequency measure speed. But speed toward what? How do we measure whether we’re building the right things?
-
Team Health: Developer satisfaction, psychological safety, learning & growth—these matter, but they’re hard to quantify. Should we stop trying to measure them, or measure them differently?
-
The Measurement Tax: Every metric has overhead—someone has to track it, analyze it, present it. At what point does the measurement overhead exceed the value of the insights?
The Signal in the Noise
At my previous companies (Intel, Adobe), I learned that the best teams don’t have the best metrics—they have the clearest understanding of what matters. Sometimes that’s a metric. Sometimes it’s a weekly conversation with customers. Sometimes it’s watching a junior engineer struggle and asking why.
I’m starting to think we’re optimizing for legibility instead of impact. Leadership wants dashboards they can understand, so we create dashboards. But reality doesn’t fit neatly into four DORA metrics and an AI acceptance rate.
Your Turn
I know I’m not alone in this. I see this tension across the industry—in conversations with other directors, in conference talks, in the uncomfortable silence when someone asks “But what does this metric actually tell us?”
So I’m curious:
- What metrics do you actually trust? Which ones have helped you make better decisions?
- What metrics have you stopped tracking? Which ones created more noise than signal?
- How do you measure engineering value in a way that’s honest, useful, and doesn’t incentivize gaming?
- For those in financial services or regulated industries: How do you balance compliance requirements (which demand measurement) with the reality that not everything meaningful can be measured?
Because right now, I have a dashboard full of green metrics and a team that’s struggling. And I’m pretty sure the dashboard is lying.
What am I missing here?