Three months into my current role, I inherited a team with “great” DORA metrics. Deployment frequency? Check. Lead time? Impressive. Change failure rate? Within acceptable range. The dashboards looked beautiful.
But the team was miserable. Quality issues kept surfacing weeks after deployment. Retrospectives felt like performance reviews. Engineers were stressed, burned out, and planning their exits.
What went wrong? The metrics were being used as surveillance tools, not learning instruments.
The Same Metrics, Opposite Outcomes
Here’s what I’ve learned across 18 years in engineering leadership: DORA metrics predict performance, but culture determines whether they help or hurt. The same five metrics (yes, deployment rework rate joined the party as the 5th metric) can drive two completely different outcomes:
The Surveillance Trap:
- Metrics become individual performance evaluations
- Teams game the system (empty commits, hiding incidents, splitting PRs artificially)
- Trust erodes, psychological safety collapses
- Metrics improve on paper while actual delivery degrades
The Learning Approach:
- Metrics indicate system health, not people scorecards
- Focus shifts to “What’s blocking us?” not “Why are you slow?”
- Blameless retrospectives identify bottlenecks
- Continuous improvement becomes team-driven, not manager-driven
Real Examples from My Teams
Team A (Previous Company - Surveillance Culture):
- High deployment frequency achieved by deploying configuration changes and empty commits
- Incidents were “resolved” in tools but root causes never addressed
- Lead time looked great because work was split into tiny PRs to game the metric
- Result: Burned out team, technical debt explosion, customer satisfaction plummeting
Team B (Current Company - Learning Culture):
- Lower deployment frequency, but every deploy delivers real value
- Incidents trigger blameless postmortems that improve the system
- Lead time conversations focus on reducing handoffs and wait times
- Result: Happy team, declining incident rate, customers love the quality
The 2026 AI Complication
Here’s a new wrinkle we’re all dealing with: AI tools are raising throughput across the industry. The 2025 DORA Report found that AI adoption improves deployment frequency but increases delivery instability.
This means:
- Deployment frequency is rising across your organization
- Change failure rate might be following right behind it
- You need to monitor quality metrics with extra care
- The surveillance approach becomes even more dangerous (blaming engineers for AI-amplified failures)
Psychological Safety Is the Foundation
The research is clear: psychological safety is among the strongest predictors of software delivery performance. Teams where members feel safe to take risks and voice concerns consistently perform better across all DORA metrics.
But surveillance-style metrics destroy psychological safety. When engineers fear their deployment frequency will be used in performance reviews, they optimize for the metric, not for the outcome.
What Actually Works
In the teams I’ve rebuilt using a learning approach:
- Never discuss individual metrics in 1:1s or reviews - DORA metrics are team-level indicators only
- Weekly “system improvement” sessions - Review metrics to identify bottlenecks, not to blame people
- Celebrate when metrics temporarily worsen - If a team slows deployments to fix technical debt, that’s success
- Pair DORA with quality metrics - Code quality, customer satisfaction, and technical debt alongside velocity
- Platform engineering investment - Reduce cognitive load system-wide rather than pushing individuals harder
The most powerful shift: When a metric looks bad, the first question is “What’s blocking the team?” not “Who’s underperforming?”
The Question for This Community
How are you using DORA metrics in your organizations?
Are they tools for learning and improvement? Or have they become surveillance instruments that teams learn to game?
For those who’ve successfully built a learning culture around metrics, what practices made the difference? For those dealing with surveillance cultures, what strategies have helped shift the conversation?
I’m particularly curious how other engineering leaders are handling the AI-amplified throughput challenge while maintaining quality and team health.
Context: Director of Engineering at Fortune 500 financial services company, formerly Intel and Adobe. Leading teams of 40+ engineers through digital transformation while trying to maintain the human elements that make engineering work sustainable.