We just wrapped Q4 reviews at our EdTech startup, and the engineering metrics looked phenomenal. Deployment frequency up 40%. Mean time to recovery down 35%. Lead time for changes cut in half. By every DORA metric, we were crushing it.
Then I looked at our quarterly engagement survey. Developer stress was up 28%. Three senior engineers mentioned burnout in their feedback. Two asked about sabbatical policies.
Something didn’t add up.
The Metrics Gap Nobody Talks About
I started digging into what our DORA metrics weren’t measuring. Turns out, a lot.
Recent research shows 76% of organizations admit their software architecture’s cognitive burden creates developer stress and lowers productivity. That number stunned me, but it matched what I was seeing on my own team.
We were measuring how fast we shipped. We weren’t measuring how hard it was to think.
What Is Cognitive Load, Anyway?
Cognitive load is the mental effort required to complete tasks. In software development, it includes:
- Tool complexity: The number and complexity of systems engineers must navigate
- Context switching: Moving between different codebases, tools, meetings, Slack channels
- Unclear ownership: Not knowing who owns what or how systems connect
- Documentation gaps: Having to reverse-engineer intent from code
- Architectural debt: Working around legacy decisions that no longer make sense
Research shows developer performance drops noticeably with cognitive overload. Too much mental burden limits information processing, causing major delays in task completion.
The 2000-2020 explosion in tooling—from IDE and CVS to Docker, Kubernetes, Terraform, and beyond—was unsustainable. And we’re still adding more.
Platform Engineering: One Potential Answer
The good news? Platform engineering adoption has reduced cognitive load by 40-50% through self-service infrastructure and abstraction of complexity.
Gartner projects that 80% of large engineering organizations now maintain dedicated platform teams, up from 45% in 2022. The goal: eliminate toil and let developers focus on business-critical innovation.
At our startup, we’ve started investing in internal developer platforms—golden paths for deployment, observability, testing. Early results are promising. Our cognitive load survey (yes, we now measure this) shows a 25% reduction in self-reported stress around infrastructure decisions.
The Measurement Challenge
But here’s where it gets tricky: How do you quantify cognitive burden?
DORA metrics are clean. Objective. Easy to dashboard. Cognitive load is messier:
- Self-reported stress surveys (subjective, lagging)
- Tool count audits (objective but incomplete—not all tools are equal)
- Time-to-productivity for new engineers (useful but slow-moving)
- Context switch frequency (hard to measure accurately)
- Code comprehension time (interesting proxy, difficult to instrument)
The SPACE framework and DevEx metrics attempt to measure perceptions and workflows across feedback loops, cognitive load, and flow state. But adoption is still early.
The Hard Questions
I’m wrestling with these questions:
-
Should cognitive load be as important as deployment frequency? Or is it just a nice-to-have when DORA metrics are strong?
-
How are others measuring this? What metrics or proxies have you found useful?
-
When does platform engineering help vs. add another layer of complexity? We’ve all seen internal tools that became their own source of cognitive burden.
-
Is there a trade-off between velocity and cognitive sustainability? Or can you optimize for both?
-
How do you convince executives to invest in “thinking time” when the culture rewards shipping fast?
Why This Matters
I used to think productivity was about removing friction and speeding up cycles. Now I think it’s about sustainable delivery—balancing speed with the cognitive capacity to do excellent work.
As architecture complexity increases, the more lines of code get spent on bug-fixing rather than new features. That’s a productivity tax we can’t DORA-metric our way out of.
If we’re serious about developer experience, we need metrics that capture the human side of software delivery—not just the operational side.
What are you tracking beyond DORA? How are you measuring (and managing) cognitive load on your teams?