I’ve been thinking a lot about AI productivity data lately, and there’s something that doesn’t quite add up with how we’re planning sprints.
The numbers are clear: AI coding assistants give us about 20-30% throughput improvements on simple feature work, but we’re seeing up to 90% productivity gains on refactoring and testing. That’s a massive difference. Yet our sprint planning still treats all work as roughly equal effort.
Here’s what I’m wrestling with: If AI makes refactoring 90% faster, should we be doing 3x more of it?
The Current State
At my team, we follow a pretty typical sprint allocation:
- 60% new features
- 20% bug fixes
- 20% technical debt / refactoring
This made sense when all work types took similar effort. But with AI assistance, that 20% refactoring budget now represents way less calendar time than the 60% feature work. We’re artificially constraining our capacity for high-leverage cleanup work.
The Asymmetry Problem
The data is compelling (source):
- Simple CRUD features: 20-30% faster with AI
- Unit test generation: 50%+ faster
- Refactoring existing code: 70-90% faster
- Boilerplate/scaffolding: 55% faster
Traditional sprint planning assumes uniform productivity across task types. That assumption is broken now.
The Opportunity
If we rebalanced our sprint mix to lean into areas where AI excels, we could:
- Dramatically reduce technical debt backlog
- Increase test coverage without sacrificing features
- Free up mental energy currently spent context-switching between messy code
But I’m cautious. Just because we can refactor faster doesn’t automatically mean we should refactor more.
What I Want to Know
For other engineering leaders: Have you adjusted your sprint planning based on AI productivity asymmetry?
What I’m specifically curious about:
- Are you allocating more sprint capacity to testing/refactoring given the productivity gains?
- How do you decide which refactoring is worth doing vs. busywork?
- Have you seen this translate to actual business outcomes, or just happier engineers?
I feel like there’s a strategic advantage hiding in this asymmetry, but I haven’t figured out how to capture it systematically. Would love to hear how others are thinking about this.
Context: Engineering Director leading 40+ engineers at a financial services company. We adopted GitHub Copilot enterprise-wide 8 months ago.