Just got early access to OpenAI’s new GPT-5 Codex optimized for agentic coding. After a week of testing, here are my honest thoughts on what this means for software development.
What ‘Agentic Coding’ Actually Means:
Instead of just generating code snippets, GPT-5 Codex can:
- Plan entire feature implementations
- Break down complex tasks into subtasks
- Execute multi-step coding workflows
- Self-correct and iterate on solutions
- Understand broader codebase context
Real Examples from My Testing:
Feature Planning That Actually Works
Me: “Add real-time collaboration to our React app”
GPT-5 Codex Response:
I'll implement real-time collaboration using WebSockets. Here's my plan:
1. Backend WebSocket server setup
- Socket.io server with rooms
- User presence tracking
- Conflict resolution for simultaneous edits
2. Frontend real-time state management
- Redux store for collaborative state
- Optimistic updates with rollback
- Cursor position broadcasting
3. UI components
- User avatars showing who's online
- Real-time cursor indicators
- Conflict resolution modal
Should I start with the WebSocket server? I'll set up the basic room management first.
Then it actually implemented each step, asked for feedback, and iterated.
Code Review That’s Actually Helpful
I gave it a buggy React component:
function UserList({ users }) {
const [filteredUsers, setFilteredUsers] = useState([]);
useEffect(() => {
setFilteredUsers(users.filter(u => u.active));
}, []);
return (
<div>
{filteredUsers.map(user =>
<div key={user.id}>{user.name}</div>
)}
</div>
);
}
GPT-5 Codex found the bug immediately:
Issue found: useEffect missing dependency array will cause stale closure.
The useEffect should depend on [users], otherwise filteredUsers won't update when users prop changes.
Here's the fix:
useEffect(() => {
setFilteredUsers(users.filter(u => u.active));
}, [users]); // Add users dependency
Alternatively, consider useMemo for better performance:
const filteredUsers = useMemo(() =>
users.filter(u => u.active), [users]
);
This eliminates the state and effect entirely.
Architecture Decisions with Context
Me: “Our API is getting slow with complex queries. Should we add GraphQL?”
GPT-5 Codex:
Based on your codebase analysis:
- 80% of your endpoints are simple CRUD operations
- Main performance issues are N+1 queries in your ORM
- Your frontend makes 5-10 API calls per page load
Recommendation: Fix the immediate performance issues first before adding GraphQL complexity:
1. Add proper eager loading to your ORM queries
2. Implement response caching for frequently accessed data
3. Create composite endpoints for pages that need multiple resources
Estimated impact: 60-80% performance improvement
Estimated effort: 2-3 days vs 2-3 weeks for GraphQL migration
Should I show you the specific ORM optimizations?
What’s Actually Impressive:
Contextual Understanding: It reads my entire codebase and understands the architecture
Iterative Problem Solving: It can go back and refactor its own solutions
Technology Decisions: It considers trade-offs, not just “latest and greatest”
Testing Strategy: It writes tests alongside implementation
Performance Awareness: It considers scalability and optimization
What Still Needs Work:
Complex Business Logic: Still struggles with domain-specific requirements
Security Considerations: Sometimes misses obvious security issues
Legacy Code Integration: Has trouble with older, undocumented codebases
Infrastructure Decisions: Weak on DevOps and deployment strategies
Cost Optimization: Doesn’t consider cloud costs or resource efficiency
Real Productivity Impact:
30% faster for new features - The planning and scaffolding phases are much quicker
50% faster for refactoring - It understands the impact of changes across the codebase
80% faster for debugging - Code review and issue identification is incredible
20% faster overall - Still need human judgment for architecture and business logic
Concerns About the Future:
Skill Atrophy: Will junior developers lose fundamental programming skills?
Over-Dependence: What happens when the AI is wrong and we don’t catch it?
Debugging AI-Generated Code: How do you debug code you didn’t write?
Job Market Impact: What does this mean for different experience levels?
My Current Workflow Integration:
- Planning Phase: Let GPT-5 Codex break down features and suggest architectures
- Implementation: Use it for boilerplate and common patterns
- Review Phase: Have it review my code for issues I might have missed
- Testing: Generate test cases and edge cases I didn’t consider
- Documentation: Create technical documentation and API docs
Prompt Engineering Tips That Work:
// Good prompt
"Implement user authentication for my Express.js app. Here's my existing user model [paste code]. I need JWT tokens, password hashing, and middleware for protected routes. Follow the security patterns already established in my codebase."
// Bad prompt
"Add login"
Integration with Existing Tools:
- VS Code Extension: Works surprisingly well with existing workflows
- GitHub Copilot: Can be used alongside for different types of assistance
- Testing Frameworks: Generates test cases that actually make sense
- Documentation: Auto-generates JSDoc and README updates
The Honest Assessment:
This isn’t going to replace developers, but it’s going to change how we work significantly. The developers who learn to work with AI effectively will have a massive advantage.
It’s like having a really smart junior developer who:
- Never gets tired
- Has read every Stack Overflow answer
- Knows every framework and library
- But still needs senior oversight and domain expertise
Questions for the Community:
- How are you integrating AI into your development workflow?
- What tasks do you think should remain “human-only”?
- How do we maintain code quality when AI generates more of our code?
- What’s your experience with AI pair programming?
- How do we train junior developers in an AI-assisted world?
Hot Take: The developers who resist AI tools will be at a massive disadvantage within 2 years. But the developers who rely on them too heavily will produce brittle, unmaintainable code.
The sweet spot is using AI to amplify human creativity and judgment, not replace it.
What’s your take? Are you excited or worried about agentic coding?