A developer on your team ships six pull requests before lunch. The review queue has 47 items in it. Nobody's looking at them.
This is the new normal.
The year code became cheap
Paul Dix, CTO of InfluxDB, called 2025 "the year code became cheap and programming changed forever." He's not wrong. With agentic tools like Claude Code, Codex CLI, and Gemini CLI, a single developer can now produce prototypes, migrations, refactors, bug fixes, tests, and glue code at a pace that would have been absurd two years ago.
The capability jump is real. I've felt it myself. Work that used to take an afternoon now takes twenty minutes. You describe the shape of what you want, the agent fills in the details, and you move on to the next thing.
But here's what nobody celebrating the speed seems to notice: shipped PRs are not shipped value.
The constraint moved
Writing code was never the bottleneck in software engineering. It never was. The slow parts were always the same: clarifying requirements, reviewing changes, validating correctness, getting to production safely, and operating what you shipped.
AI made the fast part faster. It left the slow parts completely untouched.
The data tells the story. Teams with high AI adoption merged 98% more pull requests in 2026. Sounds brilliant until you see the other side: PR review times increased by 91%. A CodeRabbit study found AI-written code surfaces 1.7x more issues than human-written code. And 43% of AI-generated changes need debugging in production.
More code. More issues per PR. Longer review times. Slower path to production.
The bottleneck didn't disappear. It moved downstream, right into the laps of your most experienced engineers.
The review queue is eating your seniors alive
Think about what a code review actually requires. You need to understand the intent behind the change, trace its effects through the system, and verify that nothing subtle broke. That demands a mental model of the codebase.
Now consider: the person who wrote the code didn't build that mental model either. The agent did the thinking. The author clicked "approve" on a diff they skimmed. And now they're asking you to review it.
This is the cognitive load problem nobody's measuring. Every AI-generated PR carries less context from its author. The patterns are unfamiliar, the naming conventions drift, the "why" behind decisions is absent. Your reviewer has to reconstruct all of that from scratch.
Your most senior people, the ones who can spot the subtle problems, are now buried under a wall of context-free diffs. The 2026 State of Code survey found that 96% of developers don't fully trust AI-generated code, yet only 48% actually verify it before merging.
Half of all AI-generated code ships without anyone genuinely checking it. That's not a productivity gain. That's a delayed incident.
The divergence is already here
Paul Dix predicted a split: organisations that adapt their entire delivery chain will compound their advantage, while everyone else bottlenecks in review, testing, and release.
He's right, and you can already see it happening.
Small startups and fresh YC batches operate without heavy process gates. They can point AI at every part of the delivery chain, not just the code itself. If they're good, you get tighter feedback loops, faster experiments, and surprisingly robust systems for the team size. If they're sloppy, you get lava layers that petrify within weeks.
Established teams have a different problem. They built their review processes, their CI pipelines, their release gates for a world where humans wrote code at human speed. Those processes were already the constraint. Now they're being hit with a firehose.
The gap compounds. Every week a team spends drowning in its review queue is a week it isn't shipping. Every week a leaner team spends shipping is a week it's learning from production.
What to actually do
The answer isn't another AI coding tool. You have enough of those.
The answer is investing in the boring parts of your pipeline:
- Automated testing that catches real issues: Not coverage theatre. Tests that verify behaviour, catch regressions, and run fast enough that developers don't skip them. If your CI takes 45 minutes, no amount of AI-generated code will help you ship faster.
- Deployment guardrails: Canary releases, feature flags, automated rollbacks. The infrastructure that lets you ship with confidence instead of hope. If deploying a change requires a meeting, you've already lost.
- Observability that tells you when something breaks: Not dashboards nobody checks. Alerts that fire when behaviour changes. If you find out about production issues from your customers, your monitoring is decoration.
- Review processes that scale: Smaller PRs, automated checks for common issues, clear ownership. The goal isn't to remove human review. It's to make sure humans review the things that actually need human judgment.
None of this is glamorous. None of it will trend on X. But it's the difference between a team that ships and a team that generates code.
The real question for 2026
The industry spent 2025 asking "how fast can you write code?" It was the wrong question then and it's the wrong question now.
The question for 2026 is simpler and harder: how fast can you trust code?
Because the developer who ships six PRs before lunch isn't 10x anything if those PRs sit in a queue until next Thursday. And the team that generates a thousand lines a day isn't productive if half of those lines need debugging in production.
Your pipeline is the ceiling. Everything else is just typing faster.