The Concentration Problem Nobody Is Talking About
The GitHub Octoverse 2025 report highlights a statistic that should give every open source advocate pause: 6 of the top 10 fastest-growing projects on GitHub are AI-related. Combined with the 582,000 newly created AI repositories (a 50.7% YoY increase) and the fact that AI-related activity is driving a disproportionate share of the platform’s nearly 1 billion commits, we are witnessing an unprecedented concentration of open source energy in a single technology domain.
As someone who works in AI infrastructure, I should be celebrating this. Instead, I am concerned.
The Numbers Paint a Clear Picture
Let me lay out what we know from the Octoverse data:
- 6 of top 10 fastest-growing projects: AI-related (LLM frameworks, model hubs, agent tools, etc.)
- 582K new AI repos in 2025, up 50.7% YoY
- Python drives ~50% of new AI repositories, creating a self-reinforcing ecosystem
- 180M+ developers on the platform, with a significant portion of new developer activity concentrated in AI
- Total commits approaching 1 billion, with AI projects contributing a growing share of that total
This is not organic, diversified growth. This is a gold rush.
Why I Think This Is a Problem
1. Attention and funding are zero-sum
Every dollar of venture funding, every corporate open source contribution, and every developer’s attention that flows toward AI projects is attention that is not flowing toward:
- Infrastructure and security projects: The boring but critical tools that keep the internet running. OpenSSL, curl, SQLite, core Linux kernel work — these projects are chronically underfunded and understaffed.
- Developer tooling: Linters, formatters, build systems, package managers. These are the picks and shovels of software development, and many are maintained by tiny teams or single individuals.
- Accessibility and localization: Projects that make software usable for people with disabilities or in non-English-speaking markets. These were already underfunded before AI consumed all the oxygen.
2. The quality problem in AI repos
Not all 582K new AI repos are created equal. From what I see in the AI infrastructure space, a significant portion of these repos are:
- Tutorial clones and forks: Someone following a “Build your own ChatGPT” tutorial and pushing the result to GitHub
- Thin wrappers: Projects that add a minimal UI layer on top of an OpenAI or Anthropic API call and call it a product
- Abandoned experiments: Repos with a burst of initial commits followed by months of inactivity
- Duplicate efforts: Multiple projects solving the same problem (yet another RAG framework, yet another AI agent framework) because the space moves too fast for consolidation
The quantity is impressive. The sustainability is questionable.
3. AI project maintenance is especially expensive
AI-related open source projects have uniquely high maintenance costs:
- Model compatibility: Every time a major model provider releases a new version, frameworks and tools need to be updated
- API changes: OpenAI, Anthropic, Google — they all iterate their APIs frequently, and every change cascades through the ecosystem
- Compute costs: Testing and CI/CD for AI projects often requires GPU resources, which are expensive. Many small AI open source projects cannot afford proper CI.
- Rapid obsolescence: The pace of AI development means that a framework that was cutting-edge 6 months ago might be architecturally obsolete today
What Happens to Non-AI Open Source?
This is the question that keeps me up at night. If the best talent, the most funding, and the most attention are all flowing toward AI, what happens to the rest of the open source ecosystem?
Some specific concerns:
- Maintainer burnout in non-AI projects: If you are maintaining a popular non-AI open source project, you are watching AI projects get millions in funding while you struggle to attract contributors. That is demoralizing.
- Talent drain: Strong engineers who might have contributed to infrastructure, security, or developer tooling projects are being pulled into AI by higher compensation and more exciting narrative.
- Dependency risk: The AI ecosystem is built on top of non-AI infrastructure. If that infrastructure degrades due to neglect, the AI projects built on top of it become fragile.
What Should We Do About It?
I do not have all the answers, but here are some starting points:
- Diversify open source funding: Organizations like the Linux Foundation, Apache Foundation, and GitHub Sponsors should explicitly earmark funds for non-AI critical infrastructure projects.
- Measure ecosystem health, not just growth: GitHub’s Octoverse could report on maintainer well-being, project sustainability metrics, and funding distribution alongside growth numbers.
- Corporate OSS programs should fund dependencies: If your AI product depends on non-AI open source tools, fund those tools. It is enlightened self-interest.
- Resist the AI hype cycle in OSS: Not every project needs an AI feature. Not every developer needs to pivot to AI. The open source ecosystem is healthiest when it is diverse.
The Bottom Line
AI dominating the top 10 fastest-growing projects is not inherently bad — AI is genuinely important technology. But when 6 out of 10 top projects are in one domain, and 582K new repos are all chasing the same wave, we should be asking whether we are building a healthy, sustainable ecosystem or a bubble.
What are you seeing in your corners of open source? Are non-AI projects struggling to attract contributors? Is the AI concentration helping or hurting the broader ecosystem?