google-ai

David Silver's .1B Raise Is the DeepMind Alumni Network Going Independent at Scale

Anatoliy Kolodkin

27 Apr 2026 • 4 min read

David Silver left DeepMind and raised $1.1 billion at a $5.1 billion valuation. The headline writes itself. The interesting question is what exactly he is building — and why so many smart investors decided the answer was worth more than the GDP of a small country before a single model existed.

Silver, who led AlphaGo from first paper to the Lee Sedol match and then to AlphaZero, has been quiet about his next move since departing DeepMind. What we know: his new company is called Ineffable AI, it is building systems that learn without human-generated data, and it has backing from Sequoia and Nvidia at a valuation that puts it in serious territory before the company has published anything or released a product. That is a sentence that would have sounded absurd five years ago and now sounds like Tuesday.

The "without human data" framing is doing a lot of work in the announcement. Silver has been one of the clearest internal advocates at DeepMind for reinforcement learning as the path to genuinely general intelligence — not because language models are bad, but because the kind of learning you get from self-play and environmental interaction reaches parts of the capability distribution that predicting the next token simply cannot. AlphaZero, which learned to play Go, Chess, and Shogi from scratch with no human games as input, was always the proof of concept. Ineffable is the attempt to scale that idea past games into something more like the real world.

Why $1.1 Billion Changes the Conversation

Funding rounds this size are not just capital events. They are market signals. When Sequoia and Nvidia co-lead a round at a $5.1 billion post-money valuation for a company with no published research, no product, and no name most developers would recognize, they are making a specific argument: that the next level of AI capability requires a different research direction than the one the entire industry has been running down for the past several years. That argument has been whisperable in AI labs for a while. It has never before been accompanied by a nine-figure check.

The Nvidia involvement is particularly notable. Nvidia's interest in AI chip demand is obvious. But investing at the model layer — and investing in a company whose stated goal is learning without human data, which could imply radically different inference and training compute profiles than current transformers — suggests Nvidia is also thinking about what the hardware stack looks like if the industry pivots toward RL-heavy self-supervised learning at scale. If Ineffable's approach requires different GPU utilization patterns, different memory bandwidth characteristics, or different distributed training topologies than standard LLM training, Nvidia's LP stake is also an option on that scenario.

Sequoia's involvement has a different but equally interesting logic. Sequoia has been building a thesis around AI infrastructure and application layer for several years. A $1.1 billion bet on a research-first, pure-play AGI shop with a clear DeepMind lineage is a statement that they believe the capability ceiling is not close — and that the path there requires the kind of long-horizon, high-conviction research investment that public markets cannot price and typical venture time horizons cannot absorb.

What "Learning Without Human Data" Actually Means

This is where honest skepticism is warranted. "Learning without human data" is a real research paradigm with genuine results behind it. AlphaZero is the canonical example. The General Language Understanding Evaluation benchmark and reading comprehension scores have been beaten by models trained without human-labeled data. The concept is not science fiction.

But scaling that paradigm past game environments into open-ended real-world tasks is a fundamentally different problem than scaling it within a well-defined game. Games have clear victory conditions, well-specified state spaces, and fast simulation. Real-world tasks have ambiguous success criteria, incomplete feedback, and environments that are expensive, slow, or dangerous to simulate. Getting a reinforcement learning system to learn to write code that passes code review, or to discover novel scientific hypotheses, or to design a robot controller for an unfamiliar physical task — these all require solving RL problems that are currently too sample-inefficient, too slow, or too brittle for practical use at frontier scale.

The AlphaFold story is actually the best historical analogy here. DeepMind did not start with protein folding solved. It spent years building AlphaFold 1 and 2 as research systems before AlphaFold 3 became the kind of tool that 85,000 Korean researchers would use in production workflows. The jump from "proof of concept that works in the lab" to "reliable tool scientists trust" is where the hardest work lives. Silver knows this better than almost anyone, which means the $1.1 billion is buying not just current research but a multi-year timeline that investors with shorter horizons would not stomach.

For Practitioners: What This Means Today

Ineffable AI will not ship a product this year. That is not the point of the round. The point is that the research direction Silver represents — reinforcement learning at scale, self-supervised world models, systems that learn through interaction rather than human text — now has enough institutional backing that it is a credible third path alongside next-token prediction and mixture-of-experts scaling.

For engineers and architects building AI systems today, this matters in a specific way: it is another reason not to assume that the current transformer + human text pretraining paradigm is the final form of the field. The architecture assumptions baked into your vector database, your fine-tuning pipeline, your RLHF workflow, and your evaluation framework all assume that "more human data at scale" is the primary input. If Ineffable's work produces genuinely different capability profiles in 3-5 years, the systems being built today will need to accommodate that. Not by pivoting immediately, but by keeping an eye on where the RL-at-scale research agenda actually lands.

It also matters for hiring and team building. The DeepMind alumni network is not a abstraction — it is an actual network of researchers and engineers who have worked together at the frontier, know each other's research taste, and share institutional norms about what rigorous work looks like. Silver's ability to raise at this valuation tells you something about how that network is mobilizing. If you are building a serious AI research or engineering team and have not thought carefully about what your relationship to that ecosystem looks like, this fundraise is a useful signal that the competitive landscape for talent and direction is about to get more complicated.

None of this means Ineffable succeeds. Building at the frontier with long time horizons is how you end up with either genuine breakthroughs or very expensive empty buildings. But $1.1 billion at a $5.1 billion valuation is not a bet on immediate output. It is a bet that the RL path is real, that it scales, and that David Silver is the person to find out.

Sources: TechCrunch, Wired, Bloomberg

Why $1.1 Billion Changes the Conversation

What "Learning Without Human Data" Actually Means

For Practitioners: What This Means Today

Sign up for more like this.