xai

xAI’s Colossus Buildout Is Starting to Look Like a Neocloud With a Chatbot Attached

Anatoliy Kolodkin

06 Jun 2026 • 5 min read

Colossus was supposed to be the engine behind Grok. The more interesting version of the story, at least this week, is that it may be becoming something less glamorous and more valuable: scarce AI infrastructure rented by the month to the companies Grok is trying to beat.

TechCrunch reports that Google will pay SpaceX $920 million per month from October 2026 through June 2029 for access to about 110,000 NVIDIA GPUs, plus CPUs, memory, and related components. The number comes from a SpaceX SEC free-writing prospectus, not from the usual “people familiar with the matter” vapor trail. The filing says SpaceX entered into a Cloud Service Agreement with Google on June 5, with capacity ramping through September at a reduced fee.

That makes this a cleaner signal than most AI infrastructure gossip. Google, already one of the deepest-pocketed compute owners on the planet, is renting a six-figure GPU block from SpaceX as bridge capacity. SpaceX, now tied to xAI’s Colossus buildout, is turning AI compute into recurring revenue before its expected public-market debut. Grok is still the product people see. The balance sheet may care more about the cluster.

The chatbot story is no longer big enough

The obvious read is simple: Google needs GPUs. The better read is that xAI’s infrastructure story is starting to detach from Grok’s product story.

Colossus was publicly framed as the machine behind xAI’s frontier-model push. That made sense. Frontier models are not trained on ambition; they are trained on racks, power, cooling, networking, and a procurement team with no fear of capital expenditure. But the last few weeks have made Colossus look less like a private training asset and more like a neocloud strategy with a chatbot attached.

TechCrunch previously reported that Anthropic agreed to pay $1.25 billion per month through 2029 for access to available compute from Colossus 1 near Memphis. That earlier SpaceX filing described the arrangement as a way to “monetize unused compute capacity” and said similar services contracts were expected. Google now appears to be the next proof point. One deal could be opportunistic. Two deals, with Google and Anthropic as customers, starts to look like a business model.

There is competitive irony here, and it is not subtle. Anthropic and Google both compete directly with Grok. If xAI’s infrastructure can earn nearly a billion dollars a month from one rival and more than a billion from another, the internal opportunity cost of serving Grok becomes painfully visible. Every token served to Grok is competing against external rental revenue. That does not mean Grok is doomed. It does mean Grok has to justify compute like a product, not like a mascot.

Google calling this “bridge capacity” is the tell

Google told TechCrunch the agreement is “a short-term, timely agreement” to meet surging customer demand for its agent platform, Gemini Enterprise. That phrasing matters. Google is not a startup that forgot to order servers. Alphabet’s June investor presentation says 2026 capital expenditures are expected to reach $180 billion to $190 billion, roughly six times its 2022 spend and double last year’s, with 2027 expected to increase significantly again. The company also said Google Cloud Q1 revenue grew 63% year over year, backlog nearly doubled quarter-over-quarter to more than $460 billion, and its APIs are processing about 19 billion tokens per minute.

If that company still needs bridge capacity, the market is no longer debating whether AI agents will consume serious compute. The debate is whether anyone can build capacity fast enough without turning their capital plan into a controlled burn.

Enterprise agents are especially nasty from a capacity-planning perspective. They are not single prompts. They retrieve documents, call tools, inspect records, generate artifacts, run multi-step workflows, retry failed actions, and often sit in interactive loops where latency matters. A customer does not experience “token throughput”; they experience whether the agent finishes the contract review, updates the CRM, drafts the incident report, or blocks on a queue. At scale, agent workloads look less like web traffic and more like factory scheduling.

That is the practitioner lesson hiding under the IPO headline. Agent compute is not a vibes budget. It needs metering, routing, caching, hard caps, tool-loop limits, model selection, and explicit spend controls. If Google is renting 110,000 GPUs because customer demand overshot internal supply, your team should not be letting an autonomous coding agent run unbounded because “we’ll check the bill later.” Later is where the invoice lives.

The contract terms are more interesting than the GPU count

The SEC filing includes the kind of boring details engineers should actually care about. If SpaceX fails to deliver access to the committed GPU amount by September 30, 2026, Google can terminate after a one-month grace period or accept fewer GPUs with a pro rata fee reduction. After December 31, 2026, either party can terminate with 90 days’ notice. The filing also says Google retains ownership and intellectual property rights in its content, AI models, and related data.

That is the real shape of the market: huge commitments, delivery risk, escape hatches, and carefully carved-up data rights. Capacity is not “available” because a slide says 110,000 GPUs. It is available when it is powered, cooled, networked, scheduled, monitored, isolated, and reliable enough for a hyperscaler to put customer workloads on it.

For teams buying AI infrastructure, the checklist is not glamorous, but it is necessary. Where does the workload run? What happens if the supplier terminates a bridge-capacity deal? Are logs isolated? Who owns prompts, outputs, fine-tuned artifacts, embeddings, and traces? What are the latency and region guarantees? What capacity is reserved for your tier during a launch spike? Can the vendor explain failure modes without translating everything into “more GPUs”?

“Powered by 100,000 GPUs” is not an SLA. It is a procurement anecdote until the failure modes are written down.

xAI may have built a cloud before it built a durable moat

For xAI, the strategic question is whether this strengthens Grok or makes Grok secondary. Leasing capacity to Google and Anthropic can finance the buildout, smooth utilization, and validate Colossus as serious infrastructure. It also changes how investors and practitioners should read xAI. The company may be less of a pure model lab than advertised and more of an AI infrastructure operator that happens to run its own model product.

That is not a criticism by itself. In a world where compute scarcity is the bottleneck, owning the bottleneck is a good business. But it complicates the Grok narrative. If the best customers for xAI-adjacent compute are Grok’s rivals, the market may decide the pipes are more valuable than the model running through them. The uncomfortable version: Grok helped justify the cluster; the cluster may become the product.

This should also sober up smaller AI companies. Alphabet has TPUs, NVIDIA GPUs, Axion CPUs, global data centers, Google Cloud, DeepMind, billions of users, and the balance sheet to raise tens of billions for infrastructure. It still wants bridge capacity. Startups treating GPU availability as a solved backend detail are writing fiction in YAML.

The practical move for engineering leaders is to design for compute scarcity even when vendors promise abundance. Put budgets on agent workflows. Route simple tasks to cheaper models. Cache repeated context aggressively. Pin model choices to business value, not leaderboard screenshots. Add stop conditions to autonomous loops. Track cost per completed workflow, not just cost per token. And when a vendor sells you capacity, ask whether you are buying reserved infrastructure or best-effort access in someone else’s utilization game.

My read: this is the moment Colossus stops being just Grok’s training cluster and becomes part of the AI infrastructure market. xAI’s most valuable product today may not be Grok. It may be the compute scarcity Grok helped rationalize building.

Sources: TechCrunch, SpaceX SEC filing, TechCrunch on the Anthropic compute agreement, Alphabet June 2026 investor presentation

The chatbot story is no longer big enough

Google calling this “bridge capacity” is the tell

The contract terms are more interesting than the GPU count

xAI may have built a cloud before it built a durable moat

Sign up for more like this.