google-ai

Alphabet’s AI Pitch Is Really a Compute-Supply Confession

Anatoliy Kolodkin

03 Jun 2026 • 4 min read

Alphabet’s June investor presentation reads like a confidence deck until you notice the confession hiding in plain sight: Google does not have enough compute for the AI demand it is already seeing. That is the useful story for builders. The company is not merely saying Gemini is popular or Cloud AI is growing. It is telling the market that model access, latency, quotas, product packaging, and developer tooling are now downstream of infrastructure supply.

The numbers are not subtle. Alphabet says its proposed capital raise began as $80 billion, including a $10 billion Berkshire Hathaway investment and a $30 billion underwritten offering. The offering was oversubscribed, about $35 billion was priced and allocated, and the total is expected to land around $85 billion. Alphabet now expects 2026 capital expenditure of $180 billion to $190 billion — roughly six times its 2022 CapEx of about $31 billion, about double last year’s spend, and likely to rise again in 2027.

Sundar Pichai’s explanation was blunt: AI product and service demand from enterprises and consumers is “meaningfully exceeding” available supply. That line matters more than the finance headline. It explains why Google is raising at this scale, why Cloud is central to the pitch, why TPUs are getting more prominent, and why Antigravity is suddenly an investor-story detail rather than just a developer-tool launch.

The AI platform war is becoming a supply-chain problem

Alphabet’s operating metrics show why the company thinks the spending is justified. Q1 revenue climbed 22% year over year to $110 billion, and operating income grew 30% year over year to $40 billion. Search revenue grew 19%. Google Cloud Q1 revenue grew 63%, backlog nearly doubled quarter over quarter to more than $460 billion, and later in the transcript CFO Anat Ashkenazi puts Cloud backlog at $462 billion. Cloud delivered a record $20 billion in quarterly revenue, 33% margins, and $7 billion in operating income, more than tripling year over year.

That is the business backdrop. The technical backdrop is token volume. Google says Gemini token volume rose from 9.7 trillion tokens per month two years ago to 3.2 quadrillion tokens per month — more than 300x growth across surfaces. Its model APIs process about 19 billion tokens per minute. More than 8.5 million developers build with Google’s models each month, and more than 375 Cloud customers each processed over one trillion tokens in the last 12 months.

Those numbers should change how engineering teams think about platform risk. The early AI-platform adoption phase trained everyone to expect faster models, bigger context, lower prices, more generous previews, and fewer constraints every quarter. That trend may continue in some areas, but it now has to coexist with a supply curve. When demand outruns capacity, the effects show up as rate limits, priority tiers, enterprise commitments, regional availability gaps, model routing, queueing, context-window economics, and pricing changes. In other words: infrastructure scarcity leaks into product behavior.

Google’s TPU story is meant to reassure investors that the company can bend that curve. Alphabet says it operates 10 million kilometers of terrestrial and subsea fiber, more than 30 data centers, and more than 40 Cloud regions. Its eighth-generation TPUs, 8i and 8t, use a dual-chip design for training and inference in what Google calls the “agentic era.” The company says it reduced Gemini serving costs by 78% in 2025; Ashkenazi also says core AI response costs are down more than 30% since Gemini 3 launched. Those efficiency gains are crucial. Inference economics, not keynote applause, determine whether AI products become businesses.

Antigravity is now part of the infrastructure argument

The developer-tooling detail buried in the investor material is just as interesting as the CapEx number. Google says Antigravity launched six months ago, has “millions” of developers building with it, now includes a standalone desktop app, and is being brought to Cloud customers. Internally, Antigravity coding-harness usage is doubling token volume every few weeks and recently reached more than three trillion tokens per day.

That is not a cute adoption stat. It is a signal that agentic coding has become an infrastructure workload inside Google. Coding agents generate long traces, branch into tool calls, retry failed plans, read and write files, run tests, and consume context aggressively. A normal autocomplete tool is a convenience feature. A coding harness using trillions of tokens per day becomes a capacity-planning input, a model-feedback loop, and a reason to integrate developer tooling with Cloud infrastructure.

This is where the Gemini CLI to Antigravity migration cluster stops being SEO trivia. Google is consolidating agentic development into a platform surface: desktop app, Cloud path, internal dogfooding, model routing, and likely enterprise controls. Builders evaluating Antigravity should ask the boring questions now: How portable are project formats? How visible are tool calls? What does the approval model look like? Is MCP support inspectable? Are audit logs exportable? Can models be swapped? Can a team run local or non-Google models for low-risk work? What happens when quota gets tight?

The answer to those questions will matter because Google is telling investors the free, cheap, always-available AI frontier is capital intensive. A $462 billion Cloud backlog gives Google a credible demand story, but backlog is not the same as margin-safe revenue. If 2027 CapEx climbs again from a $180 billion to $190 billion base, the pressure to monetize every AI surface gets sharper. Expect more packaging around AI plans, enterprise commitments, Cloud TPU access, agent platforms, and premium developer tooling. The free lunch will not disappear. It will acquire a procurement workflow.

For practitioners, the takeaways are immediate. Measure token spend by feature, not just by product. Cache aggressively. Track latency and retry behavior separately for interactive users and background agents. Keep local and open models in the test matrix for workloads that do not need frontier intelligence. Avoid hard-coding one agent harness as if the platform war is over. If you are making a Google Cloud AI commitment, ask about quota, regional availability, TPU/GPU roadmap, burst behavior, and failure modes before the workload is business-critical.

Also: treat agentic systems like distributed systems, not chatbots. Log tool calls. Track approvals. Correlate traces across model calls, user actions, and backend systems. Put budgets on agents before they find creative ways to spend money. Compute scarcity will punish teams that cannot explain where their tokens went.

Alphabet’s pitch is that AI demand is now real enough to justify infrastructure spending at historic scale. That may be true. But for builders, the sharper read is simpler: AI platforms are becoming supply-constrained infrastructure businesses. The teams that design for quota, cost, portability, and observability now will have a much better time when the investor-deck numbers turn into product constraints.

Sources: Google Blog, Alphabet Investor Relations, Alphabet investor presentation PDF, Reuters, CNBC

The AI platform war is becoming a supply-chain problem

Antigravity is now part of the infrastructure argument

Sign up for more like this.