google-ai

Google Just Let the Government Review Its AI Models Before You Can — And That's a Bigger Deal Than It Sounds

Anatoliy Kolodkin

05 May 2026 • 5 min read

Something interesting happened the week of May 5, 2026. Google signed an agreement with the U.S. government to let it review new AI models before public release. Forty-eight hours later, Reuters reported that the same agreement was partly spurred by alarm inside the government over a different company's model — Anthropic's Mythos, which officials believe has crossed some threshold of hacking capability that made Washington's hair stand up. And Anthropic itself was not at the table. It was outside, designated a supply-chain risk by the Pentagon after refusing to give the government unrestricted access to its models.

That is the story. But it is not the headline.

The headline is that a two-track system for AI governance is taking shape in real time, and Google just chose its track.

The Mechanism and What It Actually Does

The Center for AI Standards and Innovation (CAISI) at the Department of Commerce signed agreements on May 5 with Google DeepMind, Microsoft, and xAI. The agreements grant CAISI pre-deployment access to review new frontier models before they reach the public, alongside provisions for post-deployment assessment, classified-environment testing, and ongoing research into national security risks.

CAISI has completed more than 40 evaluations to date, including on unreleased state-of-the-art models. The agreements specifically support evaluations where developers hand over models with "reduced or removed safeguards" — meaning government evaluators see what the public version will not show. Evaluators from across the interagency participate and provide feedback through the TRAINS Taskforce, a group focused on AI national security concerns. The testing can happen in classified environments.

CAISI Director Chris Fall put it this way: "Independent, rigorous measurement science is essential to understanding frontier AI and its national security implications. These expanded industry collaborations help us scale our work in the public interest at a critical moment."

That is the press-release version. The Reuters reporting fills in the urgency: U.S. officials have been alarmed by Anthropic's Mythos model's "hacking capabilities" in recent weeks, and that alarm accelerated the push to formalize access agreements with the companies still willing to provide it.

The Two-Track System Is Not Theoretical Anymore

Here is the structural fact that gets lost in the day-to-day AI policy coverage: companies that cooperate with government review get commercial access and defense contracts. Companies that refuse get designated supply-chain risks and lose access to federal markets. That is not a subtle incentive. That is a market signal.

Anthropic refused to give the government unrestricted access to its models — specifically, it would not remove guardrails against autonomous weapons and domestic surveillance use cases. The Pentagon's response was immediate and material: Anthropic was blacklisted as a supply-chain risk in February 2026. The consequence was not a strongly-worded memo. It was losing access to defense markets.

Google, Microsoft, and xAI signed the CAISI agreements. The consequence of signing is the classified Pentagon deal reported April 28, the ongoing government cloud work, and continued eligibility for defense-adjacent contracts. Alphabet lifted its ban on AI for weapons and surveillance tools in 2025, aligning its ethical guidelines with "national security" needs — the same language it is now using to justify the classified work.

The irony is not lost on the developer community. The company most associated with AI safety — Anthropic — is now blacklisted by the Pentagon. The companies that have historically been more willing to work with defense and national security use cases are the cooperative participants. That is a data point about what the current U.S. government actually rewards, not what any company says about its values.

What Practitioners Should Actually Take From This

Most developers using Gemini are not thinking about TRAINS Taskforces or classified-environment evaluations. They are thinking about whether the model will be available, priced right, and behave predictably. This story does not change any of that today. But the mechanism that was just formalized has real implications for what future model releases look like.

CAISI evaluations that flag national security concerns could trigger delays, modifications, or a government advisory attached to a model's enterprise positioning. The 40 evaluations already completed suggest this is not a future risk — it is happening now, quietly, on unreleased models the public has never touched. If you are building production systems on Gemini, the safety behaviors you tested in May may reflect the public version, not the version that went through a CAISI review with reduced safeguards.

For developers building security-sensitive applications — penetration testing tools, code generation for infrastructure, anything touching cybersecurity workflows — this context is not academic. The government is explicitly evaluating these models for capabilities that map directly to offensive security use cases. That does not mean Gemini is unsafe for your use case. It means the model capabilities landscape is more complicated than a public benchmark.

The practical question for builders is not "should I trust Google with my data." It is: does pre-deployment government review change the risk profile of any model I am building on? The honest answer is: not yet for publicly-available models, but the architecture for government-influenced model behavior now exists, and it has already been used on unreleased state-of-the-art systems.

The Internal Dissent Data Point

One detail from the surrounding coverage deserves its own note. More than 600 Google employees sent an open letter to Sundar Pichai urging him not to sign the classified Pentagon deal reported April 28. The deal was signed hours later. A Google researcher told Business Insider: "When I went to bed yesterday, I was hopeful that the employee letter would have an effect... This morning I woke up to a worst-case version of the contract being signed."

In 2018, Google declined to renew its Project Maven drone surveillance contract after employee backlash. Palantir took it over. That moment was held up as a landmark case of tech employee activism working. The April 2026 classified deal — signed hours after the internal letter, with Google's own AI safety settings now adjustable at the government's request — suggests that era is over. The employee activism mechanism broke. Alphabet's 2025 decision to remove "weapons or surveillance" language from its ethical guidelines was the formal end of the line. The contract was the practical one.

For developers who care about the ethical positions of the platforms they build on, this is not a comfortable data point. But it is an important one. The story is not that Google is uniquely compromised. OpenAI and xAI have similar government arrangements. The story is that the era of meaningful corporate ethical red lines in defense AI work is over, and the companies still willing to work inside the government's preferred framework are being rewarded with contracts and access while the holdouts are being penalized.

The Editorial Take

Read the headlines and you see "Google agrees to government AI review." That sounds procedural. Sounds like a checkbox.

Look at the actual structure — the two-track system, Anthropic's blacklisting, the TRAINS Taskforce, the classified-environment testing, the Reuters reporting tying the urgency to alarm over a model's hacking capabilities — and you see something more consequential. This is not a compliance story. This is a story about who gets access to the most powerful AI systems, under what conditions, and who gets shut out when they refuse.

Google picked its side. Microsoft and xAI picked theirs. The companies that cooperated are inside the government contracting tent. The company that drew a line is outside, locked out of defense work, and designated a supply-chain risk. That is not a neutral outcome. That is a market signal about what kind of AI company the current U.S. government will reward.

For builders: the model you are using today is not changing because of this. But the governance architecture around future frontier models just got clearer — and it looks less like safety standards and more like a procurement preference list.

Sources: The Verge, NIST/CAISI, Reuters, The Guardian

The Mechanism and What It Actually Does

The Two-Track System Is Not Theoretical Anymore

What Practitioners Should Actually Take From This

The Internal Dissent Data Point

The Editorial Take

Sign up for more like this.