claude-code

NSA’s MCP Security Guidance Is the Checklist the Agent Ecosystem Was Trying Not to Need

Anatoliy Kolodkin

30 May 2026 • 5 min read

MCP just got its first serious “please stop shipping vibes into production” memo from the people whose job is noticing when flexible systems become attack surfaces.

The National Security Agency’s Artificial Intelligence Security Center has published a 17-page guidance document on Model Context Protocol security, and the useful part is not that NSA has discovered agents. It is that the document names the thing many teams have been quietly building around: MCP is no longer a cute connector format. It is becoming the tool bus for AI systems that can read repositories, query internal services, write tickets, touch databases, call SaaS APIs, and hand results back to a model that may decide the next move.

That changes the security conversation. A bad MCP setup is not “our chatbot saw the wrong wiki page.” It is an authorization, provenance, logging, and tool-execution problem with a language model in the middle. The protocol gives agents a common way to discover tools and exchange context. The deployment model often gives those tools access to systems that were never designed for probabilistic callers.

NSA’s guidance is blunt about the timing mismatch. It describes MCP as having become a de facto standard across AI-driven services, including experimental and production deployments in software development, finance, legal, and business automation. It also says MCP’s proliferation has outpaced the development of its security model, comparing the current moment to early web protocols: useful, flexible, widely adopted, and under-specified in exactly the places defenders care about.

The protocol is not the policy layer

The report’s central warning is architectural. MCP reverses a familiar pattern. Instead of a client simply asking a server for data, MCP-enabled systems often let servers expose tools, provide context, and participate in action chains where an AI host may call capabilities on the user’s behalf. That creates what NSA calls “new and largely not well-traced attack paths.” In plain engineering terms: the data plane and instruction plane are now uncomfortably close to each other.

That is why the failure modes are broader than “prompt injection.” NSA calls out access-control ambiguity, insecure context and data serialization, weak approval workflows, token and session security, tool poisoning, path confusion, missing audit logs, and denial-of-service or fatigue-based techniques. It also explicitly points to arbitrary code execution risk under CWE-77, CWE-78, CWE-94, and CWE-95 when attacker-controlled inputs reach execution environments without proper constraints.

For Claude Code users, this is not policy wallpaper. Claude Code and similar coding agents increasingly live at the intersection of shell access, repository context, issue trackers, package managers, CI logs, browser sessions, internal docs, and MCP servers. If MCP is the bridge into those systems, then MCP permissions are part of your production security model whether or not the agent feels like “developer tooling.”

The GitHub example is the one that should make engineering managers sit up. NSA highlights the risk of broad read/write repository access in GitHub-based MCP tools, where a malicious or compromised server can touch private and public repositories without the user fully understanding the blast radius. That is not an exotic attack. It is OAuth scope sprawl plus an assistant optimized to comply. If a coding agent can read private source, open PRs, post comments, inspect CI, and call external services, then a compromised tool path can turn convenience into exfiltration with a nice progress spinner.

Approval prompts are not governance

One of the better parts of the NSA document is its skepticism of approval workflows. Many agent products lean on prompts: approve this server, approve this tool, approve this command, approve this action. Prompts are better than silent execution, but they are not a durable security boundary. Users habituate. Tool descriptions change. Servers gain capabilities after installation. A model can call the same innocuous-looking tool in a dangerous context. Eventually the permission dialog becomes the new cookie banner: legally present, operationally ignored.

The replacement is not “more modals.” It is boring infrastructure: least privilege, explicit trust boundaries, schema validation, sandboxed execution, signed or verified messages where appropriate, vulnerability tracking, and logs that tell responders who called what, with which parameters, under which policy, and what came back. NSA’s recommendations point in that direction: choose supported MCP projects, design trust boundaries, validate parameters, constrain and sandbox tool execution, filter output pipelines, instrument detection, track vulnerabilities, and scan local networks for open or vulnerable MCP servers.

That last item is easy to underweight. Shadow MCP servers are the agent-era equivalent of forgotten admin panels. A developer spins one up for a demo, a team leaves a local bridge listening, an integration runs with overbroad credentials, and suddenly “the agent can use our internal tools” means nobody knows which internal tools, on whose machine, with what permissions. Inventory is not glamorous. Neither is incident response when the inventory does not exist.

Treat tool output like untrusted source code

The most important practitioner shift is how teams think about outputs. In conventional application plumbing, output from a service is usually treated as data. In agent systems, output from one tool can become context for the model, which can become instructions for another tool call. That means MCP outputs are not passive strings. They are potential control-flow inputs.

This is where indirect prompt injection stops being a blog-post curiosity and becomes a systems problem. A server can return poisoned metadata, hidden instructions, misleading tool descriptions, or content shaped to steer the next model decision. If the host blindly passes that output into the next reasoning step, the agent pipeline has become a compiler for attacker-supplied intent. The safer default is to treat every tool result as untrusted input for the next phase, especially when the result came from the web, a ticket, a document, a repository controlled by someone else, or any system with user-generated content.

For engineering teams, the immediate checklist is short enough to act on this week. Inventory every MCP server and assign an owner. Separate read-only tools from write-capable tools. Avoid blanket repository, workspace, or organization scopes. Put sensitive MCP servers behind local or private network boundaries where possible. Validate tool parameters against schema and context, not just type. Deny ambiguous parameter forwarding. Run command-capable tools in sandboxes with filesystem and network constraints. Log identities, tool names, parameters, decisions, output hashes, and policy results. Rehearse revocation before the incident, not during it.

Claude Code teams should add one more item: review MCP and skill configuration with the same seriousness as CI config, package scripts, editor tasks, and pre-commit hooks. The fact that something is described in Markdown or JSON does not make it harmless. If it changes what an agent can see, call, or execute, it belongs in code review. “It’s just a connector” is how the connector becomes the breach path.

The broader industry read is simple: MCP won developer mindshare before the security ecosystem fully caught up. That does not make MCP bad. It makes MCP normal. Every useful integration surface goes through this phase: adoption first, governance second, painful lessons somewhere in the middle. The teams that get ahead of it will not be the ones with the most elaborate safety prompt. They will be the ones that can answer four questions without a meeting: what is plugged in, what can it touch, what did it do, and how do we turn it off?

NSA has effectively handed the agent ecosystem a checklist it was trying not to need. Take the hint. The future of coding agents is not fewer tools. It is better boundaries around the tools we are already letting them use.

Sources: NSA Cybersecurity Advisory, NSA MCP Security PDF, Model Context Protocol specification, MCP servers repository

The protocol is not the policy layer

Approval prompts are not governance

Treat tool output like untrusted source code

Sign up for more like this.