ai-frameworks

MCPGuard Is a Tiny Firewall for the Part of MCP Security Teams Actually Need to Block

Anatoliy Kolodkin

31 May 2026 • 4 min read

MCP security is finally getting the boring shape it always needed: a checkpoint that can say no before an agent turns a suggestion into a system call.

That is why MCPGuard is worth paying attention to even though it is tiny, early, and very much not a household name. The project describes itself as a “security scanner and firewall for MCP servers” that checks configs for known issues, blocks suspicious tool calls at runtime, and keeps audit logs. In other words, it tries to put policy in the hot path between an agent and the Model Context Protocol tools it is about to invoke.

That sounds obvious. It is not. Most agent-security advice still lives as documentation, prompt guidance, or a checklist someone reads during setup and forgets the first time a demo works. MCP changes the shape of the problem because tools are named, schema-bearing calls. Once the agent action is structured, the industry can stop pretending a smarter model is the only defense and start applying the older lesson every web engineer already knows: validate the request before the handler runs.

A WAF-shaped idea for tool calls

MCPGuard’s split between scanning and runtime enforcement is the right mental model. The scanner side looks for dangerous MCP configuration before deployment: unencrypted non-localhost HTTP, wildcard permissions, shell metacharacters, sensitive paths, exposed Docker sockets, privileged container flags, literal secrets, insecure defaults, input-validation gaps, and missing audit posture. The runtime side applies policy rules to tool calls as they happen, matching against fields such as tool.name, tool.description, param.name, param.value, server.name, and server.url.

The policy format is deliberately mundane: YAML rules with allow, deny, and audit, plus operators like equals, contains, matches, startsWith, endsWith, and in. That is not glamorous, which is exactly the point. The thing standing between an LLM and a filesystem tool should look less like a magic incantation and more like a firewall rule your platform team can review in a pull request.

The project says it maps its checks to the OWASP MCP Top 10 (2026), covering categories including tool poisoning, excessive permissions, insecure transport, command injection, path traversal, secret exposure, insecure defaults, missing validation, audit gaps, and privilege escalation. It also includes prompt/tool-poisoning tripwires for phrases such as “ignore previous instructions,” fake system: text, HTML comments, “override,” “do not tell,” “secretly,” and “hidden instruction.” Those checks will not catch every attack. Regex never wins the whole war. But they are useful speed bumps in a world where many teams currently have no runtime speed bumps at all.

The threat is not theoretical; the boundary is just new

The official MCP security guidance emphasizes confused-deputy risks in MCP proxy servers and calls for per-client consent, redirect URI validation, CSRF protection, secure cookies, and consent screens that show the client name, scopes, and redirect target. That broader protocol work matters because MCP is not only “let the agent call a tool.” It is identity, authorization, transport, consent, and tool semantics all colliding inside a developer-friendly abstraction.

MCPGuard focuses on only part of that surface: config and tool-call policy. That is a limitation, but it is also why the project is useful. It gives teams a concrete enforcement point for the moment when poisoned context becomes an attempted action. If an agent reads hostile documentation and then calls shell_exec with a destructive argument, the correct answer is not “we should have prompted it better.” The correct answer is “that call should have been refused by infrastructure that does not care how persuasive the model sounded.”

There is a useful analogy to web application firewalls, but it should not be taken too far. A WAF can be bypassed, misconfigured, and oversold. So can an MCP tool firewall. The value is not perfect semantic understanding. The value is an inspectable layer where teams can encode obvious invariants: do not read .ssh, do not write outside the workspace, do not call production deploy tools from untrusted sessions, do not pass shell metacharacters to convenience wrappers, do not let a documentation server request local network resources, and always audit high-risk tools even when allowed.

Small repo, useful signal

The adoption signal is modest. At research time, the GitHub repo had one star, no forks, and no open issues; Hacker News searches for MCPGuard and related queries returned nothing useful. The NPM package @gtprojects/mcpguard was created on May 31, 2026, with version metadata around 0.1.x, MIT licensing, Node >=18, and dependencies including commander, glob, js-yaml, winston, and zod. This is early infrastructure, not a mature control plane.

That distinction matters. No serious team should install a brand-new one-star security proxy and declare the MCP problem solved. The right adoption path is more conservative: run the scanner in CI against MCP config files, start the proxy in audit-only mode, compare findings with real agent traces, tune noisy rules, and only then promote a small set of high-confidence denies. Treat the tool as a policy checkpoint you can learn from, not as a certificate of safety.

The project-supplied figures are also worth handling carefully. MCPGuard cites studies claiming 82% of MCP implementations with path traversal issues, 67% with code injection vectors, and roughly 5.5% of public servers with tool poisoning. Those numbers may be directionally plausible given the current maturity of the ecosystem, but they should be treated as project-supplied until independently verified. The safer editorial point does not depend on the exact percentages: MCP servers expand an agent’s authority, and authority needs runtime controls.

What teams should do now

If your team is using MCP seriously, build a layered checklist. Start with server inventory: which MCP servers are installed, who owns them, what tools do they expose, what permissions do they have, and which agents can reach them? Then add config scanning for obvious hazards: broad filesystem mounts, command runners, Docker socket access, local-network reachability, secrets in config, and unencrypted remote transport. After that, add runtime enforcement: allowlists for safe tools, deny rules for high-risk parameters, audit rules for sensitive actions, and kill switches when a session starts behaving like a loop with privileges.

The most important implementation detail is ownership. MCP policy should live in version control, not in someone’s local dotfiles. It should be reviewed like infrastructure code. It should be tested against known-bad calls. It should produce logs that can answer the incident-review question: what did the agent try to do, what was blocked, who configured that policy, and what context triggered the attempt?

MCPGuard may or may not become the tool people standardize on. That is less important than the pattern it represents. The agent ecosystem is moving from “tools are cool” to “tools are authority,” and authority needs an enforcement boundary. Prompts can express intent. Policy has to enforce it.

Sources: MCPGuard GitHub repository, MCP security guidance

A WAF-shaped idea for tool calls

The threat is not theoretical; the boundary is just new

Small repo, useful signal

What teams should do now

Sign up for more like this.