ai-frameworks

Armorer Guard’s MCP Proxy Puts Agent Security Where It Belongs: In Front of the Tool

Anatoliy Kolodkin

15 May 2026 • 5 min read

Armorer Guard v0.2.4 is a licensing and packaging release for a tiny project with 22 GitHub stars. That would normally be a footnote. But one day earlier, v0.2.3 added a local MCP proxy for enforcing policy on stdio tool calls, and that is the interesting part: a small Rust binary is pointing at one of the biggest gaps in agent security.

The gap is simple. Agent frameworks keep giving models more tools, more filesystem access, more browser access, more ticketing-system access, and more ways to move data. The main line of defense is still too often a system prompt telling the model to be careful. That is not a control. That is a suggestion written to the component most likely to be manipulated.

Armorer Guard’s bet is better: put enforcement at the point where tool calls become actions. Wrap the MCP server, inspect the JSON-RPC tools/call arguments locally, and return a block response before the requested action reaches the tool. It is not glamorous. It is also much closer to how real security systems work.

The proxy belongs at the action boundary

Version 0.2.3 introduced armorer-guard mcp-proxy for line-delimited stdio MCP enforcement. The example shape is straightforward: armorer-guard mcp-proxy -- npx your-mcp-server. The proxy sits between the agent client and the MCP server, scans params.arguments, and can return JSON-RPC block responses with reasons, confidence, sanitized text, and scan IDs. It can also write JSONL audit receipts with --audit-log.

That placement matters. Prompt injection defenses inside the model are advisory. A proxy in front of a tool is procedural. If the model is tricked into passing a credential, deleting a path, calling a shell command, posting customer data, or browsing to an internal URL, the proxy still gets a chance to say no. That does not make the system “safe,” because no single guardrail does. But it changes the failure mode from invisible compliance theater to observable enforcement.

This is exactly where MCP security needs to go. MCP has made tool integration easier, which is useful. It has also standardized a path for agents to call systems that mutate files, open browsers, hit APIs, and retrieve sensitive data. Standardization is a gift to both builders and attackers. If the action boundary is consistent, it can be wrapped, logged, tested, and governed. Armorer Guard is small, but it is exploiting that architectural leverage.

The v0.2.3 release also added a Node wrapper as @armorerlabs/[email protected], a Python entry point armorer-guard-py, binary version output, smoke scripts, fixtures, Claude Code/Cursor/Windsurf/MCP docs, a security model, a comparison guide, and a marketing playbook. That is a lot for a young project, and teams should separate the useful engineering from the inevitable early-stage positioning. The useful engineering is the proxy and the audit trail.

Local scanning is the right default for agent tool arguments

Armorer Guard is written in Rust and runs locally. That matters more than the implementation-language bragging rights. Agent tool arguments are not harmless telemetry. They may contain source code, customer records, internal hostnames, credentials, file paths, queries, operational intent, and partial secrets. Sending all of that to a cloud scanner creates a second exfiltration path while trying to prevent the first one.

A local-first scanner with Python and Node wrappers fits the shape security teams are likely to tolerate. It can be placed in front of risky stdio MCP servers without sending every attempted action to a third party. JSONL audit receipts also matter because “the agent did something weird” is not an incident report. Teams need scan IDs, sanitized text, reasons, confidence, timestamps, and enough context to reconstruct what was blocked without leaking the sensitive payload again.

The project-reported metrics are eye-catching: average classifier latency of 0.0247 ms, micro recall of 1.0000, and macro F1 of 0.9833 across 1,411 validation rows. Those numbers should be treated as claims, not conclusions. The obvious questions are dataset coverage, adversarial robustness, false positives, false negatives on domain-specific secrets, and how well the scanner handles weird tool schemas. A benchmark can say “this classifier worked on this dataset.” It cannot say “your agent stack is protected.”

Still, latency matters. Security controls that add visible lag to every tool call will get disabled, especially in developer workflows. If Armorer Guard can stay effectively invisible while blocking high-risk calls, that is the right product shape. The hard part will be policy tuning: strict enough to stop dangerous actions, explainable enough for developers to trust, and adaptable enough not to become another noisy gate everyone bypasses.

Start with the tools that can actually hurt you

Practitioners should not put a young proxy in front of every tool on day one and declare victory. Start with the MCP servers that can do damage: shell execution, filesystem writes, browser automation, ticketing systems, secrets managers, deployment controls, cloud APIs, customer-data stores, and anything that can send data outward. Run in audit mode first if possible. Collect blocks. Review sanitized text and scan IDs. Tune policy against your actual workflows, not generic examples.

Also test the uncomfortable cases. Can the proxy catch credential-shaped content embedded in JSON? What about base64? What about prompt-injection text hidden inside a file path or issue body? What happens when a legitimate command looks suspicious? How do developers override a block, and is that override logged? If the answer is “we trust the agent,” you have not built a security system; you have relocated the vibes.

Version 0.2.4 itself is mostly cleanup: the project moved to the MIT License, updated Cargo, Python, and npm package metadata to 0.2.4, removed older noncommercial/commercial license wording, and reports verification with cargo test --locked, release builds, clippy with warnings denied, Python tests, wheel build, npm wrapper tests, Cargo publish dry run, and a GitHub PyPI wheel workflow. Licensing matters here because security infrastructure needs adoption. A noncommercial license around a developer security proxy is friction. MIT removes one reason not to try it.

The bigger story is that agent security is becoming infrastructure-shaped. The answer to prompt injection and tool misuse will not be a perfect system prompt. It will be layered controls: narrower tool permissions, signed and reviewed tool manifests, runtime proxies, audit logs, approval gates, egress policy, and regression tests built from real failures. Armorer Guard is not the whole answer. It is a useful piece in the right layer.

Small tools often reveal the future before platforms productize it. Armorer Guard’s MCP proxy is one of those signals. The next phase of agent security will be less about teaching models to be cautious and more about making dangerous actions impossible, reviewable, or at least logged before they hit the real world. That is less magical than the demo. It is also how shipping works.

Sources: Armorer Guard v0.2.4 release, Armorer Guard v0.2.3 release, Armorer Guard repository, Dev Journal writeup

The proxy belongs at the action boundary

Local scanning is the right default for agent tool arguments

Start with the tools that can actually hurt you

Sign up for more like this.