openclaw

OpenClaw’s beta.5 Sandbox Regression Shows Why Secure by Default Still Needs Egress Semantics

Anatoliy Kolodkin

17 May 2026 • 4 min read

“Secure by default” is one of those phrases that sounds like an answer until it collides with a real workload. OpenClaw’s new beta.5 sandbox regression is a clean example. The platform tried to reduce the blast radius of Codex app-server runs inside OpenClaw sandboxing. Reasonable instinct. The implementation, according to issue #83347, turned that instinct into a hard network blackout for sandboxed agents that still needed the internet to do their jobs.

The bug report was filed May 18 at 00:39 UTC against 2026.5.16-beta.5. The reported failure shape is specific: restrictCodexAppServerSandboxForOpenClawSandbox(appServer, sandbox) takes a Codex app-server configuration using danger-full-access and rewrites it to workspace-write whenever OpenClaw sandboxing is enabled. That sounds like a downgrade from “too much power” to “safer workspace access.” The catch is the next layer. The workspace-write policy hardcodes networkAccess: false, with no operator override.

The result is not subtle. DNS fails. Outbound access disappears. Research agents lose web search. Google Workspace agents cannot reach Drive, Gmail, or Calendar APIs. Cron jobs that depend on arXiv, PubMed, or other public sources stop behaving like research assistants and start behaving like well-contained paperweights.

The problem is not that OpenClaw cares about sandboxing

OpenClaw is right to be paranoid here. Codex app-server agents can execute tools, inspect workspaces, fetch remote content, and operate across long-running channel sessions. If that runtime is allowed to run with broad host access while also being connected to messaging channels, browser flows, repositories, and credentials, the platform has created the classic agent security problem: a helpful assistant with enough authority to become the attacker’s assistant too.

So the defensive move makes sense. If an OpenClaw sandbox is enabled, the Codex app-server path should not casually preserve danger-full-access. A runtime that says “sandboxed” but quietly leaves a privileged app server outside the intended boundary would be worse than no sandbox at all, because it would give operators false confidence.

But network access is not the same thing as full host access. This is the missing distinction. A sandboxed research agent may need outbound HTTPS and DNS while still being forbidden from reading ~/.ssh, ~/.openclaw, browser profiles, shell history, or arbitrary home-directory files. A Google Drive reader may need narrowly scoped egress to Google APIs and no local credential access beyond its configured token path. A repository-review agent might need workspace reads and no internet. A dependency-debugging agent might need package registry access but only through an audited proxy.

Those are materially different policies. Collapsing them into “workspace-write means no network” is simple, and simplicity is valuable in security design. But for agent platforms, the simple setting becomes brittle because agents combine filesystem, process, network, credential, and channel authority in ways traditional CLI tools usually do not.

The workaround tells you the abstraction is wrong

The issue report says disabling sandbox mode restored DNS and Google Drive API success. That is useful diagnostic evidence and a bad operational answer. It asks the operator to choose between “agent works” and “sandbox exists.” Under deadline pressure, teams will choose “agent works,” then tell themselves they will come back and harden it later. That calendar invite never ships.

Worse, the attempted workaround reportedly introduced sessions_send tool-call timeouts after 30 seconds through codex_dynamic_tool_rpc, so even dropping the sandbox was not a clean escape hatch. That matters because operators need reversible, documented, low-surprise mitigations when a security default breaks production behavior. A workaround that weakens isolation and destabilizes tool RPC is not a workaround. It is moving the failure from one subsystem to another.

ClawSweeper’s response was appropriately conservative: source reproducibility was confirmed, but automatic repair was declined because changing Codex app-server network policy is sandbox- and security-sensitive. That is exactly right. A bot should not patch this by guessing the desired egress model. The owner decision is the product decision.

The correct direction is not “make the sandbox weaker.” It is to make egress semantics explicit. OpenClaw needs policy shapes that separate filesystem writes, filesystem reads, process execution, network egress, DNS, provider calls, credential exposure, and channel delivery. “Workspace-write with network denied” can remain the safest default for many tasks. It just cannot be the only safe mode.

What a better contract would look like

A mature agent sandbox should let operators express something closer to intent: this agent may read the current repo, write inside the workspace, access api.github.com and registry.npmjs.org, use provider inference through the configured gateway, and nothing else. Another agent may call Google APIs through an OAuth-scoped connector but cannot access arbitrary internet destinations. A security-analysis agent may have broader web egress but no local secret access. A local coding agent may have no network at all unless a human approves a package install.

That sounds more complicated because it is. But the alternative is pretending all sandboxed agents have the same network needs. They do not. The agent category is already moving from single-purpose code assistants into always-on operators: inbox readers, research bots, deployment helpers, incident responders, calendar coordinators, repo maintainers, and multi-agent teams. Their security posture cannot be captured by one boolean called network.

There is also an observability requirement. When the runtime denies egress, it should say so in a way both the agent and operator can understand. DNS failure, policy denial, proxy rejection, provider outage, expired OAuth, and remote 403 are different events. If they collapse into the same “network failed” behavior, the model will improvise, the operator will misdiagnose, and support threads will fill with rituals.

Practitioners should add this to their acceptance checklist for OpenClaw or any competing runtime. Can a sandboxed agent use outbound internet without broad host access? Can egress be allowlisted by domain or routed through a proxy? Are denied attempts logged with enough detail to debug but not enough to leak secrets? Does the runtime preserve approval policy when network access is requested mid-task? Can tests cover common network-dependent agents, not just filesystem-only toy scenarios?

The uncomfortable truth is that “secure by default” is not a destination. It is a negotiation between safe defaults and useful overrides. OpenClaw’s beta.5 regression shows the platform is taking host protection seriously. Good. Now it needs to take egress policy seriously too, because an agent that cannot reach the world is not automatically safer in the way operators need. It is often just broken enough to encourage unsafe workarounds.

Sources: OpenClaw issue #83347, OpenClaw v2026.5.16-beta.5 release, PR #83150, issue #83093

The problem is not that OpenClaw cares about sandboxing

The workaround tells you the abstraction is wrong

What a better contract would look like

Sign up for more like this.