codex

Codex 0.136 Alpha Adds Image Generation, Guardian Metrics, and a Diff Security Lesson

Anatoliy Kolodkin

29 May 2026 • 5 min read

Codex 0.136.0-alpha.1 looks like a minor alpha if you only read the release note. The note itself is almost comically blank: “Release 0.136.0-alpha.1.” The useful story is in the compare range, where OpenAI is doing the work that separates a flashy coding assistant from an agent runtime teams can actually operate: multimodal extensions, review metrics, safer Git boundaries, MCP visibility, packaging seams, and more lifecycle accounting.

That combination matters because coding agents are crossing two lines at once. They are becoming more capable — now reaching toward generated and edited images as first-class artifacts — while also getting closer to untrusted repositories, enterprise policy, usage metering, and long-running workflows. The agent that can generate an image from your UI notes is neat. The agent that can do that without turning every repo checkout into a trust exercise is the product people can deploy.

The image feature is less interesting than the way it is gated

The headline feature is PR #24723, which adds a standalone codex-image-generation-extension crate. It supports generate/edit execution, prior-image selection for edits, model-visible image output, and local persistence for generated images. The initial model is fixed to gpt-image-2, and the extension is disabled by default behind the imagegenext feature gate.

That “disabled by default” part is not trivia. Image generation inside a coding agent is not the same product surface as a web chatbot making a sticker. In a developer workflow, generated images can become UI references, app assets, screenshot-derived debugging material, visual diffs, fixture data, or accidentally persisted sensitive artifacts. If the agent is allowed to edit prior images and hand the result back into the model-visible context, teams need to know where those artifacts live, whether they are committed, whether telemetry includes them, and who can inspect them later.

OpenAI appears to understand that risk. The extension is app-server-installed, feature-gated, backend-eligibility-aware, and only replaces hosted Responses image generation when the standalone executor is registered and eligible. In the PR snippet surfaced by GitHub, generated image tool output also avoids copying image bytes into tool-call telemetry, returning a preview string like [generated image]. That is the right instinct: multimodal capability should be explicit infrastructure, not a surprise side effect of “the model can do images now.”

Practitioners should treat this like any other tool privilege. If you test the alpha, do not just ask Codex to make a mockup and call it done. Verify that image generation is off unless deliberately enabled, confirm where generated artifacts are persisted, decide whether edits can reuse prior images from the session, and inspect how generated files show up in diffs, logs, and review flows. The correct rollout question is not “does it work?” It is “can we explain exactly what it touched?”

The security fix is the release’s loudest quiet signal

The most important line item may be commit 2e0c4f4: “prevent repository-configured code execution in /diff.” The problem is precise and nasty. Codex’s TUI /diff command is meant to display working-tree changes, but Git diff operations can honor repository-selected executable helpers: text conversion, external diff commands, clean/process filters, core.fsmonitor, hooks such as post-index-change, and even helper configuration inside checked-out submodules.

That is exactly the sort of trust boundary coding-agent products keep rediscovering. A developer thinks they are viewing a diff. The tool is actually asking Git to interpret repo-controlled configuration. In a normal trusted repo, that is usually fine. In a workflow where an agent or human may inspect unknown pull requests, cloned reproductions, bug reports, or third-party projects, it is an execution surface wearing a read-only costume.

The fix is appropriately defensive: pass --no-textconv and --no-ext-diff, neutralize configured clean/process filters through structured GIT_CONFIG_KEY_* and GIT_CONFIG_VALUE_* overrides, force core.fsmonitor=false, set a null hooks path, and avoid inspecting dirty submodule worktrees in ways that could execute child-repository filters. OpenAI added real-Git marker tests for filters, fsmonitor, hooks, and submodules to prove those helpers are not invoked.

If your team lets agents touch unfamiliar repositories, steal this mental model. Diff rendering is not automatically safe. Formatter discovery is not automatically safe. Package-manager metadata is not automatically safe. Hook systems, filters, build scripts, editor integrations, and MCP tool discovery all deserve hostile-input treatment. The correct default for “show me what changed” is non-executing inspection, even if it means omitting convenience details like dirty-only submodule markers.

Guardian metrics are where governance becomes measurable

The release also adds a cluster of Guardian work: stabilized Guardian client cache-key handling, a Guardian review prompt cache key, assertions around cache-key reuse, threading that key through the session, and new Guardian review metrics. This is less demo-friendly than image generation, but it is closer to what enterprise operators ask for after the first policy incident.

A safety or review layer that nobody can measure is not a control; it is a ritual. Stable prompt-cache keys reduce repeated review overhead. Metrics make review behavior observable. Threading review data through a session lets teams correlate a review with the user turn, subagent work, tool calls, compaction, and final output. That is how agent governance moves from “we have a policy prompt somewhere” to “we can explain which review ran, why it ran, and how it behaved.”

This fits the broader Codex trajectory from the last several releases. OpenAI has been making runtime structure more explicit: memory gets its own database, goal usage limits get terminal states, MCP startup and tool-listing stalls get instrumentation, sessions group root and subagent work, profiles shape permissions, and now Guardian reviews get better accounting. None of that wins a screenshot contest. All of it matters when an agent is allowed to run for hours, call tools, compact context, remember facts, spawn helpers, and stop because a budget or policy boundary said so.

There are also smaller but related control-plane changes in the 0.136 alpha: exposing MCP server information in server status, wiring task completion into thread-idle lifecycle, gating goal tools by thread eligibility, storing pending response items directly, reaping stale multi-agent slots, and continuing app-server/SDK packaging work. The pattern is consistent. Codex is becoming less of a terminal UI and more of a programmable runtime with places to observe, constrain, and recover the work.

For teams evaluating Codex against Copilot, this is the distinction to watch. Copilot’s advantage is distribution: it lives in GitHub, IDEs, admin consoles, cloud agent flows, and enterprise billing. Codex’s advantage is runtime explicitness: local CLI behavior, MCP routing, feature gates, app-server seams, extension hooks, review metrics, and increasingly inspectable state. The buying question should not be “which model is best this week?” The useful question is “which product lets us govern cost, tools, memory, artifacts, and security boundaries without guessing?”

The practical recommendation is boring and therefore correct: do not roll 0.136.0-alpha.1 across developer laptops just because it landed. Stage it. Test the image extension with the feature flag both off and on. Validate that /diff does not execute repo-provided helpers in hostile fixtures. Check whether Guardian metrics appear in the traces or logs your platform team actually reads. Confirm MCP server status is visible enough for support. Then decide whether any of these alpha seams solve a real deployment problem today.

My read: this is a multimodal release on the surface and an operations release underneath. OpenAI is broadening Codex beyond text while adding the dull, necessary controls — feature gates, cache keys, metrics, lifecycle hooks, MCP visibility, and non-executing diff behavior — that determine whether coding agents graduate from impressive local toys to systems engineering infrastructure. The image generator will get the clicks. The diff fix and Guardian metrics are the parts I would bring to standup.

Sources: OpenAI Codex GitHub release, GitHub compare range, PR #24723, security fix commit, npm package metadata

The image feature is less interesting than the way it is gated

The security fix is the release’s loudest quiet signal

Guardian metrics are where governance becomes measurable

Sign up for more like this.