agentic-coding

Cline CLI’s May 22 Patch Train Is About Keeping Agent Sessions Attached to Reality

Anatoliy Kolodkin

22 May 2026 • 5 min read

Cline’s May 22 patch train is a useful reminder that agentic coding is mostly state management with a model attached. The model gets the attention. The state decides whether the work survives contact with reality.

Across CLI v3.0.10, v3.0.11, and v3.0.12, Cline fixed the sorts of issues that rarely make launch blogs but absolutely determine daily trust: interactive sessions now stay alive while idle or waiting for approval, ask-question prompts render inline with the conversation, provider and model switches show loading state instead of looking frozen, Vertex Gemini auth uses Google auth when a gcp.projectId is configured, tool-result content blocks include tool names, and manual updates can bypass the automatic release-age gate when the user explicitly asks.

None of that is a single giant feature. Together, it is the infrastructure that keeps a coding-agent session attached to the right provider, the right model, the right approval state, and the right audit trail. That is the real work now.

An agent waiting for approval is not finished

The foundational fix is in v3.0.10: interactive sessions stay alive while idle or awaiting approval instead of being treated as ended. That sounds obvious, but many agent systems still behave as if work consists of one prompt, one response, and a clean stop. Real engineering does not work that way. A developer pauses to inspect a diff, answer a question, approve a command, check logs, review docs, or simply get pulled into a meeting. The session is not done. It is waiting.

If the runtime mistakes waiting for completion, it breaks the human mental model. Work gets stranded. Logs become misleading. Automation that expects a resumable session has to guess whether the agent ended intentionally, crashed, or is sitting at a human decision point. For a tool like Cline, which is often used inside an interactive development loop, preserving this distinction is table stakes.

It also affects safety. Approval prompts are not decorative friction. They are how many teams prevent an agent from running shell commands, editing files, or touching external systems without human sign-off. If approval-waiting sessions are not represented as live, then the control plane around those approvals becomes unreliable. A safe agent workflow needs to model “blocked on user” as a first-class state, not as an awkward terminal pause.

Inline questions make the audit trail readable

Version 3.0.12 moves ask-question tool prompts inline with the assistant turn instead of rendering them as a separate modal. This is a subtle UX change with disproportionate audit value. When an agent asks a question, the question is part of the reasoning path. If the prompt appears outside the transcript, future readers lose context: what did the agent ask, what options did it present, and what answer shaped the next action?

Inline prompts keep the human decision attached to the conversation. That matters when debugging a bad outcome later. If the agent chose the wrong branch, skipped a migration, or made a risky assumption, the transcript should show whether it asked the user, whether the user answered ambiguously, or whether the agent never escalated. The best agent logs are not just records of model output. They are records of shared control.

Cline’s v3.0.11 fix to include tool names in tool-result content blocks belongs in the same category. Tool results without tool names are audit debt. In a short demo, the user may remember that a blob of output came from a shell command or file read. In a long session with local tools, MCP calls, provider APIs, plugins, skills, and background operations, unnamed outputs become mystery meat. Adding tool names makes logs replayable and policy review possible.

Provider flexibility is power, and also drift

Cline’s appeal is partly that it spans a messy provider world: ChatGPT OAuth, Vertex Gemini, Ollama, Poolside, Gemini 3.5 Flash, SDK model catalogs, plugins, skills, file-based installs, and more. That flexibility is valuable. It also creates many ways for the runtime to drift away from what the user believes is selected.

The May 22 fixes attack that drift from several angles. Version 3.0.12 shows loading dialogs when configuration switches provider or model, which is a small but important honesty layer: do not leave the user wondering whether the selected runtime changed or the UI froze. Version 3.0.11 fixes ChatGPT OAuth failures caused by unsupported max_output_tokens handling, restores the full output-token budget, and authenticates Vertex Gemini through Google auth when gcp.projectId is configured. It also surfaces the full Vertex model list instead of only Claude models.

Version 3.0.10 preserves SDK model output-token limits so context-window math matches upstream provider catalogs. That is not bookkeeping. Context-window and output-token limits shape cost, truncation, completion quality, and whether the agent can finish a task without silently compressing away the thing it needed. If the local runtime’s idea of a model budget differs from the provider’s catalog, the user will experience that mismatch as flaky intelligence.

The plugin and skills changes also point toward more explicit runtime state. v3.0.10 adds plugin installation from file:// URLs, adds Poolside as a provider, adds Gemini 3.5 Flash to the Gemini provider list, includes an Ollama API-key note in TUI settings, hides the skills tool from the system prompt when skills are disabled, and refreshes slash commands after toggling a skill. These are not glamorous features, but they close gaps between configuration and agent behavior. If a skill is disabled, it should not linger in the prompt. If a provider is configured, the model list should reflect reality. If a local plugin is installed, the runtime should make that path explicit.

The manual update change in v3.0.12 deserves a deployment footnote. Letting cline update install the latest published version immediately is useful when a hotfix matters, especially in a fast-moving CLI patch train. It is also a policy decision. Teams should decide whether developers may bypass automatic release-age gates or whether updates must go through a pinned rollout channel. The right answer depends on the codebase and risk profile; the wrong answer is pretending update behavior is not part of agent governance.

For practitioners, the checklist is concrete. Start a session, let it wait for approval, and verify it remains live. Confirm ask-question prompts are preserved inline in logs. Inspect tool-result entries and make sure tool names are present. Switch between ChatGPT OAuth, Vertex Gemini, Ollama, and any local providers your team actually uses. Verify output-token limits and context-window math after the SDK catalog refresh. Toggle skills and confirm the prompt/tool surface changes accordingly. Test manual update behavior against your team’s rollout policy.

Cline’s May 22 releases are not trying to win the news cycle. They are fixing the connective tissue between user intent, provider selection, approval lifecycle, tool logs, and context math. That is where serious agent tools are going to compete. The vibe gets you a demo. State integrity gets you through the week.

Sources: Cline CLI v3.0.12 release notes, Cline CLI v3.0.11, Cline CLI v3.0.10, Cline repository

An agent waiting for approval is not finished

Inline questions make the audit trail readable

Provider flexibility is power, and also drift

Sign up for more like this.