GitHub’s New Copilot CLI Push Shows the Terminal-Agent War Is Now About Workflow, Not Chat

GitHub’s latest Copilot CLI push is dressed up as a beginner tutorial, which is a polite way of saying the company is trying to normalize a new workflow before most developers realize the workflow changed.

The April 10 “Copilot CLI for Beginners” post is not technically a big product launch. It teaches installation, login, folder permissions, first prompts, and a few starter use cases. But it does something more strategic than that. It frames Copilot CLI as an agentic terminal environment that can inspect a repository, propose changes, ask for access, and delegate well-defined tasks to the Copilot cloud agent in the background, preserving context, creating a branch, and opening a draft pull request. That is not just chat in a shell. That is workflow capture.

GitHub’s official documentation points in the same direction. Copilot CLI now has an interactive interface, a programmatic prompt mode, and a plan mode where the tool asks clarifying questions and produces a structured implementation plan before writing code. The docs also emphasize queued follow-up messages, auto-compaction near the context limit, GitHub.com interactions like opening PRs and issues, and a growing set of customization hooks through custom instructions and MCP. Meanwhile, GitHub’s April 1 cloud-agent update added branch-only work without an immediate PR, explicit implementation-plan generation, and deep codebase research sessions.
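Concretely, those modes map onto a short terminal session. The sketch below is a hedged reconstruction, not official documentation: the install command and `/delegate` come from GitHub’s tutorial, but the plan-mode invocation, the `-p` programmatic flag, and the task and file names are my reading of the docs and should be verified against the current documentation before use.

```text
# Install and launch (per GitHub's beginner tutorial)
npm install -g @github/copilot
copilot                     # interactive mode; prompts for login and
                            # folder permission on first run

# Inside the session: ask for an implementation plan before any code
# is written (exact plan-mode invocation may differ; check the docs)
> plan: add rate limiting to the API client

# Hand a well-defined task to the cloud agent; it preserves context,
# works on a branch, and can open a draft PR while you keep working
> /delegate fix the flaky retry test   # task description is illustrative

# Programmatic (non-interactive) mode, e.g. for scripts and CI
copilot -p "summarize the last five commits"
```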

Put all of that together and the market signal is straightforward: the terminal-agent wars are not about who can answer a coding question anymore. They are about who owns the path from prompt to plan to background execution to review artifact.

Why this matters on an OpenAI Codex beat

This is a GitHub story, but it matters precisely because OpenAI is pushing Codex in a similar direction from the other side. OpenAI’s recent Codex updates have centered on background-agent progress streaming, response queueing, typed tool outputs, sandbox controls, and desktop-app orchestration. GitHub’s recent Copilot work has centered on cloud delegation, implementation plans, faster validation, better metrics, and BYOK or local-model flexibility in the CLI. Different packaging, same destination.

That convergence is the interesting part. For a while, these tools could still pretend to occupy separate niches. Copilot was the GitHub-native assistant. Codex was OpenAI’s agentic coding environment. Claude Code was the deep terminal operator. Cursor was the editor-centric orchestrator. Those distinctions still exist, but the center of gravity is moving. Everyone is now chasing the same layered workflow: local context, safe permissions, some form of planning, durable background execution, and a clean handoff into review.

GitHub’s tutorial is useful because it accidentally says the quiet part out loud. It tells new users that they can keep working on other tasks while Copilot works in the terminal. It explicitly promotes /delegate to the cloud agent. It frames folder permissions as a normal part of use, not an advanced feature. It normalizes the idea that an AI tool should be able to inspect the repo and act across it. In other words, GitHub is training developers to think of the terminal as an orchestration surface, not merely a prompt box.

The real product battle is where the workflow boundary lives

Here is the key strategic difference. GitHub is increasingly comfortable making the boundary explicit: work can start in the CLI, then hand off to Copilot cloud agent for longer-running tasks on a branch, possibly leading to a pull request. OpenAI, by contrast, appears to be investing in making background behavior feel native inside Codex itself, whether through the app, CLI, or linked session model. GitHub is saying “delegate.” OpenAI is saying “stay here, we can make this environment more agent-native.”

Neither approach is obviously correct yet. GitHub’s model is attractive for teams already living inside GitHub’s branch-and-PR machinery. It makes the durable artifact, the draft PR, part of the product from the start. OpenAI’s approach is attractive if you want the agent to feel closer to a local operator that can stretch into longer tasks without turning every meaningful action into a hosted workflow.

The practical upshot is that developers should stop evaluating these tools as if they were interchangeable chatbots with different logos. The important questions are now workflow questions. Where does planning happen? When do permissions get asked for? How visible is background progress? What durable object gets created at the end: a branch, a PR, a thread, a diff, or just a memory of what happened? Can the tool keep enough context to support long sessions without becoming unusable?

GitHub’s docs suggest it knows these are the right questions. Plan mode exists because direct-to-code autonomy is not always the adult choice. Programmatic mode exists because the CLI has to slot into scripts and quick one-shot summaries, not just conversations. Auto-compaction exists because terminal users hate being told the session is over due to context exhaustion. These are not gimmicks. They are what a maturing product category looks like.

What engineers should do instead of arguing on vibes

If you are choosing between Copilot CLI, Codex, and the rest of the terminal-agent pack, do not start with benchmark charts. Start with your actual workflow shape. If your team already treats Git branches and pull requests as the canonical unit of work, GitHub’s delegate-first model may fit naturally. If you want a local-first environment with stronger emphasis on sandboxing and background agent supervision inside the same tool, Codex may fit better. If your biggest pain is implementation planning rather than code generation, test plan mode and planning quality before anything else.

Also, pay attention to the permission model. GitHub’s beginner tutorial treats folder permission grants as part of normal setup. That is appropriate, but it is also a reminder that “agentic CLI” means granting a tool meaningful access to your project. OpenAI has been unusually direct lately about sandbox boundaries and approval policies. GitHub is increasingly documenting its own security considerations too. Practitioners should reward that candor. The dangerous tools in this category are not the most autonomous ones. They are the ones that are opaque about what autonomy means.

One more tactical note: if you are piloting these tools in a team, instrument the workflow, not just output quality. Track whether tasks actually finish unattended, how often permissions or environment issues block progress, whether plans reduce rework, and how easy it is to review what the agent did. Those metrics will tell you more than “it seemed smart in the demo.”
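Those four metrics are easy to compute once someone actually records task outcomes. A minimal sketch in Python, with invented field names (adapt them to whatever your pilot actually logs):

```python
from dataclasses import dataclass


@dataclass
class AgentTaskRecord:
    """One delegated or background agent task, as observed during a pilot.

    All field names are illustrative, not from any vendor's API.
    """
    finished_unattended: bool     # completed without a human unblocking it
    blocked_on_permissions: bool  # stalled on permission/environment prompts
    had_plan: bool                # an implementation plan was produced first
    needed_rework: bool           # the result was revised after review


def pilot_metrics(tasks: list[AgentTaskRecord]) -> dict[str, float]:
    """Compute the workflow metrics suggested above: unattended completion
    rate, permission/environment block rate, and rework rate with vs.
    without an up-front plan."""
    planned = [t for t in tasks if t.had_plan]
    unplanned = [t for t in tasks if not t.had_plan]

    def rate(subset, pred):
        # Fraction of tasks in `subset` satisfying `pred`; 0.0 if empty.
        return sum(pred(t) for t in subset) / len(subset) if subset else 0.0

    return {
        "unattended_completion_rate": rate(tasks, lambda t: t.finished_unattended),
        "permission_block_rate": rate(tasks, lambda t: t.blocked_on_permissions),
        "rework_rate_with_plan": rate(planned, lambda t: t.needed_rework),
        "rework_rate_without_plan": rate(unplanned, lambda t: t.needed_rework),
    }


# Example: four piloted tasks with made-up outcomes
tasks = [
    AgentTaskRecord(True,  False, True,  False),
    AgentTaskRecord(False, True,  False, True),
    AgentTaskRecord(True,  False, True,  False),
    AgentTaskRecord(True,  False, False, True),
]
print(pilot_metrics(tasks))
```

Comparing rework rates with and without a plan is the quickest way to learn whether plan mode is actually paying for the extra round trip on your codebase.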

GitHub’s new tutorial is ostensibly for beginners. Fine. But the real message is aimed at everyone else. The company is trying to define what normal terminal-agent behavior should look like before the category hardens. That is why this post matters. The winners in AI coding will not be decided by who can write the cutest shell command explanation. They will be decided by who captures the developer workflow with the least friction and the fewest trust failures. GitHub just made clear it knows the race is happening there.

Sources: GitHub Copilot CLI for Beginners, About GitHub Copilot CLI, Research, plan, and code with Copilot cloud agent, OpenAI Codex changelog