The LGTM
agentic-coding

Copilot CLI 1.0.55 Puts Token Accounting, MCP Usage, and Bypass Controls in the Terminal

Copilot CLI 1.0.55 is the release where agent cost stops being a billing surprise and starts becoming a terminal feature. That sounds unromantic because it is. The next useful wave of coding-agent tooling is not another prompt box. It is the boring ability to see what the agent
28 May 2026 5 min read
agentic-coding

Codex 0.135 Makes Diagnostics and Permission Profiles First-Class Agent Infrastructure

Codex 0.135.0 is the kind of release that looks dull until you have to support real developers using real agents against real repositories. There is no single cinematic demo here. There is codex doctor, remote status detail, named permission profiles, sandbox presets, resume-flow fixes, TUI correctness, and app-server-owned
28 May 2026 5 min read
agentic-coding

Mistral Turns Le Chat Into Vibe, and the Coding Agent Becomes the Product Surface

Mistral did not rename Le Chat to Vibe because someone in marketing found a better noun. It renamed the product because the center of gravity moved. The thing users increasingly buy is not “chat with a model.” It is a persistent agent surface that can read work context, operate across
28 May 2026 4 min read
ai-models

VibeSearchBench Shows Why Deep Research Still Misses What Users Actually Want

Most “deep research” benchmarks quietly assume the user has already done the hardest part: specifying exactly what they want. That is convenient for leaderboard construction and wildly unlike real work. Real users start with mush. They ask for a good vendor, a sensible travel plan, a market landscape, a replacement
28 May 2026 4 min read
ai-models

LiveBrowseComp Catches Search Agents Cheating With Memory

Search-agent benchmarks have a credibility problem: too many “search” traces look like a model confirming what it already knows. The agent writes plausible queries, opens a few pages, cites something adjacent, and the final answer appears grounded. But if the answer was already sitting in the model’s weights, the
28 May 2026 4 min read
ai-models

GUI-CIDER Says GUI Agents Need World Knowledge, Not More Runtime Scaffolding

GUI agents keep getting wrapped in more scaffolding because the model underneath often does not understand the interface well enough. Add a planner. Add a verifier. Add screenshot retries. Add another model that explains the screen. Add a browser harness with a heroic prompt and a timeout long enough to
28 May 2026 4 min read
ai-models

Gamma-World Makes World Models Multi-Agent Without Turning Attention Into a Tax Bill

Most world-model demos still assume the universe has one protagonist. That is convenient for video generation, robotics toy tasks, and benchmark clips, but it is a bad assumption for the systems people actually want to build. Warehouses have multiple robots. Games have multiple players. Simulators have pedestrians, vehicles, tools, and
28 May 2026 4 min read
openclaw

MCP Structured Content Is Not Optional If the Next Tool Call Needs the ID

MCP’s split between human-readable content and machine-shaped structuredContent is useful right up until the agent cannot see the field it needs for the next tool call. OpenClaw PR #87540 fixes one of those deceptively small bridge bugs: OpenClaw exposed structuredContent to the model only when normal content[] was empty.
28 May 2026 4 min read
openclaw

Tool Results Can Poison a Session One Acceptable Chunk at a Time

The expensive agent failure is rarely one giant mistake. It is usually twenty individually acceptable decisions replayed forever. OpenClaw PR #87639 is a good example: the runtime already caps individual persisted toolResult messages, but a long-running tool can append many results that each fit under the per-message limit. Later, the
28 May 2026 5 min read
openclaw

Broken Context Engines Should Degrade, Not Own the Process

A broken context engine should not get to own the whole process. That is the useful premise behind OpenClaw PR #87640, a fresh patch that adds process-local quarantine for failing non-legacy context engines, reports the quarantine through health surfaces, and downgrades future context-engine work to legacy instead of letting one
28 May 2026 4 min read
openclaw

OpenClaw 2026.5.27 Is a Boundary-Setting Release, Not a Feature Victory Lap

OpenClaw v2026.5.27 is not a victory lap release. It is a boundary-setting release, which is more important and much less marketable. The changelog is full of the kind of fixes that only become urgent after an agent platform has accumulated enough channels, plugins, model routes, helper processes, memories,
28 May 2026 5 min read
azure-ai

Copilot Studio’s Mistral Gate Is the Real Azure AI Story

Microsoft adding Mistral Medium 3.5 to Copilot Studio sounds, at first glance, like another checkbox in the enterprise AI model buffet. Useful, sure. But the actual story is not that makers get one more model in a dropdown. The story is that Microsoft is turning Copilot Studio into a
28 May 2026 6 min read
The LGTM © 2026