The  LGTM
  • Home
  • Agentic Coding
  • Claude Code
  • Codex
Sign in Subscribe
Codex-Spark Is OpenAI Admitting Coding Agents Need a Fast Lane, Not Just a Bigger Brain
codex

Codex-Spark Is OpenAI Admitting Coding Agents Need a Fast Lane, Not Just a Bigger Brain

OpenAI’s new Codex-Spark is easy to misread as another model SKU in a week already full of model SKUs. It is more interesting than that. GPT-5.3-Codex-Spark is OpenAI saying the quiet part about coding agents out loud: sometimes the bottleneck is not whether the model can solve the
16 May 2026 5 min read
AI Middleware Is Becoming Critical Infrastructure, and MCP Is the Audit Boundary
claude-code

AI Middleware Is Becoming Critical Infrastructure, and MCP Is the Audit Boundary

The phrase “AI middleware” sounds like something vendors invented to make plumbing billable. Unfortunately, it is also the right threat model. The dangerous layer in modern agent stacks is increasingly not the model itself, but the glue around it: model routers, MCP servers, plugin managers, tool proxies, local CLIs, background
16 May 2026 5 min read
Claude Code 2.1.143 Turns Plugin Dependencies and Background Sessions Into Things You Can Govern
claude-code

Claude Code 2.1.143 Turns Plugin Dependencies and Background Sessions Into Things You Can Govern

Claude Code 2.1.143 is not the release that will make a launch video go viral. Good. The interesting work in coding agents right now is not another sparkling demo of a model editing five files at once. It is whether the runtime can survive plugin graphs, background workers,
16 May 2026 5 min read
OpenClaw’s Stuck-Tool Recovery Patch Is the Kind of Runtime Observability Agents Actually Need
openclaw

OpenClaw’s Stuck-Tool Recovery Patch Is the Kind of Runtime Observability Agents Actually Need

Agent observability has a bad habit of stopping one step too early. It tells you a session is “processing.” Then it tells you the queue depth. Then, if the platform is unusually honest, it classifies the problem as a blocked tool call. Useful. Also not enough. The operator does not
15 May 2026 4 min read
OpenClaw’s Subagent Completion Regression Shows Multi-Agent Orchestration Is Still a Delivery Problem
openclaw

OpenClaw’s Subagent Completion Regression Shows Multi-Agent Orchestration Is Still a Delivery Problem

Multi-agent orchestration is usually marketed as decomposition: split the work, spawn specialists, collect results. That story skips the part operators actually live with, which is delivery. A delegated task that finishes but never reports back is not automation. It is a very polite way to lose state. OpenClaw issue #82370,
15 May 2026 3 min read
OpenClaw’s Codex Harness Still Has to Prove Tool Policy Is a Runtime Boundary, Not a Config Wish
openclaw

OpenClaw’s Codex Harness Still Has to Prove Tool Policy Is a Runtime Boundary, Not a Config Wish

There are two ways to ship a tool policy in an agent platform. One is as configuration: a tidy object, a feature flag, maybe a helper function with a reassuring test name. The other is as an actual runtime boundary: the exact place where the model tries to run bash,
15 May 2026 4 min read
Nemotron CLIMB Is NVIDIA Admitting the Expensive Part of Scaling Laws Needs Cheaper Test Fixtures
nvidia

Nemotron CLIMB Is NVIDIA Admitting the Expensive Part of Scaling Laws Needs Cheaper Test Fixtures

The least glamorous way to save millions of dollars in AI is to find out your training recipe is bad before you run it at the size where the invoice becomes memorable. That is the real story behind NVIDIA’s Nemotron-CLIMB proxy models: two small base models, 62M and 350M
15 May 2026 5 min read
NVIDIA’s New Streaming ASR Model Is the Boring Multilingual Agent Component Everyone Eventually Needs
nvidia

NVIDIA’s New Streaming ASR Model Is the Boring Multilingual Agent Component Everyone Eventually Needs

The most important model releases are usually not the ones that make the best demo reel. They are the ones that remove a piece of glue code from a production system. NVIDIA’s new Nemotron 3.5 ASR Streaming Multilingual 0.6B is that kind of release: a 600 million-parameter
15 May 2026 4 min read
Microsoft’s SC-500 Cert Is a Roadmap for the AI Security Job Nobody Has Fully Defined Yet.
azure-ai

Microsoft’s SC-500 Cert Is a Roadmap for the AI Security Job Nobody Has Fully Defined Yet.

Microsoft’s new Cloud and AI Security Engineer Associate certification sounds, at first pass, like ordinary credential churn. Another exam code, another badge, another study guide for people who already have too many tabs open in Microsoft Learn. But SC-500 is more interesting than that because exam outlines are roadmaps
15 May 2026 5 min read
azure-ai

Codex on Mobile Is Not About Coding on a Phone. It Is About Keeping Long-Running Agents Inside the Approval Loop.

Codex landing in the ChatGPT mobile app is easy to misunderstand. This is not OpenAI trying to convince serious developers to review a refactor with their thumbs. It is OpenAI acknowledging the actual shape of agentic coding work: the agent runs for a while, hits a decision point, needs permission,
15 May 2026 4 min read
Armorer Guard’s MCP Proxy Puts Agent Security Where It Belongs: In Front of the Tool
ai-frameworks

Armorer Guard’s MCP Proxy Puts Agent Security Where It Belongs: In Front of the Tool

Armorer Guard v0.2.4 is a licensing and packaging release for a tiny project with 22 GitHub stars. That would normally be a footnote. But one day earlier, v0.2.3 added a local MCP proxy for enforcing policy on stdio tool calls, and that is the interesting part:
15 May 2026 5 min read
CrewAI 1.14.5a6 Patches the Prompt-Manifest Trust Boundary Agents Keep Ignoring
ai-frameworks

CrewAI 1.14.5a6 Patches the Prompt-Manifest Trust Boundary Agents Keep Ignoring

CrewAI v1.14.5a6 is an alpha release with the kind of changelog item production teams should read slowly: a dependency bump for a LangSmith prompt-manifest vulnerability and a fix for streamed tool calls that could disappear when available_functions was absent. Neither item looks dramatic in isolation. Together they
15 May 2026 4 min read
← Newer Posts Page 54 of 113 Older Posts →
The LGTM © 2026
  • Sign up
Powered by Ghost