The  LGTM
  • Home
  • Agentic Coding
  • Claude Code
  • Codex
Sign in Subscribe
Microsoft’s Claude Code Pullback Is the Real Copilot CLI Performance Review.
azure-ai

Microsoft’s Claude Code Pullback Is the Real Copilot CLI Performance Review.

Microsoft reportedly pulling most internal Claude Code licenses is not a vendor-drama footnote. It is the most honest benchmark Copilot CLI has faced so far. Marketing pages can say a terminal agent is ready for enterprise engineering; taking away the rival tool your own developers have been using is where
14 May 2026 4 min read
Codex’s New Pricing and Browser-State Docs Make the Real Platform Boundary Visible
ai-frameworks

Codex’s New Pricing and Browser-State Docs Make the Real Platform Boundary Visible

Codex pricing looks like a billing page. It is actually an architecture diagram with dollar signs attached. OpenAI’s fresh Codex docs make the platform boundary clearer than any launch post: Codex is not merely a CLI or an IDE helper. It is a metered agent runtime that spans local
14 May 2026 5 min read
Microsoft Agent Framework 1.6.1 Is the Boring Production Release Agent Platforms Actually Need
ai-frameworks

Microsoft Agent Framework 1.6.1 Is the Boring Production Release Agent Platforms Actually Need

Microsoft Agent Framework 1.6.1 is the kind of release that will not trend, which is usually how you know it matters. The agent-framework market spent two years selling demos: chat with tools, agents handing work to other agents, diagrams full of arrows that looked suspiciously like distributed systems
14 May 2026 5 min read
Codex Mobile Turns Agentic Coding Into Remote-Control Infrastructure
agentic-coding

Codex Mobile Turns Agentic Coding Into Remote-Control Infrastructure

The least interesting version of OpenAI’s new Codex mobile launch is the one in the headline: “now you can code from your phone.” Nobody serious wants to review a gnarly refactor on a 6-inch screen while standing in line for coffee. The more important shift is quieter and more
14 May 2026 6 min read
google-ai

Gemini 3.1 Deep Think Is Google’s Agentic Coding Flex — But Bring Your Own Eval Harness

Google’s Gemini 3.1 update is not subtle about the audience it wants: developers choosing which model gets to touch a terminal, a repo, a browser session, and eventually production-adjacent workflow state. The DeepMind model page now presents Gemini 3.1 Pro and Gemini 3.1 Deep Think less
14 May 2026 5 min read
Codex Hooks Going GA Makes Agent Policy Programmable — and Also Reviewable
codex

Codex Hooks Going GA Makes Agent Policy Programmable — and Also Reviewable

Hooks are the kind of feature that rarely wins a launch-day popularity contest and then quietly becomes the thing enterprises cannot deploy without. OpenAI’s Codex Hooks going generally available is not glamorous. It is also one of the more important Codex platform moves this week, because it gives teams
14 May 2026 4 min read
OpenClaw’s Token-Auth Scope Bug Shows Agent Observability Has a Trust Boundary Problem
openclaw

OpenClaw’s Token-Auth Scope Bug Shows Agent Observability Has a Trust Boundary Problem

The most honest thing about OpenClaw issue #81775 is that both sides of the bug are defensible. A WebSocket client authenticates with the configured shared gateway token, requests operator read scopes, and successfully connects. Then the gateway returns payload.auth.scopes: [], which means every useful observability subscription fails with missing
14 May 2026 4 min read
OpenClaw’s Channel MemoryFlush Proposal Gets the Agent-Memory Problem Exactly Half Right
openclaw

OpenClaw’s Channel MemoryFlush Proposal Gets the Agent-Memory Problem Exactly Half Right

OpenClaw issue #81804 asks for an opt-in pre-compaction memoryFlush turn for channel-driven sessions: Google Chat, Slack, Discord, Matrix, Telegram, Feishu, and the rest of the places where agents slowly become part of a team’s working memory. The proposal is smart because it avoids the hardest version of the problem.
14 May 2026 4 min read
OpenClaw 2026.5.12-beta.8 Is Shrinking the Core and Hardening the Edges
openclaw

OpenClaw 2026.5.12-beta.8 Is Shrinking the Core and Hardening the Edges

OpenClaw’s newest beta is not trying to win the release-notes beauty contest. Good. The interesting work in v2026.5.12-beta.8 is mostly the kind of platform plumbing people notice only after it fails: dependency boundaries, fallback boundaries, inbound queue boundaries, credential boundaries, and sandbox roots. That is the
14 May 2026 4 min read
A 100B-Class NVFP4 Quant on 2× DGX Spark Is the Most Useful Kind of Messy Benchmark
nvidia

A 100B-Class NVFP4 Quant on 2× DGX Spark Is the Most Useful Kind of Messy Benchmark

The most useful benchmark posts are usually the messy ones. A fresh NVIDIA Developer Forum thread on quantizing a 100B-class Llama-derived model to NVFP4 across two DGX Spark systems is valuable because it does not sand off the parts operators actually trip over: unified-memory misdetection, exporter quirks, vLLM sidecar fixes,
14 May 2026 4 min read
nvidia

The Best Local Coding-Agent Post Today Says the Quiet Part: The Model Is a Worker, Not the Authority

The best local coding-agent post today does not ask whether Qwen 3.6 is smart enough to write code. It asks the more useful question: what should the model be allowed to decide? Manolo Remiddi’s writeup on building a local coding-agent setup with Qwen 3.6, OpenCode, Hermes, and
14 May 2026 4 min read
Hermes on DGX Spark Shows Local Agents Are Becoming a Hardware Product — With a Reliability Bill Attached
nvidia

Hermes on DGX Spark Shows Local Agents Are Becoming a Hardware Product — With a Reliability Bill Attached

NVIDIA’s latest RTX AI Garage post is not really about one more agent framework. It is about local agents becoming a hardware product. Hermes Agent, Qwen 3.6, RTX PCs, RTX PRO workstations, and DGX Spark are being bundled into a single story: keep the agent near your files,
14 May 2026 4 min read
← Newer Posts Page 58 of 113 Older Posts →
The LGTM © 2026
  • Sign up
Powered by Ghost