The  LGTM
  • Home
  • Agentic Coding
  • Claude Code
  • Codex
Sign in Subscribe
xAI’s Management API Gives Grok the Admin Surface Teams Actually Need
xai

xAI’s Management API Gives Grok the Admin Surface Teams Actually Need

xAI is doing the least glamorous work in the AI platform stack, which is exactly why this update matters. The company’s refreshed Management API documentation is not a model launch, not a benchmark chart, and not another demo of Grok writing a React component. It is the admin surface
28 May 2026 4 min read
Zed’s Agent Skills Push Makes the Editor a Supply-Chain Surface
agentic-coding

Zed’s Agent Skills Push Makes the Editor a Supply-Chain Surface

Zed’s 1.5.0-pre release looks, at first glance, like normal editor evolution: better Mermaid rendering, thread renaming, smoother panels, clickable paths, faster OpenAI service tiers, and the usual bug-fix parade. The more important story is tucked inside the Agent section. Zed is turning skills, AGENTS.md rules, MCP/
28 May 2026 6 min read
agentic-coding

Goose 1.36 Turns Code Review, Hooks, Skills, and Goals Into the Agent Runtime

Goose 1.36.0 is not a “new buttons in the chat window” release. It is more interesting than that, which is inconvenient for anyone trying to reduce coding-agent progress to model benchmarks. The release turns several things teams have been improvising around — code review, permission hooks, reusable skills, goal
28 May 2026 5 min read
NotebookLM Is Turning Launch Week Into a Source-Grounded Briefing Layer
google-ai

NotebookLM Is Turning Launch Week Into a Source-Grounded Briefing Layer

Google’s smallest I/O follow-up this week may be the one more product teams should copy. The company published a public NotebookLM notebook for Google I/O 2026, loaded with keynote videos, product demonstrations, blog posts, and generated ways to consume the whole pile: a sub-two-minute Audio Overview, a
28 May 2026 5 min read
ai-models

SkillGrad Treats Agent Skills Like Code That Needs an Optimizer, Not a Pep Talk

Agent skills are being treated too much like helpful markdown and not enough like dependencies. That is the mistake SkillGrad is trying to correct. A skill can change how an agent edits a spreadsheet, reads a table, calls a tool, writes code, or follows a procedure. If that artifact can
28 May 2026 3 min read
LearnWeak Makes Small Computer-Use Agents Better by Training on Their Actual Mistakes
ai-models

LearnWeak Makes Small Computer-Use Agents Better by Training on Their Actual Mistakes

LearnWeak is a useful reminder that small agents do not need motivational posters. They need failure-specific training. The framework takes computer-use agents that are weak in particular desktop domains, finds where a stronger teacher succeeds and the smaller student fails, generates new tasks around those weaknesses, and trains the student
28 May 2026 3 min read
MemTrace Turns Agent Memory Bugs Into Something You Can Actually Debug
ai-models

MemTrace Turns Agent Memory Bugs Into Something You Can Actually Debug

Memory is where agent demos go to become production incidents. In a demo, the assistant remembers the user’s preference and everyone nods. In production, it stores the wrong preference, retrieves the stale one, overwrites the useful one, cites irrelevant history, and then produces an answer that looks confident enough
28 May 2026 3 min read
AXPO Shows Why Tool-Using Models Need Different RL Than Chat Models
ai-models

AXPO Shows Why Tool-Using Models Need Different RL Than Chat Models

The interesting thing about AXPO is not that it makes Qwen3-VL-Thinking score a little better on multimodal benchmarks. The interesting thing is the failure mode it catches: models that know a tool exists, talk about using it, and then retreat back into pure text because acting has become too expensive
28 May 2026 4 min read
Codex 0.135 Alpha Is Turning Agent Memory, Search, and Goal Accounting Into Runtime Primitives
codex

Codex 0.135 Alpha Is Turning Agent Memory, Search, and Goal Accounting Into Runtime Primitives

Codex 0.135.0-alpha.2 is not the kind of release that gets a product-launch video. Good. The interesting part of coding agents right now is not whether they can write another demo todo app; it is whether their runtime has enough explicit state, accounting, and diagnostics that a serious
28 May 2026 5 min read
Microsoft’s Agent Governance Toolkit Gives MCP Agents the Policy Layer Prompts Cannot Provide
claude-code

Microsoft’s Agent Governance Toolkit Gives MCP Agents the Policy Layer Prompts Cannot Provide

The most useful thing about Microsoft’s Agent Governance Toolkit is that it refuses to pretend prompts are a permission system. That should not be a controversial position in 2026, but here we are: teams are connecting agents to codebases, databases, ticketing systems, browsers, email drafts, cloud APIs, and MCP
28 May 2026 5 min read
Claude Code 2.1.153 Fixes the Boring Surfaces That Become Incidents
claude-code

Claude Code 2.1.153 Fixes the Boring Surfaces That Become Incidents

The most important Claude Code release this week is not the one with the flashiest feature. It is the one that fixes the surfaces nobody demos and everybody depends on once coding agents become part of daily engineering work: MCP policy inheritance, gateway credentials, background-session recovery, update channels, and the
28 May 2026 5 min read
openclaw

OpenClaw’s Remote iMessage Image Bug Is What Happens When Media Pipelines Cross Machines

OpenClaw’s remote iMessage media bug is narrow enough that most users will never hit it. That is exactly why it is worth covering. The interesting failures in agent platforms are often not the universal ones. They live at the handoff points: one machine sees a file, another machine owns
27 May 2026 3 min read
← Newer Posts Page 23 of 111 Older Posts →
The LGTM © 2026
  • Sign up
Powered by Ghost