The  LGTM
  • Home
  • Agentic Coding
  • Claude Code
  • Codex
Sign in Subscribe
CUDA 13.3 Is NVIDIA Moving GPU Programming Up the Stack Without Hiding the Metal
nvidia

CUDA 13.3 Is NVIDIA Moving GPU Programming Up the Stack Without Hiding the Metal

CUDA 13.3 is the kind of release that looks incremental until you line up the moving pieces. Tile programming comes to C++. CUDA Python reaches a 1.0 stability line. CompileIQ exposes compiler auto-tuning as a first-class workflow. Green contexts and process checkpointing move closer to production control-plane primitives.
27 May 2026 4 min read
LangGraph’s Tiny Checkpoint Fix Is a Big Reminder: Durable Agents Need Stable Identity
ai-frameworks

LangGraph’s Tiny Checkpoint Fix Is a Big Reminder: Durable Agents Need Stable Identity

LangGraph 1.2.2 is a small release about a boring invariant. That is precisely why it matters. The patch fixes unstable message IDs before DeltaChannel checkpoint writes are serialized. In plain English: under certain durability modes, a message with id=None could be written to the checkpoint before the
27 May 2026 4 min read
LangChain Built a Database Because Agent Traces Stopped Looking Like Logs
ai-frameworks

LangChain Built a Database Because Agent Traces Stopped Looking Like Logs

LangChain did not build SmithDB because databases are trendy. It built SmithDB because agent traces stopped behaving like normal logs. That distinction matters. The obvious version of this story is “LangChain made LangSmith faster.” True, but incomplete in the way a passing unit test can still hide a broken architecture.
27 May 2026 5 min read
Copilot CLI 1.0.55-3 Turns Plugins, Hooks, and Skills Into Session Controls
agentic-coding

Copilot CLI 1.0.55-3 Turns Plugins, Hooks, and Skills Into Session Controls

GitHub Copilot CLI 1.0.55-3 is a pre-release, which is usually code for “skip unless you enjoy changelog archaeology.” This one is worth reading because it shows where the terminal-agent fight is going: plugins, hooks, skills, remotes, policy, and token accounting are becoming session controls rather than background configuration.
27 May 2026 5 min read
OpenCode 1.15.11 Fixes the Runtime Edges That Make Agents Feel Haunted
agentic-coding

OpenCode 1.15.11 Fixes the Runtime Edges That Make Agents Feel Haunted

OpenCode 1.15.11 is not trying to win the screenshot war. It is doing something more useful: sanding down the runtime edges that make coding agents feel haunted after the happy path breaks. The release is a pile of fixes that only looks small if you have never operated
27 May 2026 5 min read
Google Health Is Turning Fitbit Into an AI Data Layer, Not Just a Dashboard
google-ai

Google Health Is Turning Fitbit Into an AI Data Layer, Not Just a Dashboard

Google Health’s latest update is easy to misread as another wellness-app consolidation story. Fitbit gets renamed, Apple Health gets mentioned, Gemini gets the predictable product cameo, and everyone moves on. That would miss the actual architecture decision: Google is trying to turn personal health data into an agent-ready substrate.
27 May 2026 5 min read
MiniMax-M2.7 Is the Open-Weight Coding-Agent Release to Watch — With Some Benchmark Caveats Attached
ai-models

MiniMax-M2.7 Is the Open-Weight Coding-Agent Release to Watch — With Some Benchmark Caveats Attached

MiniMax-M2.7 is the kind of model release that deserves attention and a raised eyebrow at the same time. It is open-weight, agent-focused, benchmark-heavy, and clearly aimed at the coding-agent comparison table where Claude, Codex, Copilot, Gemini, Qwen, and the local-model crowd are all fighting for developer mindshare. It also
27 May 2026 5 min read
LocateAnything Fixes a Small-Looking VLM Bottleneck That Breaks Real Agents
ai-models

LocateAnything Fixes a Small-Looking VLM Bottleneck That Breaks Real Agents

Visual grounding is the kind of model capability that looks like plumbing right up until an agent clicks the wrong thing. A coding assistant can recover from a bad suggestion. A screen-operating agent that taps the wrong delete button, selects the wrong invoice total, or drags the wrong bounding box
27 May 2026 4 min read
VitaBench 2.0 Finds the Missing Capability in Personal Agents: Remembering Without Making a Mess
ai-models

VitaBench 2.0 Finds the Missing Capability in Personal Agents: Remembering Without Making a Mess

The personal-agent industry keeps treating memory like a feature toggle. Add embeddings, store a few preferences, retrieve the “relevant” chunks, and suddenly the assistant is supposed to know you. VitaBench 2.0 is useful because it says the quiet part out loud: storing user context is the easy half. Using
27 May 2026 4 min read
MobileGym Makes GUI-Agent Benchmarks Look Less Like Vibes and More Like Engineering
ai-models

MobileGym Makes GUI-Agent Benchmarks Look Less Like Vibes and More Like Engineering

Mobile-agent demos have been winning the wrong argument. The impressive part is not that a model can tap through a shopping flow on a phone-shaped screenshot while a video plays nicely on social media. The hard part is proving what happened after the tap: which state changed, which side effects
27 May 2026 4 min read
Copilot Memory Gets the Controls It Needed Before Teams Trust It
codex

Copilot Memory Gets the Controls It Needed Before Teams Trust It

Copilot Memory is where coding assistants stop being stateless helpers and start becoming part of a team’s operating lore. That is useful. It is also exactly the kind of feature that should make engineering managers ask uncomfortable questions before the agent quietly teaches itself the wrong house style. GitHub’
27 May 2026 5 min read
Codex 0.134.0 Turns the Alpha Plumbing Into the Stable Runtime Contract
codex

Codex 0.134.0 Turns the Alpha Plumbing Into the Stable Runtime Contract

Codex 0.134.0 is not the release you show in a keynote. It is the release you want before you let a coding agent anywhere near a real engineering workflow. OpenAI promoted @openai/[email protected] to the stable npm latest channel on May 26, about 20 minutes
27 May 2026 5 min read
← Newer Posts Page 26 of 111 Older Posts →
The LGTM © 2026
  • Sign up
Powered by Ghost