The LGTM (Page 69)

LangGraph’s Tiny Checkpoint Fix Is a Big Reminder: Durable Agents Need Stable Identity

LangGraph 1.2.2 is a small release about a boring invariant. That is precisely why it matters. The patch fixes unstable message IDs before DeltaChannel checkpoint writes are serialized. In plain English: under certain durability modes, a message with id=None could be written to the checkpoint before the

LangChain Built a Database Because Agent Traces Stopped Looking Like Logs

LangChain did not build SmithDB because databases are trendy. It built SmithDB because agent traces stopped behaving like normal logs. That distinction matters. The obvious version of this story is “LangChain made LangSmith faster.” True, but incomplete in the way a passing unit test can still hide a broken architecture.

Google Health Is Turning Fitbit Into an AI Data Layer, Not Just a Dashboard

Google Health’s latest update is easy to misread as another wellness-app consolidation story. Fitbit gets renamed, Apple Health gets mentioned, Gemini gets the predictable product cameo, and everyone moves on. That would miss the actual architecture decision: Google is trying to turn personal health data into an agent-ready substrate.

LocateAnything Fixes a Small-Looking VLM Bottleneck That Breaks Real Agents

Visual grounding is the kind of model capability that looks like plumbing right up until an agent clicks the wrong thing. A coding assistant can recover from a bad suggestion. A screen-operating agent that taps the wrong delete button, selects the wrong invoice total, or drags the wrong bounding box

MiniMax-M2.7 Is the Open-Weight Coding-Agent Release to Watch — With Some Benchmark Caveats Attached

MiniMax-M2.7 is the kind of model release that deserves attention and a raised eyebrow at the same time. It is open-weight, agent-focused, benchmark-heavy, and clearly aimed at the coding-agent comparison table where Claude, Codex, Copilot, Gemini, Qwen, and the local-model crowd are all fighting for developer mindshare. It also

VitaBench 2.0 Finds the Missing Capability in Personal Agents: Remembering Without Making a Mess

The personal-agent industry keeps treating memory like a feature toggle. Add embeddings, store a few preferences, retrieve the “relevant” chunks, and suddenly the assistant is supposed to know you. VitaBench 2.0 is useful because it says the quiet part out loud: storing user context is the easy half. Using

MobileGym Makes GUI-Agent Benchmarks Look Less Like Vibes and More Like Engineering

Mobile-agent demos have been winning the wrong argument. The impressive part is not that a model can tap through a shopping flow on a phone-shaped screenshot while a video plays nicely on social media. The hard part is proving what happened after the tap: which state changed, which side effects

Codex 0.134.0 Turns the Alpha Plumbing Into the Stable Runtime Contract

Codex 0.134.0 is not the release you show in a keynote. It is the release you want before you let a coding agent anywhere near a real engineering workflow. OpenAI promoted @openai/codex@0.134.0 to the stable npm latest channel on May 26, about 20 minutes