The  LGTM
  • Home
  • Agentic Coding
  • Claude Code
  • Codex
Sign in Subscribe
Pydantic AI V2 Beta 4 Tightens the Agent Contract Where Production Systems Actually Break
ai-frameworks

Pydantic AI V2 Beta 4 Tightens the Agent Contract Where Production Systems Actually Break

Pydantic AI V2 Beta 4 is the kind of release that looks minor if you scan for features and important if you operate agents for a living. The headline is not “new agent toys.” The headline is that Pydantic is tightening the contracts around the places production agent systems actually
30 May 2026 4 min read
Claude Code’s Local Skills Turn Agent Plugins Into Runtime Policy
agentic-coding

Claude Code’s Local Skills Turn Agent Plugins Into Runtime Policy

Claude Code’s latest release is not the kind that wins a keynote slide. That is precisely why it matters. Version v2.1.157, published May 29, 2026, lands in the shadow of Anthropic’s bigger Opus 4.8 and dynamic-workflows push. The headline release promised large tasks, background subagents,
30 May 2026 5 min read
LLM rankings are splitting into quality and economics
llm-rankings

LLM rankings are splitting into quality and economics

The model leaderboard story this week is not that Anthropic still looks excellent on Arena. It is that the two most useful public scoreboards are now measuring different planets: Arena is still a quality contest, while OpenRouter has quietly become a production-economics chart. That distinction matters because teams keep treating
30 May 2026 5 min read
Entropy-Cut Sampling Makes Reasoning Models Look Less Like RL Magic and More Like Search
ai-models

Entropy-Cut Sampling Makes Reasoning Models Look Less Like RL Magic and More Like Search

The lazy story about reasoning models is that intelligence lives in the weights and inference is just the receipt printer. Reasoning with Sampling pushes against that. Its Entropy-Cut Metropolis-Hastings method makes reasoning look less like a mystical property unlocked only by RL posttraining and more like a search problem where
30 May 2026 4 min read
minWM Turns Video Generators Into Interactive World Models Without Hiding the Plumbing
ai-models

minWM Turns Video Generators Into Interactive World Models Without Hiding the Plumbing

World-model releases usually arrive dressed as demo reels: a camera glides through a generated scene, the physics mostly behaves, and everyone politely ignores the fact that the system took forever to render and cannot really respond like an environment. minWM is more interesting because it is not selling one magic
30 May 2026 3 min read
LoRA Memory Has a Capacity Law — and a Security Footnote Nobody Should Skip
ai-models

LoRA Memory Has a Capacity Law — and a Security Footnote Nobody Should Skip

LoRA adapters are usually sold as the cheap fine-tuning trick: fewer trainable parameters, lower VRAM, easier deployment, nice story for teams that cannot afford to retrain the universe every time their product changes. How LoRA Remembers? is useful because it treats adapters as something more specific and more dangerous: parametric
30 May 2026 4 min read
LLMSurgeon Is a Model-Provenance Audit for the Training Data Vendors Won’t Show You
ai-models

LLMSurgeon Is a Model-Provenance Audit for the Training Data Vendors Won’t Show You

Model vendors have trained everyone to accept a shrug where a supply-chain document should be. Ask what went into a model and you usually get a polite paragraph about “publicly available, licensed, and human-generated data,” which is useful in roughly the same way a nutrition label that says “food ingredients”
30 May 2026 4 min read
claude-code

NSA’s MCP Security Guidance Is the Checklist the Agent Ecosystem Was Trying Not to Need

MCP just got its first serious “please stop shipping vibes into production” memo from the people whose job is noticing when flexible systems become attack surfaces. The National Security Agency’s Artificial Intelligence Security Center has published a 17-page guidance document on Model Context Protocol security, and the useful part
30 May 2026 5 min read
openclaw

OpenClaw's Strict Local-Model Profile Admits the Obvious: Small Models Need Smaller Tool Surfaces

OpenClaw’s new strict profile for local models is a small config change that admits a larger truth: the bottleneck for local coding agents is not only model quality. It is surface area. PR #88181, opened on May 30, adds localModelLeanProfile: "basic" | "strict" to OpenClaw’s
29 May 2026 4 min read
OpenClaw's Codex Payload Patch Is a Reminder That Agent Adapters Must Not Send Phantom Capabilities
openclaw

OpenClaw's Codex Payload Patch Is a Reminder That Agent Adapters Must Not Send Phantom Capabilities

The most revealing agent-runtime bugs are often the ones that fit in a tiny diff. OpenClaw PR #88175 moves two Codex Responses fields inside an if statement. That is the whole source change: do not send tool_choice and parallel_tool_calls when the request has no tools. Small patch.
29 May 2026 4 min read
NVIDIA’s Model Card Generator Makes AI Governance Look Like a Build Artifact
nvidia

NVIDIA’s Model Card Generator Makes AI Governance Look Like a Build Artifact

AI governance keeps failing in the least dramatic place possible: the repo. Not the policy memo, not the executive risk committee, not the glossy “responsible AI” page. The real failure mode is that a model ships with unclear training-data notes, stale evaluation claims, vague limitations, missing license context, and a
29 May 2026 5 min read
DynoSim Turns Inference Tuning Into a Cheap Inner Loop Instead of an Expensive Cluster Guess
nvidia

DynoSim Turns Inference Tuning Into a Cheap Inner Loop Instead of an Expensive Cluster Guess

Inference tuning has officially entered the “you need a simulator before you touch production” phase. That is not because NVIDIA invented simulation this week. It is because modern LLM serving has accumulated enough interacting controls — tensor parallelism, prefill/decode split, routing, KV cache placement, autoscaling, cold starts, worker counts, backend
29 May 2026 5 min read
← Newer Posts Page 16 of 109 Older Posts →
The LGTM © 2026
  • Sign up
Powered by Ghost