The  LGTM
  • Home
  • Agentic Coding
  • Claude Code
  • Codex
Sign in Subscribe
DGX Spark’s Community vLLM Stack Shows Local AI Now Comes With Wheels, Switches, and Pager Duty
nvidia

DGX Spark’s Community vLLM Stack Shows Local AI Now Comes With Wheels, Switches, and Pager Duty

The least glamorous part of local AI is becoming the part that matters most: the wheel file. That is the useful signal from eugr/spark-vllm-docker, a community-maintained Docker stack for running vLLM on NVIDIA DGX Spark and GB10-class systems. The project shipped same-day prebuilt vLLM and FlashInfer wheels on May
11 May 2026 5 min read
A Tiny Grok API Bug Shows Why xAI's May 15 Model Retirement Will Hurt Real Integrations
xai

A Tiny Grok API Bug Shows Why xAI's May 15 Model Retirement Will Hurt Real Integrations

A small 400 error is usually not a story. It is a line item in somebody's issue tracker, a one-line guard in a provider adapter, and maybe a patch release if everyone is lucky. But the fresh Hermes Agent report against xAI's grok-4-1-fast is useful because
11 May 2026 4 min read
Semantic Kernel 1.76.0 Ships a Security-Heavy .NET Patch After Microsoft’s Prompt-Injection RCE Research
ai-frameworks

Semantic Kernel 1.76.0 Ships a Security-Heavy .NET Patch After Microsoft’s Prompt-Injection RCE Research

Semantic Kernel 1.76.0 is the kind of release that looks boring until you read it with the right threat model. Microsoft did not ship a new agent abstraction here. It shipped path validation, OpenAPI escaping, gRPC address allowlists, cloud-drive defaults, upload-directory controls, and dependency vulnerability updates. In other
11 May 2026 5 min read
codex

Codex /goal Is the Background-Agent Feature That Needs a Definition of Done Before It Needs More Autonomy

Codex /goal sounds like another autonomy feature. It is better understood as a forcing function for engineering discipline. The feature lets Codex keep working across turns toward a durable objective, but the useful part is not that the agent can run for hours. The useful part is that OpenAI’s
11 May 2026 4 min read
Codex Remote Connections Move the Agent Onto the Devbox — Which Means Your SSH Boundary Is Now the Product Boundary
codex

Codex Remote Connections Move the Agent Onto the Devbox — Which Means Your SSH Boundary Is Now the Product Boundary

Codex remote connections look like a convenience feature until you say the quiet part out loud: the agent is no longer working on the laptop. It is working where the real environment lives. For many teams, that means a devbox with private packages, internal services, corporate certificates, GPU drivers, staging-adjacent
11 May 2026 4 min read
Codex Security Is OpenAI’s Bet That AppSec Agents Need Threat Models, Not More Alert Spam
codex

Codex Security Is OpenAI’s Bet That AppSec Agents Need Threat Models, Not More Alert Spam

Codex Security is not interesting because OpenAI found a way to put another AI badge on vulnerability scanning. The interesting part is narrower and more useful: OpenAI is trying to make the threat model the unit of work. That is the piece most scanner products skip, and it is the
11 May 2026 4 min read
OpenCode’s 158K-Star Hedge Says the Coding-Agent Split Is Now Structural
claude-code

OpenCode’s 158K-Star Hedge Says the Coding-Agent Split Is Now Structural

The useful way to read OpenCode’s momentum is not as a referendum on Anthropic. It is a referendum on where developers want the control boundary for coding agents to live. The New Stack’s May 10 piece framed that split well: Anthropic is building the managed harness, while OpenCode
11 May 2026 4 min read
DeepClaude Turns Claude Code Cost Pain Into a Backend-Switching Pattern
claude-code

DeepClaude Turns Claude Code Cost Pain Into a Backend-Switching Pattern

DeepClaude is the kind of project that shows up when a product has crossed from novelty into line item. Developers like the Claude Code workflow. They do not always like the bill, the caps, or the feeling that every routine edit needs to burn premium-model tokens. So a small wrapper
11 May 2026 4 min read
OpenClaude 0.10 Is the Model-Agnostic Claude Code Fork Growing Up in Public
claude-code

OpenClaude 0.10 Is the Model-Agnostic Claude Code Fork Growing Up in Public

OpenClaude v0.10.0 is not interesting because it is a fork wearing a new badge. It is interesting because it makes a bet that more developers are about to make explicitly: the Claude Code workflow is valuable enough to preserve, but too strategic to leave entirely inside one vendor’
11 May 2026 4 min read
The Cheapest Local AI GPU Story Is Really About Old NVIDIA Data-Center Silicon Finding a Second Life
nvidia

The Cheapest Local AI GPU Story Is Really About Old NVIDIA Data-Center Silicon Finding a Second Life

The most interesting local-AI GPU this week is not new, friendly, or remotely normal. It is a 2017 NVIDIA Tesla V100 SXM2 server accelerator, pulled out of the data-center afterlife, bolted to an SXM-to-PCIe adapter, cooled with a 3D-printed duct, and asked to run Ollama like it was born for
10 May 2026 5 min read
vLLM 0.20.2 Is a Patch Release About the Boring Parts That Decide Whether Local Inference Works
nvidia

vLLM 0.20.2 Is a Patch Release About the Boring Parts That Decide Whether Local Inference Works

vLLM 0.20.2 is the kind of release that will never trend, which is exactly why it matters. The patch notes are six commits from six contributors, mostly fixes for DeepSeek V4, gpt-oss MXFP4, and Qwen3-VL. No launch video. No grand benchmark chart. Just the maintenance work that decides
10 May 2026 4 min read
Qwen Code 0.15.10 Turns Long Coding Sessions Into a Tool-Budget Problem
ai-frameworks

Qwen Code 0.15.10 Turns Long Coding Sessions Into a Tool-Budget Problem

Qwen Code 0.15.10 is a coding-agent release about budgets. Not the finance kind, although your token bill may have opinions. The important budgets are tool schemas, context windows, and reusable instructions. Those are the constraints that decide whether an agent can keep working after the demo, after the
10 May 2026 4 min read
← Newer Posts Page 66 of 114 Older Posts →
The LGTM © 2026
  • Sign up
Powered by Ghost