nvidia - The LGTM (Page 4)

The LGTM

Sign in Subscribe

nvidia

A collection of 139 posts

DSX Is NVIDIA Admitting Tokens Are an Industrial Operations Problem

DSX Is NVIDIA Admitting Tokens Are an Industrial Operations Problem

NVIDIA’s DSX announcement is easy to file under “more AI factory branding,” which would be a mistake. The useful thing buried in the launch is not the phrase AI factory. It is the phrase token performance per megawatt. That is NVIDIA admitting, out loud, that tokens are no longer

BlueField-4 STX Makes Agent Security a Storage-Path Problem

BlueField-4 STX Makes Agent Security a Storage-Path Problem

Agent security is usually sold as an application problem: add an approval prompt, scope a tool, log a transcript, maybe slap a policy engine next to the orchestrator and call it governance. NVIDIA’s Vera BlueField-4 STX announcement is more interesting because it starts from a less comfortable premise: if

NVIDIA’s NVFP4 MaxText Recipe Makes 4-Bit Training Look Operational, Not Experimental

NVIDIA’s NVFP4 MaxText Recipe Makes 4-Bit Training Look Operational, Not Experimental

Everyone wants to talk about agents because agents demo well. NVIDIA’s more important June 9 story is less photogenic: making 4-bit training boring enough that infrastructure teams can put it into a budget spreadsheet. The company published a JAX and MaxText recipe for training large language models with NVFP4

Doosan Shows NVIDIA’s AI Factory Problem Is Also Power, Boards, and Robots

Doosan Shows NVIDIA’s AI Factory Problem Is Also Power, Boards, and Robots

The Doosan announcement is the NVIDIA Korea story that looks the least like software news and the most like actual infrastructure news. That is exactly why it deserves attention. AI factories are usually described as if the hard part is choosing accelerators and drawing a heroic rack diagram. Doosan is

SK hynix Is the Memory Roadmap Behind NVIDIA’s AI Factory Pitch

SK hynix Is the Memory Roadmap Behind NVIDIA’s AI Factory Pitch

The NVIDIA-SK hynix partnership is the least flashy kind of AI news and therefore one of the most important. It is not a new chatbot, not a benchmark victory, not a demo of a robot folding a shirt under studio lighting. It is a reminder that the token economy runs

LG Is NVIDIA’s Physical-AI Productization Test, Not Just Another AI Factory Buyer

LG Is NVIDIA’s Physical-AI Productization Test, Not Just Another AI Factory Buyer

NVIDIA’s deal with LG is easy to misread as another “AI factory” press release, which is now the industry’s default way to say “we bought a lot of GPUs and would like the market to clap.” The more useful read is sharper: LG is one of the few

Korea Is NVIDIA’s Physical-AI Supply Chain Test Case, Not Just Jensen’s Press Tour

Korea Is NVIDIA’s Physical-AI Supply Chain Test Case, Not Just Jensen’s Press Tour

NVIDIA’s Seoul liveblog looks, at first glance, like the usual CEO roadshow content: airport arrival, gaming café appearances, a few partner smiles, and enough fried chicken mythology to keep the local press fed for a week. Read it that way and it is easy to dismiss. Read it as

Nemotron 3 Ultra Is NVIDIA’s Answer to the Agent Invoice Problem

Nemotron 3 Ultra Is NVIDIA’s Answer to the Agent Invoice Problem

NVIDIA’s Nemotron 3 Ultra launch is not a normal “new model, bigger number” announcement. The interesting claim is more operational: long-running agents are becoming expensive enough that model quality and model economics can no longer be evaluated separately. If an agent needs 40 tool calls, three retries, a sub-agent

MoE Inference Is Becoming a Rack-Scale Systems Problem, Not Architecture Trivia

MoE Inference Is Becoming a Rack-Scale Systems Problem, Not Architecture Trivia

Mixture-of-experts models used to be a model-architecture detail. Now they are an infrastructure procurement strategy. NVIDIA’s latest Blackwell NVL72 pitch is nominally about MoE models running “10x faster” at “one-tenth the token cost.” Fine. Vendor math belongs in the same drawer as benchmark charts until proven otherwise. But the

Microsoft and NVIDIA Are Building the Agent Stack From Laptop to AI Factory

Microsoft and NVIDIA Are Building the Agent Stack From Laptop to AI Factory

The loud version of the Microsoft-and-NVIDIA story is hardware: RTX Spark PCs, DGX Station for Windows, Vera Rubin, Grace Blackwell, petaflops, unified memory, AI factories. That is the version built for keynote slides. The more important version is quieter: Microsoft and NVIDIA are trying to make agent deployment span laptop,

GraspGen-X and NitroGen Show NVIDIA’s Real Physical-AI Bet: Scalable Action Data

GraspGen-X and NitroGen Show NVIDIA’s Real Physical-AI Bet: Scalable Action Data

Embodied AI keeps rediscovering the same uncomfortable truth: intelligence is not the hard part in isolation. Contact is hard. Latency is hard. Hardware variation is hard. The gap between a clean simulated policy and a robot that reliably grasps the weird object in front of it is where most robotics