The agent boom has a glamour problem. Everyone wants to talk about the model that “thinks,” the assistant that “acts,” or the demo that “does work for you.” Almost nobody wants to talk about GPU topology, cache routing, Kubernetes orchestration, provenance, throughput per watt, or why your clever agent suddenly