Voice agents have spent the last few years winning demos and losing in production. They could answer quickly and sound pleasant, yet still collapse the moment the user interrupted, changed their mind, asked for a policy-constrained action, or needed the system to check two pieces of backend state before speaking. That is