Text-to-speech has spent the last few years stuck in an awkward middle state. The demos kept getting smoother, the voices kept getting less robotic, and the APIs kept pretending that the remaining problem was mostly cosmetic. Pick a voice, maybe tweak the speed slider, ship a narrator, call it innovation.