Vibe coding's most common real-world use case, "build me a website," has had no rigorous benchmark until now. Vision2Web, accepted at ICML and built by Zehai He et al. at Zhipu AI, fills that gap with 193 tasks, 918 prototype images, and 1,255 test cases.