Microsoft Fires Back at AI Rivals With Three New MAI Models — Cheaper Than Google and OpenAI

Microsoft's MAI Superintelligence team shipped three production-grade multimodal AI models to Microsoft Foundry today — and is pricing them below comparable offerings from Google and OpenAI.

The three new MAI models

  • MAI-Transcribe-1 — Speech-to-text across 25 languages; 2.5× faster than Azure Fast; starts at $0.36/hour
  • MAI-Voice-1 — Audio generation; produces 60 seconds of voice in 1 second; custom voice support; starts at $22/million characters
  • MAI-Image-2 — Image/video generation (originally launched March 19 on MAI Playground); starts at $5/image

Why it matters

Microsoft is actively building its own model stack alongside its OpenAI partnership. A recently renegotiated contract now explicitly allows Microsoft to pursue its own superintelligence research — giving Mustafa Suleyman's MAI team the green light to compete directly with OpenAI and Google on foundation models.

Suleyman in the announcement blog post: "You'll see more models from us soon in Foundry and directly in Microsoft products and experiences."

Context: just this week, Microsoft also launched Copilot Cowork pairing GPT + Claude for enterprise research. Now it's shipping its own base models. The multi-model enterprise AI stack is taking shape.

🔗 Read the full story → TechCrunch


Stay ahead of AI. Subscribe to LGTM for daily AI model news.