speech-synthesis

2 articles tagged with speech-synthesis

April 2, 2026
model releaseMicrosoft

Microsoft releases three in-house AI models for speech and images, signaling independence from OpenAI

Microsoft released public preview versions of three proprietary AI models: MAI-Transcribe-1 for speech recognition across 25 languages at 50% lower GPU cost than alternatives, MAI-Voice-1 for speech synthesis generating 60 seconds of audio in under a second, and MAI-Image-2 for text-to-image generation. The models are available exclusively through Microsoft Azure AI Foundry and already power Copilot, Bing, and PowerPoint.

March 11, 2026
model release

Hume AI releases TADA-1B, a 1 billion parameter text-to-speech model

Hume AI has released TADA-1B, a 1 billion parameter text-to-speech model available on Hugging Face under an MIT license. The model, which combines speech and language capabilities, has already accumulated over 3,100 downloads since its January 12 release.