Google's Lyria 3 Pro extends AI music generation to 3-minute songs with structural control
Google released Lyria 3 Pro, an updated music generation model capable of creating full 3-minute songs—six times longer than the 30-second limit of its predecessor launched last month. The new version adds granular control over song structure, allowing users to specify intros, verses, choruses, and bridges. It's available now for paid Gemini users, enterprise customers, and developers via API.
Google Extends AI Music Generation with Lyria 3 Pro
Google has released Lyria 3 Pro, an updated version of its AI music generation model that increases maximum song length from 30 seconds to 3 minutes. The model also introduces structured composition controls and is now available to paid Gemini users, enterprise customers on Vertex AI, and developers via the Gemini API and Google AI Studio.
Key Improvements
Beyond the 6x increase in generation length, Lyria 3 Pro adds control over specific song elements. Users can now prompt the model to generate individual components—intros, verses, choruses, bridges—giving creators more agency over compositional structure. Google claims the model "better understands musical composition" and is "great for experimenting with different styles or generating songs with complex transitions."
The tool is also being integrated into Google Vids, Google's AI video generation platform, enabling music-to-video workflows within a single environment.
Responsible Training and Watermarking
Google states it trained Lyria 3 Pro exclusively on materials with confirmed rights. All generated outputs are embedded with SynthID, Google's watermarking system designed to identify AI-generated audio content. This addresses growing concerns about provenance in the wake of widespread AI-generated music spam.
Market Context
The release comes as AI music generation faces significant friction in streaming ecosystems. Spotify currently removes approximately 50,000 AI-generated tracks daily, and deleted 75 million spam tracks in 2024 alone. The proliferation of low-quality AI music has created infrastructure challenges for platforms and raises questions about the practical utility of yet another music generation tool.
Google's integration with existing services (Gemini, Workspace, API ecosystem) and compliance mechanisms (SynthID watermarking, rights-cleared training data) position Lyria 3 Pro differently from standalone music generators, though adoption will depend on actual user demand and the model's output quality.
What This Means
Lyria 3 Pro represents incremental capability expansion rather than fundamental innovation in AI music generation. The 3-minute ceiling and structural controls make it viable for limited use cases—video backgrounds, demo tracks—but don't solve the platform fragmentation or quality concerns plaguing AI-generated music. Google's emphasis on training ethics and watermarking reflects industry pressure, not technical breakthroughs. Success depends on whether Workspace integration can create a primary use case before the broader industry backlash against AI music consolidates.
Related Articles
Google DeepMind's Gemini 3.1 Flash-Lite generates websites in real time, 2.5x faster than predecessor
Google DeepMind released Gemini 3.1 Flash-Lite, a model that generates functional websites in real time through a new pseudo-browser demo. The model achieves first response token 2.5 times faster than Gemini 2.5 Flash and outputs over 360 tokens per second, though output pricing has tripled from $0.40 to $1.50 per million tokens.
OpenAI shuts down Sora video app amid declining user engagement and strategic shift
OpenAI announced the discontinuation of Sora, its consumer video generation app and API. The shutdown follows declining user engagement and aligns with OpenAI's strategic pivot toward enterprise AI products and robotics research, with Disney exiting a planned $1 billion investment.
Stable Diffusion 3.5 TensorRT optimization delivers 2x faster generation, 40% less VRAM on RTX GPUs
Stability AI has released TensorRT-optimized versions of the Stable Diffusion 3.5 model family in collaboration with NVIDIA. The optimization uses FP8 quantization to achieve 2x faster generation speed and 40% lower VRAM requirements on supported RTX GPUs.
OpenAI Python SDK v2.23.0 adds GPT-4 Realtime 1.5 and Audio 1.5 model support
OpenAI released Python SDK version 2.23.0 on February 24, 2026, adding support for two new model options in realtime API calls: gpt-realtime-1.5 and gpt-audio-1.5. The minor update primarily focuses on expanding model availability for real-time applications.
Comments
Loading...