Google releases Lyria 3 Clip Preview for music generation via API
Google has released Lyria 3 Clip Preview, a music generation model available through the Gemini API as of March 30, 2026. The model generates 30-second audio clips from text prompts or images at $0.04 per clip, with a 1,048,576 token context window.
Google Lyria 3 Clip Preview — Quick Specs
Google Releases Lyria 3 Clip Preview Music Generation Model
Google has launched Lyria 3 Clip Preview, a music generation model available through the Gemini API starting March 30, 2026. The model generates short audio clips, loops, and previews from text prompts or images.
Key Specifications
Context Window: 1,048,576 tokens
Pricing: $0.04 per 30-second audio clip. Input and output token pricing is listed as $0/M, suggesting the per-clip pricing model supersedes traditional token-based billing.
Audio Quality: The model generates high-quality, 48kHz stereo audio with structural coherence, including vocals, timed lyrics, and full instrumental arrangements.
Capabilities
Lyria 3 Clip is part of Google's broader Lyria 3 family of music generation models. It can:
- Generate audio from text prompts
- Generate audio from images
- Produce clips up to 30 seconds in duration
- Create loops and preview content
- Generate vocals with synchronized lyrics
- Arrange full instrumental compositions
Availability
The model is accessible through the Gemini API and is routed through OpenRouter, which handles provider selection and fallback management for uptime optimization. Developers can access the model using OpenAI-compatible APIs or through the OpenRouter SDK.
Usage data shows current demand, with prompt activity at 410K and completion activity at 280K tokens tracked across the platform in recent monitoring periods.
What This Means
Google enters the consumer music generation market with a pricing model that simplifies billing compared to token-based systems—developers pay per 30-second clip rather than tracking input/output token consumption. The 1M+ token context window is a technical feature that likely supports longer creative instructions or batch processing capabilities, though typical use cases focus on discrete clip generation. This positions Lyria 3 Clip as a tool for music preview generation, loop creation, and short-form content production, competing with existing music AI tools at a transparent, clip-based price point.
Related Articles
Google releases Lyria 3 Pro Preview for full-length music generation
Google has released Lyria 3 Pro Preview, a music generation model capable of producing full-length songs with verses, choruses, bridges, vocals, and timed lyrics from text prompts or images. The model features a 1,048,576 token context window and charges $0.08 per generated song through the Gemini API.
Google launches Lyria 3 Pro music generator, claims training data is rights-cleared
Google has released Lyria 3 Pro, its latest AI music generation model capable of creating tracks up to three minutes long with improved understanding of musical structure. The model is available through Gemini, Google Vids, Vertex AI, and Google AI Studio. Google claims the training data comes from sources it has contractual and legal rights to use.
Google releases Gemini 3.1 Flash Live, claims improved audio recognition and lower latency for voice conversations
Google announced Gemini 3.1 Flash Live as its updated audio and voice model for Gemini Live and Search Live. The model claims improved acoustic recognition, better background noise filtering, support for over 90 languages, and lower latency compared to 2.5 Flash Native Audio.
Google releases Gemini 3.1 Flash Live, its highest-quality audio model for real-time voice AI
Google has released Gemini 3.1 Flash Live, its highest-quality audio and voice model designed for real-time dialogue. The model scores 90.8% on ComplexFuncBench Audio and 36.1% on Scale AI's Audio MultiChallenge with reasoning enabled, with improved tonal understanding and lower latency compared to previous versions.
Comments
Loading...