Google releases Lyria 3 Clip Preview for music generation via API
Google has released Lyria 3 Clip Preview, a music generation model available through the Gemini API as of March 30, 2026. The model generates 30-second audio clips from text prompts or images at $0.04 per clip, with a 1,048,576 token context window.
Google Lyria 3 Clip Preview — Quick Specs
Google Releases Lyria 3 Clip Preview Music Generation Model
Google has launched Lyria 3 Clip Preview, a music generation model available through the Gemini API starting March 30, 2026. The model generates short audio clips, loops, and previews from text prompts or images.
Key Specifications
Context Window: 1,048,576 tokens
Pricing: $0.04 per 30-second audio clip. Input and output token pricing is listed as $0/M, suggesting the per-clip pricing model supersedes traditional token-based billing.
Audio Quality: The model generates high-quality, 48kHz stereo audio with structural coherence, including vocals, timed lyrics, and full instrumental arrangements.
Capabilities
Lyria 3 Clip is part of Google's broader Lyria 3 family of music generation models. It can:
- Generate audio from text prompts
- Generate audio from images
- Produce clips up to 30 seconds in duration
- Create loops and preview content
- Generate vocals with synchronized lyrics
- Arrange full instrumental compositions
Availability
The model is accessible through the Gemini API and is routed through OpenRouter, which handles provider selection and fallback management for uptime optimization. Developers can access the model using OpenAI-compatible APIs or through the OpenRouter SDK.
Usage data shows current demand, with prompt activity at 410K and completion activity at 280K tokens tracked across the platform in recent monitoring periods.
What This Means
Google enters the consumer music generation market with a pricing model that simplifies billing compared to token-based systems—developers pay per 30-second clip rather than tracking input/output token consumption. The 1M+ token context window is a technical feature that likely supports longer creative instructions or batch processing capabilities, though typical use cases focus on discrete clip generation. This positions Lyria 3 Clip as a tool for music preview generation, loop creation, and short-form content production, competing with existing music AI tools at a transparent, clip-based price point.
Related Articles
DeepSeek Releases V4 Models: 1M Context Window, 90% Less KV Cache Than V3
DeepSeek has released two new MoE models: DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated). Both models support a one million token context window and use a hybrid attention architecture that requires only 27% of single-token inference FLOPs and 10% of KV cache compared to DeepSeek-V3.2.
OpenAI previews GPT-5.6 to select partners with three variants priced from $1 to $30 per million tokens
OpenAI has begun previewing its GPT-5.6 series to a limited group of trusted partners after government review. The release includes three variants: Sol at $5 input/$30 output per million tokens, Terra at $2.50/$15, and Luna at $1/$6.
OpenAI announces GPT-5.6 series with Sol flagship, Terra at 50% cost of GPT-5.5, and Luna budget model
OpenAI has begun a limited preview of its GPT-5.6 series, introducing three models: Sol (flagship), Terra (2x cheaper than GPT-5.5 with competitive performance), and Luna (lowest cost option). The models are launching first with trusted partners before general availability in coming weeks, following U.S. government preview requirements.
OpenAI's ChatGPT 5.6 release restricted to government-approved customers initially
OpenAI will release ChatGPT 5.6 first to customers approved by the federal government, according to a staff memo from CEO Sam Altman. The company plans a broader release "a couple of weeks later," marking a significant departure from typical model rollouts.
Comments
Loading...