model release

Google releases Lyria 3 Pro Preview for full-length music generation

TL;DR

Google has released Lyria 3 Pro Preview, a music generation model capable of producing full-length songs with verses, choruses, bridges, vocals, and timed lyrics from text prompts or images. The model features a 1,048,576 token context window and charges $0.08 per generated song through the Gemini API.

2 min read
0

Google Releases Lyria 3 Pro Preview for Full-Length Music Generation

Google has released Lyria 3 Pro Preview, a music generation model that produces complete songs with structural coherence, including vocals, timed lyrics, and full instrumental arrangements.

Key Specifications

Lyria 3 Pro Preview launched on March 30, 2026. The model accepts text prompts or images as input and generates high-quality 48kHz stereo audio output. According to Google, the model can generate full-length songs featuring verses, choruses, and bridges with maintained musical structure.

The model supports a context window of 1,048,576 tokens. Pricing is structured differently from traditional token-based models: $0.08 per generated full-length song. Input and output token pricing shows as $0/M, reflecting Google's song-based pricing model rather than token-based billing.

Distribution and Access

Lyria 3 Pro Preview is available through the Gemini API. The model is also accessible via OpenRouter, which provides unified API access across multiple providers and handles request routing for optimal uptime and performance.

OpenRouter's integration normalizes requests and responses, allowing developers to call Lyria 3 Pro Preview using OpenAI-compatible SDK calls or directly through the OpenRouter API. The platform tracks usage metrics and provides uptime statistics across providers.

Capabilities

The model handles both text-to-audio and image-to-audio generation workflows. Google claims Lyria 3 Pro delivers "structural coherence" in generated songs, a technical specification addressing a known limitation in earlier music generation systems where songs could lose harmonic and rhythmic consistency across sections.

The inclusion of "Pro" in the name suggests a higher-capability tier within the Lyria 3 family, implying standard and potentially other variants may exist with different feature sets or pricing.

What This Means

Google's release of Lyria 3 Pro Preview marks a shift toward production-ready music generation. The fixed $0.08-per-song pricing eliminates variable costs based on song length, potentially making long-form music generation more economical than token-based pricing would allow. However, the "Preview" designation indicates this remains a limited availability release, suggesting broader availability may require additional refinement or capacity planning by Google. For music applications, content creators, and developers building audio tools, this provides a new option from a major cloud provider with significant infrastructure backing.

Related Articles

model release

Google releases Gemini 3.1 Flash Lite with 1M context at $0.25 per million input tokens

Google has released Gemini 3.1 Flash Lite, a high-efficiency multimodal model with a 1,048,576 token context window priced at $0.25 per million input tokens and $1.50 per million output tokens. The model supports text, image, video, audio, and PDF inputs with four thinking levels for cost-performance optimization.

model release

Baidu Releases Qianfan-OCR-Fast Model with 66K Context at $0.68 Per 1M Input Tokens

Baidu has released Qianfan-OCR-Fast, a multimodal model specialized for optical character recognition tasks. The model offers a 66,000 token context window and is priced at $0.68 per 1M input tokens and $2.81 per 1M output tokens.

model release

Tencent Releases Hy3 Preview: Mixture-of-Experts Model with 262K Context and Configurable Reasoning

Tencent has released Hy3 preview, a Mixture-of-Experts model with a 262,144 token context window priced at $0.066 per million input tokens and $0.26 per million output tokens. The model features three configurable reasoning modes—disabled, low, and high—designed for agentic workflows and production environments.

model release

IBM Releases 97M-Parameter Granite Embedding Model With 60.3 MTEB Score — Highest Retrieval Quality Under 100M Parameter

IBM released two new multilingual embedding models under Apache 2.0: a 97M-parameter compact model scoring 60.3 on MTEB Multilingual Retrieval (highest in its size class) and a 311M full-size model scoring 65.2. Both support 200+ languages with enhanced retrieval for 52 languages, handle 32K-token context (64x increase over predecessors), and include code retrieval across 9 programming languages.

Comments

Loading...