model release

Google releases Lyria 3 Clip Preview for music generation via API

TL;DR

Google has released Lyria 3 Clip Preview, a music generation model available through the Gemini API as of March 30, 2026. The model generates 30-second audio clips from text prompts or images at $0.04 per clip, with a 1,048,576 token context window.

2 min read
0

Google Releases Lyria 3 Clip Preview Music Generation Model

Google has launched Lyria 3 Clip Preview, a music generation model available through the Gemini API starting March 30, 2026. The model generates short audio clips, loops, and previews from text prompts or images.

Key Specifications

Context Window: 1,048,576 tokens

Pricing: $0.04 per 30-second audio clip. Input and output token pricing is listed as $0/M, suggesting the per-clip pricing model supersedes traditional token-based billing.

Audio Quality: The model generates high-quality, 48kHz stereo audio with structural coherence, including vocals, timed lyrics, and full instrumental arrangements.

Capabilities

Lyria 3 Clip is part of Google's broader Lyria 3 family of music generation models. It can:

  • Generate audio from text prompts
  • Generate audio from images
  • Produce clips up to 30 seconds in duration
  • Create loops and preview content
  • Generate vocals with synchronized lyrics
  • Arrange full instrumental compositions

Availability

The model is accessible through the Gemini API and is routed through OpenRouter, which handles provider selection and fallback management for uptime optimization. Developers can access the model using OpenAI-compatible APIs or through the OpenRouter SDK.

Usage data shows current demand, with prompt activity at 410K and completion activity at 280K tokens tracked across the platform in recent monitoring periods.

What This Means

Google enters the consumer music generation market with a pricing model that simplifies billing compared to token-based systems—developers pay per 30-second clip rather than tracking input/output token consumption. The 1M+ token context window is a technical feature that likely supports longer creative instructions or batch processing capabilities, though typical use cases focus on discrete clip generation. This positions Lyria 3 Clip as a tool for music preview generation, loop creation, and short-form content production, competing with existing music AI tools at a transparent, clip-based price point.

Related Articles

model release

Google releases Gemini 3.1 Flash Lite with 1M context at $0.25 per million input tokens

Google has released Gemini 3.1 Flash Lite, a high-efficiency multimodal model with a 1,048,576 token context window priced at $0.25 per million input tokens and $1.50 per million output tokens. The model supports text, image, video, audio, and PDF inputs with four thinking levels for cost-performance optimization.

model release

Baidu Releases Qianfan-OCR-Fast Model with 66K Context at $0.68 Per 1M Input Tokens

Baidu has released Qianfan-OCR-Fast, a multimodal model specialized for optical character recognition tasks. The model offers a 66,000 token context window and is priced at $0.68 per 1M input tokens and $2.81 per 1M output tokens.

model release

Tencent Releases Hy3 Preview: Mixture-of-Experts Model with 262K Context and Configurable Reasoning

Tencent has released Hy3 preview, a Mixture-of-Experts model with a 262,144 token context window priced at $0.066 per million input tokens and $0.26 per million output tokens. The model features three configurable reasoning modes—disabled, low, and high—designed for agentic workflows and production environments.

model release

IBM Releases 97M-Parameter Granite Embedding Model With 60.3 MTEB Score — Highest Retrieval Quality Under 100M Parameter

IBM released two new multilingual embedding models under Apache 2.0: a 97M-parameter compact model scoring 60.3 on MTEB Multilingual Retrieval (highest in its size class) and a 311M full-size model scoring 65.2. Both support 200+ languages with enhanced retrieval for 52 languages, handle 32K-token context (64x increase over predecessors), and include code retrieval across 9 programming languages.

Comments

Loading...