Google integrates Lyria 3 music generation into Gemini with text-to-music and cover art
Google Deepmind has integrated its Lyria 3 model into Gemini, enabling users to generate 30-second music tracks with vocals, lyrics, and cover art from text prompts or uploaded media. The model represents an expansion of Google's multimodal AI capabilities into creative audio generation.
Google Deepmind Adds Music Generation to Gemini via Lyria 3
Google Deepmind has integrated Lyria 3, its music generation model, directly into Gemini, enabling users to create 30-second audio tracks from text descriptions or media uploads.
What Lyria 3 Does
The model generates complete music compositions that include:
- Vocal performances with sung lyrics
- Instrumental accompaniment
- Generated cover art for each track
Users can create tracks by providing a simple text prompt describing the desired music, or by uploading reference media to guide the generation. The output is fixed at 30 seconds per generation.
Integration into Gemini
The feature is now available as part of Gemini's multimodal capabilities, positioning it alongside the assistant's existing text, image, and video analysis functions. This represents Google's strategy to expand Gemini beyond conversational AI into creative content generation across multiple modalities.
Market Context
Lyria 3 joins an increasingly competitive field of AI music generation tools. Competing offerings include Meta's MusicGen and various other generative audio platforms, though specific comparative benchmarks between Lyria 3 and alternatives have not been published.
Google Deepmind has been developing music generation technology for several years, with previous iterations of the Lyria model demonstrating the company's incremental improvements in audio quality and compositional coherence.
Availability and Limitations
Detailed pricing information, API availability, and regional rollout plans have not yet been disclosed. The 30-second constraint on generated tracks suggests the feature is currently positioned for short-form content generation rather than full-length composition.
What This Means
Google is leveraging its Deepmind research division to add differentiated features to Gemini, attempting to create stickier use cases beyond text generation. While music generation remains technically challenging and output quality varies significantly based on prompt quality, integrating it into a consumer-facing product puts the technology in front of millions of users. This move signals that major AI labs now view creative content generation—not just analysis—as core to their product strategy.
Related Articles
Google triples Gemini usage limits in Antigravity coding tool twice in one week after user complaints
Google has raised Gemini usage limits in its Antigravity coding tool by 3x twice within one week, responding to developers who hit new compute-based quotas within hours. The company also reset weekly quotas for all paid users twice, though limits remain lower than pre-restriction levels.
Google announces Spark AI agent, Information agents, and Android Halo at I/O 2026—all paywalled behind $100/month Ultra
Google announced multiple AI agent products at I/O 2026, including Spark for managing digital tasks, Information agents for 24/7 topic monitoring, and Android Halo for notifications. All features remain paywalled behind the $100/month Gemini Ultra plan, with free access timeline unspecified.
Google cuts AI Ultra plan to $200/month, launches new $100 developer tier
Google announced pricing changes to its Gemini AI subscription tiers at I/O 2026, cutting its top AI Ultra plan from $250 to $200 per month while introducing a new $100/month developer-focused tier. All plans now get access to Gemini 3.5 Flash and the new Gemini Omni video generation model.
llm-gemini Plugin Adds Support for Google's Gemini 3.5 Flash Model
Developer Simon Willison released version 0.32 of the llm-gemini plugin, which adds support for Google's Gemini 3.5 Flash model. The plugin enables command-line access to Google's Gemini model family through the LLM tool.
Comments
Loading...