ElevenLabs launches Music v2 with mid-track genre switching and section-by-section composition

TL;DR

ElevenLabs released Music v2, an AI music generation model that can switch genres within a single track and build songs section by section. According to ElevenLabs, the model is trained on licensed data and cleared for commercial use; it can transition from opera to heavy metal, handle fast rap, and add sound effects while maintaining coherence.

ElevenLabs released Music v2, an AI music generation model that can switch genres within a single track, 10 months after launching its first music generation model.

Key capabilities

According to ElevenLabs, Music v2 can:

  • Switch between genres mid-track, transitioning from opera to heavy metal and back
  • Handle fast rap while maintaining coherence
  • Add non-musical sound effects to tracks
  • Edit specific sections of a song using prompts without affecting other parts
  • Build songs section by section (intro, verse, chorus) and stitch them together
  • Generate vocals and complex compositions across multiple languages

The model marks a shift from generating short clips to constructing full songs with discrete sections that can be assembled.
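
To make the section-based idea concrete, here is a minimal sketch of how a song could be represented as an ordered list of sections, with a targeted edit that touches one section and leaves the others alone. Everything in it is hypothetical: the `Section` type and the `edit_section` and `stitch` functions are illustrative names, not part of any ElevenLabs product or API, and the code manipulates text prompts only rather than generating audio.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class Section:
    """Hypothetical stand-in for one part of a song (not an ElevenLabs type)."""
    name: str          # e.g. "intro", "verse", "chorus"
    prompt: str        # text prompt describing this section
    revision: int = 0  # incremented each time the section is re-prompted


def edit_section(song: List[Section], name: str, new_prompt: str) -> List[Section]:
    """Return a new song in which only the named section has changed."""
    if all(s.name != name for s in song):
        raise KeyError(name)
    return [
        replace(s, prompt=new_prompt, revision=s.revision + 1) if s.name == name else s
        for s in song
    ]


def stitch(song: List[Section]) -> str:
    """Join the sections in order into a one-line arrangement summary."""
    return " -> ".join(f"{s.name}[r{s.revision}]" for s in song)


song = [
    Section("intro", "solo operatic soprano"),
    Section("verse", "fast rap over sparse drums"),
    Section("chorus", "heavy metal guitars"),
]
edited = edit_section(song, "verse", "fast rap over orchestral strings")
print(stitch(edited))  # intro[r0] -> verse[r1] -> chorus[r0]
```

Only the verse carries a new revision number after the edit; the intro and chorus objects are reused unchanged, which is the property that distinguishes a targeted edit from regenerating the whole track.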

Licensing and commercial use

ElevenLabs emphasized that Music v2 is trained on licensed data and cleared for commercial use. This approach differs from competitors Suno and Udio, which both face ongoing copyright litigation from major labels.

Availability

Music v2 is available through:

  • ElevenCreative tool for marketing and branding teams
  • ElevenMusic platform for AI-generated song creation
  • ElevenAPI (coming soon)

Pricing details were not disclosed.

Market context

The release intensifies competition in AI music generation. In recent months:

  • Google added song covers, section editing, and music video generation to its Flow Music tool at Google I/O
  • Stability AI released new music generation capabilities
  • Suno launched updated models for longer, more complex tracks

What this means

Music v2's section-based composition approach addresses a key limitation in AI music generation: the inability to make targeted edits. By allowing artists to modify specific parts without regenerating entire tracks, ElevenLabs moves closer to professional music production workflows. The emphasis on licensed training data positions the company to avoid the legal challenges facing competitors, though questions remain about whether labels will embrace AI-generated music at scale. The mid-track genre switching capability, while technically impressive, may have limited practical applications beyond novelty tracks and experimental compositions.
