Stability AI releases Stable Audio 2.5 for enterprise sound production

TL;DR

Stability AI released Stable Audio 2.5, positioned as the first audio generation model built specifically for enterprise sound production. The model introduces improvements in quality and control for creating dynamic compositions adaptable to custom brand needs.

March 24, 2026 · 5:22 PM1 min read

Stability AI has released Stable Audio 2.5, positioning the model as the first audio generation system built for enterprise-grade sound production at scale.

The company claims the model introduces advancements in both quality and control, addressing demand from enterprises needing dynamic audio compositions that can be customized for specific brand requirements.

Key Details

Stability AI has not disclosed specific technical specifications including model size, training data details, context window, pricing, or detailed benchmark comparisons at this time. The release announcement emphasizes the model's targeting of enterprise workflows rather than consumer or research applications.

The distinction as "enterprise-focused" suggests the model is optimized for production reliability, consistency, and commercial use cases—potentially including licensing, support, and integration infrastructure—rather than representing a fundamental capability leap over prior audio generation models.

What This Means

Stability AI is repositioning its audio generation capabilities toward commercial customers willing to pay for reliability and support. The emphasis on "enterprise-grade" and "at scale" indicates focus on businesses needing production-ready audio rather than hobbyist or research users. Without disclosed pricing, benchmarks, or technical specifications, claims about improvements remain unverifiable. The audio generation space remains nascent but competitive, with other players exploring text-to-speech, music generation, and sound design applications. Stability AI's enterprise positioning suggests confidence in production readiness, though the lack of transparent specifications makes independent evaluation impossible at launch.

Source: stability.ai ↗

stability-ai audio-generation enterprise-ai model-release sound-production

model releaseMay 8, 2026

Tencent Releases Hy3 Preview: Mixture-of-Experts Model with 262K Context and Configurable Reasoning

Tencent has released Hy3 preview, a Mixture-of-Experts model with a 262,144 token context window priced at $0.066 per million input tokens and $0.26 per million output tokens. The model features three configurable reasoning modes—disabled, low, and high—designed for agentic workflows and production environments.

model releaseMay 7, 2026

Google releases Gemini 3.1 Flash Lite with 1M context at $0.25 per million input tokens

Google has released Gemini 3.1 Flash Lite, a high-efficiency multimodal model with a 1,048,576 token context window priced at $0.25 per million input tokens and $1.50 per million output tokens. The model supports text, image, video, audio, and PDF inputs with four thinking levels for cost-performance optimization.

model releaseMay 6, 2026

IBM Releases Granite Embedding 311M R2 With 32K Context, 200+ Language Support

IBM released Granite Embedding 311M Multilingual R2, a 311-million parameter dense embedding model with 32,768-token context length and support for 200+ languages. The model scores 64.0 on Multilingual MTEB Retrieval (18 tasks), an 11.8-point improvement over its predecessor, and ships with ONNX and OpenVINO models for production deployment.

model releaseMay 1, 2026

IBM Releases Granite 4.1 30B With 131K Context Window and Enhanced Tool-Calling

IBM released Granite 4.1 30B, a 30-billion parameter instruction-following model with a 131,072 token context window. The model scores 80.16 on MMLU 5-shot and 88.41 on HumanEval pass@1, with enhanced tool-calling capabilities following OpenAI's function definition schema.