model release

Meta releases Muse Spark, first closed-source model from Meta Superintelligence Labs

TL;DR

Meta has released Muse Spark, the first model from Meta Superintelligence Labs, a unit assembled under chief AI officer Alexandr Wang following Meta's $14.3 billion investment in Scale AI. The natively multimodal model features a "Contemplating" reasoning mode with parallel sub-agents and marks Meta's break from its open-source Llama heritage by operating as closed source.

3 min read
0

Meta Releases Muse Spark, First Model From Meta Superintelligence Labs

Meta has released Muse Spark, the first model from Meta Superintelligence Labs (MSL), marking a strategic departure from the company's open-source Llama lineage. The model arrives nine months after Meta established the unit under Alexandr Wang, chief AI officer, following the company's $14.3 billion investment for a 49% stake in Scale AI.

Architecture and Capabilities

Muse Spark is natively multimodal, accepting voice, text, and image inputs with text-only output at launch. The model operates in two modes: a fast mode for standard queries and a new "Contemplating" mode that orchestrates multiple sub-agents to reason in parallel, directly competing with Google's Gemini Deep Think and OpenAI's GPT-4o Pro extended reasoning.

Meta claims the model achieves its reasoning capability using more than ten times less compute than Llama 4 Maverick through a training technique called "thought compression," which penalizes the model during reinforcement learning for excessive thinking time, forcing efficient problem-solving with fewer reasoning tokens.

Benchmark Performance

Muse Spark ranks fourth on Artificial Analysis Intelligence Index v4.0 with a score of 52, trailing Gemini 3.1 Pro Preview and GPT-5.4 (both 57) and Claude Opus 4.6 (53). Performance is mixed across specific benchmarks:

  • GPQA Diamond (graduate-level scientific reasoning): 89.5%, behind Gemini 3.1 Pro (94.3%), GPT-5.4 (92.8%), and Claude Opus 4.6 (92.7%)
  • ARC AGI 2 (abstract reasoning): 42.5 in Contemplating mode, significantly behind Gemini 3.1 Pro (76.5) and GPT-5.4 (76.1)
  • SWE-bench Verified (software engineering): 77.4%
  • CharXiv Reasoning (chart/figure understanding): 86.4, ahead of Gemini 3.1 Pro (80.2) and GPT-5.4 (82.8)
  • HealthBench Hard (medical reasoning): 42.8%, compared to Claude Opus 4.6 (14.8%) and GPT-5.4 (40.1%)

Competitive Differentiation Strategy

Meta's positioning emphasizes domain-specific advantages built on platform data. The model was trained with more than 1,000 physicians and includes dedicated shopping mode leveraging creator content and user interest signals within Meta's ecosystem. CEO Mark Zuckerberg described Muse Spark as strong in "visual understanding, health, social content, shopping, games, and more" — areas where platform data provides genuine competitive advantage.

Muse Spark currently powers Meta AI across the company's Meta AI app and Meta.ai website, with expansion planned to Facebook, Instagram, and WhatsApp.

The Closed-Source Break

The most significant detail is Muse Spark's closed-source status, marking a reversal from Meta's Llama series, which established the foundation for thousands of open-source applications and research projects. Meta has indicated this closure is temporary, framing future versions as potentially open source. However, the decision signals that Meta now prioritizes competitive advantage from proprietary architectural innovations over ecosystem development during a critical capability gap phase.

The shift reflects the stakes of the reorganization: nine months of complete infrastructure rebuild from data pipelines to training architecture, rebuilding the entire AI stack rather than iterating on existing systems. The model was internally known as Avocado and had been delayed earlier in 2026 after falling short in internal testing for reasoning, coding, and writing.

What This Means

Meta's Muse Spark represents a calculated pivot toward vertical integration and platform-specific competitive advantage. The closed-source model signals management's confidence in both the rebuild's progress and the company's ability to leverage proprietary data advantages. Mixed benchmarks and the significant gap on abstract reasoning tasks suggest the model remains in early competitive phases, but the domain-specific strength in medical reasoning and chart understanding points toward Meta's differentiation strategy: general reasoning capability combined with platform data advantages that competitors cannot replicate. The health benchmark performance particularly validates the physician collaboration approach. Whether this architecture sustains against frontier competitors remains uncertain, but Meta's infrastructure investment and closed-source stance indicate management views this as a multi-year competition requiring sustained proprietary advantage.

Related Articles

model release

Meta launches Muse Spark, its first frontier model and first closed-weight AI system

Meta Superintelligence Labs has launched Muse Spark, a native multimodal reasoning model that scores 52 on the Artificial Analysis Intelligence Index, placing it in the top 5 frontier models. This marks Meta's first frontier-class model and its first AI system without open weights, representing a strategic shift from its open-source Llama strategy. The model achieves comparable efficiency to Gemini 3.1 Pro while matching Llama 4 Maverick capabilities with over an order of magnitude less compute.

model release

Meta launches Muse Spark, its first model from revamped AI labs

Meta Superintelligence Labs has launched Muse Spark, its first model since Mark Zuckerberg restructured the company's AI division. The multimodal model now powers Meta AI's app and website in the US, with rollout planned for WhatsApp, Instagram, Facebook, Messenger, and Meta's smart glasses in coming weeks.

model release

Google DeepMind releases Gemma 4 with four model sizes, up to 256K context, multimodal support

Google DeepMind released Gemma 4, an open-weights multimodal model family in four sizes (2.3B to 31B parameters) with context windows up to 256K tokens. All models support text and image input, with audio native to E2B and E4B variants. The Gemma 4 31B dense model scores 85.2% on MMLU Pro, 89.2% on AIME 2026, and 80.0% on LiveCodeBench—significant improvements over Gemma 3.

model release

Meta launches Muse Spark, proprietary AI model built by Wang's Superintelligence Labs

Meta announced Muse Spark, its first major large language model since hiring Scale AI's Alexandr Wang nine months ago for a $14.3 billion deal. The proprietary model emphasizes efficiency and multimodal reasoning over top-tier performance, marking a strategic shift from Meta's previous open-source Llama approach. Muse Spark will power Meta's AI assistant across Facebook, Instagram, WhatsApp, Messenger, and Ray-Ban glasses starting in coming weeks.

Comments

Loading...