model release

Meta launches Muse Spark model with private API preview and 16 integrated tools

TL;DR

Meta announced Muse Spark today, its first model release since Llama 4 a year ago. The hosted model is available in private API preview and on meta.ai with Instant and Thinking modes, benchmarking competitively against Anthropic's Opus 4.6 and Google's Gemini 3.1 Pro, though behind on Terminal-Bench 2.0.

3 min read
0

Meta Launches Muse Spark: First Major Model Since Llama 4

Meta announced Muse Spark today, marking its first significant model release since Llama 4 in April 2025. The model is available as a private API preview to select users and on meta.ai's chat interface, which requires Facebook or Instagram login.

Performance and Positioning

Meta's self-reported benchmarks show Muse Spark performing at parity with Anthropic's Opus 4.6, Google's Gemini 3.1 Pro, and OpenAI's GPT 5.4 on selected metrics. However, the model notably underperforms on Terminal-Bench 2.0. Meta explicitly acknowledges performance gaps in long-horizon agentic systems and coding workflows, stating they "continue to invest in areas with current performance gaps."

Deployment Modes

Muse Spark is accessible through two distinct modes on meta.ai:

Instant Mode: Faster inference with direct SVG/HTML output capabilities. Testing showed this mode generates SVG code directly with embedded comments.

Thinking Mode: Extended reasoning with wrapped output and enhanced problem-solving. Early benchmarking demonstrates notably better performance on complex tasks compared to Instant mode.

Meta plans a future Contemplating Mode offering significantly longer reasoning times, comparable to Google's Gemini Deep Think and OpenAI's GPT-5.4 Pro.

16 Integrated Tools Exposed

Meta's chat interface includes 16 publicly accessible tools. Notably, Meta did not restrict disclosure of these tools, allowing direct enumeration without workarounds.

Web Capabilities:

  • browser.search: Web search through undisclosed engine
  • browser.open: Full page retrieval from search results
  • browser.find: Pattern matching on page content

Meta Platform Integration:

  • meta_1p.content_search: Semantic search across Instagram, Threads, and Facebook posts (user-accessible content only, post-2025-01-01)
  • meta_1p.meta_catalog_search: Product catalog search
  • container.download_meta_1p_media: Pull media from Meta platforms into sandbox

Code Execution and Artifacts:

  • container.python_execution: Code Interpreter running Python 3.9.25 with pandas, numpy, matplotlib, plotly, scikit-learn, PyMuPDF, Pillow, and OpenCV. Note: Python 3.9 reached end-of-life status in October 2024.
  • container.create_web_artifact: HTML/JavaScript sandboxed rendering (Claude Artifacts-style)
  • container.visual_grounding: Image analysis with object detection, bounding box generation, point localization, and object counting

File Operations:

  • container.view, container.insert, container.str_replace: Text editor commands mirroring Claude's implementation
  • container.file_search: Semantic search across uploaded files

Media and Agents:

  • media.image_gen: Image generation with "artistic" and "realistic" modes; returns "square", "vertical", or "landscape" formats
  • subagents.spawn_agent: Sub-agent spawning for delegated research and analysis
  • third_party.link_third_party_account: Account linking for Google Calendar, Outlook Calendar, Gmail, Outlook

Technical Observations

Direct testing revealed Python 3.9.25 and SQLite 3.34.1 (January 2021 release) running in the sandbox environment. The Thinking mode's tendency to wrap SVG in HTML shells with Playables SDK v1.0.0 JavaScript libraries suggests integration with Meta's interactive content framework.

What This Means

Muse Spark positions Meta as a credible competitor in the hosted LLM space, though the private preview status limits immediate adoption. The 16-tool architecture demonstrates Meta's integration strategy: leveraging Facebook, Instagram, and Threads data access as competitive moat while matching feature parity with Claude and ChatGPT. The acknowledged gaps in agentic systems and coding—combined with Python 3.9 EOL tooling—suggest this release prioritizes chat functionality over agent-grade reliability. The planned Contemplating mode indicates Meta is responding directly to Deep Research and Pro reasoning mode adoption patterns.

Related Articles

model release

Meta launches Muse Spark, its first frontier model and first closed-weight AI system

Meta Superintelligence Labs has launched Muse Spark, a native multimodal reasoning model that scores 52 on the Artificial Analysis Intelligence Index, placing it in the top 5 frontier models. This marks Meta's first frontier-class model and its first AI system without open weights, representing a strategic shift from its open-source Llama strategy. The model achieves comparable efficiency to Gemini 3.1 Pro while matching Llama 4 Maverick capabilities with over an order of magnitude less compute.

model release

Meta launches Muse Spark, its first model from revamped AI labs

Meta Superintelligence Labs has launched Muse Spark, its first model since Mark Zuckerberg restructured the company's AI division. The multimodal model now powers Meta AI's app and website in the US, with rollout planned for WhatsApp, Instagram, Facebook, Messenger, and Meta's smart glasses in coming weeks.

model release

Meta releases Muse Spark, first closed-source model from Meta Superintelligence Labs

Meta has released Muse Spark, the first model from Meta Superintelligence Labs, a unit assembled under chief AI officer Alexandr Wang following Meta's $14.3 billion investment in Scale AI. The natively multimodal model features a "Contemplating" reasoning mode with parallel sub-agents and marks Meta's break from its open-source Llama heritage by operating as closed source.

model release

Meta replaces Llama with Muse Spark AI, launches Contemplating mode for complex reasoning

Meta has discontinued its Llama model line and launched Muse Spark as the foundation of its new AI strategy under Meta Superintelligence Labs. The model features a Contemplating mode for complex reasoning tasks and specializes in multimodal perception, health applications, and agentic tasks. Muse Spark is available today in Meta AI apps, with a private API preview for select partners.

Comments

Loading...