model release

Meta launches Muse Spark model with private API preview and 16 integrated tools

TL;DR

Meta announced Muse Spark today, its first model release since Llama 4 a year ago. The hosted model is available in private API preview and on meta.ai with Instant and Thinking modes, benchmarking competitively against Anthropic's Opus 4.6 and Google's Gemini 3.1 Pro, though behind on Terminal-Bench 2.0.

April 8, 2026 · 11:21 PM3 min read

Muse Spark — Quick Specs

Compare Muse Spark with other models →

Meta Launches Muse Spark: First Major Model Since Llama 4

Meta announced Muse Spark today, marking its first significant model release since Llama 4 in April 2025. The model is available as a private API preview to select users and on meta.ai's chat interface, which requires Facebook or Instagram login.

Performance and Positioning

Meta's self-reported benchmarks show Muse Spark performing at parity with Anthropic's Opus 4.6, Google's Gemini 3.1 Pro, and OpenAI's GPT 5.4 on selected metrics. However, the model notably underperforms on Terminal-Bench 2.0. Meta explicitly acknowledges performance gaps in long-horizon agentic systems and coding workflows, stating they "continue to invest in areas with current performance gaps."

Deployment Modes

Muse Spark is accessible through two distinct modes on meta.ai:

Instant Mode: Faster inference with direct SVG/HTML output capabilities. Testing showed this mode generates SVG code directly with embedded comments.

Thinking Mode: Extended reasoning with wrapped output and enhanced problem-solving. Early benchmarking demonstrates notably better performance on complex tasks compared to Instant mode.

Meta plans a future Contemplating Mode offering significantly longer reasoning times, comparable to Google's Gemini Deep Think and OpenAI's GPT-5.4 Pro.

16 Integrated Tools Exposed

Meta's chat interface includes 16 publicly accessible tools. Notably, Meta did not restrict disclosure of these tools, allowing direct enumeration without workarounds.

Web Capabilities:

browser.search: Web search through undisclosed engine
browser.open: Full page retrieval from search results
browser.find: Pattern matching on page content

Meta Platform Integration:

meta_1p.content_search: Semantic search across Instagram, Threads, and Facebook posts (user-accessible content only, post-2025-01-01)
meta_1p.meta_catalog_search: Product catalog search
container.download_meta_1p_media: Pull media from Meta platforms into sandbox

Code Execution and Artifacts:

container.python_execution: Code Interpreter running Python 3.9.25 with pandas, numpy, matplotlib, plotly, scikit-learn, PyMuPDF, Pillow, and OpenCV. Note: Python 3.9 reached end-of-life status in October 2024.
container.create_web_artifact: HTML/JavaScript sandboxed rendering (Claude Artifacts-style)
container.visual_grounding: Image analysis with object detection, bounding box generation, point localization, and object counting

File Operations:

container.view, container.insert, container.str_replace: Text editor commands mirroring Claude's implementation
container.file_search: Semantic search across uploaded files

Media and Agents:

media.image_gen: Image generation with "artistic" and "realistic" modes; returns "square", "vertical", or "landscape" formats
subagents.spawn_agent: Sub-agent spawning for delegated research and analysis
third_party.link_third_party_account: Account linking for Google Calendar, Outlook Calendar, Gmail, Outlook

Technical Observations

Direct testing revealed Python 3.9.25 and SQLite 3.34.1 (January 2021 release) running in the sandbox environment. The Thinking mode's tendency to wrap SVG in HTML shells with Playables SDK v1.0.0 JavaScript libraries suggests integration with Meta's interactive content framework.

What This Means

Muse Spark positions Meta as a credible competitor in the hosted LLM space, though the private preview status limits immediate adoption. The 16-tool architecture demonstrates Meta's integration strategy: leveraging Facebook, Instagram, and Threads data access as competitive moat while matching feature parity with Claude and ChatGPT. The acknowledged gaps in agentic systems and coding—combined with Python 3.9 EOL tooling—suggest this release prioritizes chat functionality over agent-grade reliability. The planned Contemplating mode indicates Meta is responding directly to Deep Research and Pro reasoning mode adoption patterns.

Source: simonwillison.net ↗

meta-ai muse-spark model-release llm reasoning code-interpreter hosted-api multimodal

model releaseJuly 8, 2026

Poolside releases Laguna XS 2.1: 33B parameter MoE coding model with 262K context window

Poolside has released Laguna XS 2.1, a 33B total parameter Mixture-of-Experts model with 3B activated parameters per token and a 262,144-token context window. The model achieves 70.9% on SWE-bench Verified and 63.1% on SWE-bench Multilingual, representing a 5.4% improvement over its predecessor on multilingual coding tasks.

model releaseJuly 6, 2026

Nex AGI releases Nex-N2-Mini: open-source agentic MoE model with 262K context window

Nex AGI has released Nex-N2-Mini, an open-source agentic mixture-of-experts model with a 262K-token context window. The model accepts text and image inputs and is priced at $0.025 per 1M input tokens and $0.10 per 1M output tokens.

model releaseJuly 6, 2026

Tencent Releases Hy3: 295B MoE Model with 256K Context and Configurable Reasoning Modes

Tencent has released Hy3, a 295-billion parameter Mixture-of-Experts model with 21 billion active parameters and a 256,000-token context window. The model features configurable reasoning modes and is available free through OpenRouter, with deployment ending July 21, 2026.

model releaseJuly 6, 2026

Tencent Releases Hy3: 295B-Parameter MoE Model with 21B Active Parameters at 256K Context

Tencent has released Hy3, a 295-billion parameter Mixture-of-Experts model with 21 billion active parameters and 3.8 billion MTP layer parameters. The model features a 256K context window and is released under Apache 2.0 license, with pricing not yet disclosed.

Meta launches Muse Spark model with private API preview and 16 integrated tools

Muse Spark — Quick Specs

Meta Launches Muse Spark: First Major Model Since Llama 4

Performance and Positioning

Deployment Modes

16 Integrated Tools Exposed

Technical Observations

What This Means

Related Articles

Poolside releases Laguna XS 2.1: 33B parameter MoE coding model with 262K context window

Nex AGI releases Nex-N2-Mini: open-source agentic MoE model with 262K context window

Tencent Releases Hy3: 295B MoE Model with 256K Context and Configurable Reasoning Modes

Tencent Releases Hy3: 295B-Parameter MoE Model with 21B Active Parameters at 256K Context

Comments