large-language-model
2 articles tagged with large-language-model
April 2, 2026
model release
Alibaba releases Qwen 3.6 Plus with 1M context window, free tier now available
Alibaba's Qwen division released Qwen 3.6 Plus on April 2, 2026, offering free access to a model with a 1,000,000-token context window. The model combines linear attention with sparse mixture-of-experts routing and scores 78.8 on SWE-bench Verified for software engineering tasks.
March 23, 2026
Nvidia releases Nemotron 3 Super: 120B MoE model with 1M token context
Nvidia has released Nemotron 3 Super, a 120-billion-parameter hybrid Mamba-Transformer Mixture-of-Experts model that activates only 12 billion parameters during inference. The open-weight model features a 1-million-token context window and multi-token prediction, and is priced at $0.10 per million input tokens and $0.50 per million output tokens.
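To make the listed rates concrete, here is a minimal sketch of how a request's cost works out at that pricing. The token counts in the example are hypothetical, chosen only for illustration; the two rates are the ones stated in the announcement.

```python
# Per-token pricing from the Nemotron 3 Super announcement.
INPUT_PRICE_PER_M = 0.10   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 0.50  # USD per 1M output tokens

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the listed rates."""
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_M \
         + (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M

# Hypothetical request: a 200k-token prompt with a 4k-token completion.
print(round(request_cost(200_000, 4_000), 4))  # 0.022
```

So even a prompt filling a fifth of the context window costs only a few cents, since output tokens, though 5x more expensive, are typically a small fraction of the total.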