large-language-model
3 articles tagged with large-language-model
Meta launches Muse Spark, proprietary AI model built by Wang's Superintelligence Labs
Meta announced Muse Spark, its first major large language model since it brought on Scale AI's Alexandr Wang nine months ago in a $14.3 billion deal. The proprietary model emphasizes efficiency and multimodal reasoning over top-tier performance, marking a strategic shift from Meta's previous open-source Llama approach. Muse Spark will power Meta's AI assistant across Facebook, Instagram, WhatsApp, Messenger, and Ray-Ban glasses starting in the coming weeks.
Alibaba releases Qwen 3.6 Plus with 1M context window, free tier now available
Alibaba's Qwen division released Qwen 3.6 Plus on April 2, 2026, offering free access to a model with a 1,000,000-token context window. The model combines linear attention with sparse mixture-of-experts routing and scores 78.8 on SWE-bench Verified, a benchmark of software engineering tasks.
Nvidia releases Nemotron 3 Super: 120B MoE model with 1M token context
Nvidia has released Nemotron 3 Super, a 120-billion-parameter hybrid Mamba-Transformer Mixture-of-Experts model that activates only 12 billion parameters during inference. The open-weight model features a 1-million-token context window and multi-token prediction, and is priced at $0.10 per million input tokens and $0.50 per million output tokens.
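To put the quoted Nemotron 3 Super figures in perspective, the following is a minimal sketch of the arithmetic. It uses only the numbers from the summary above; the function names are hypothetical, and the calculation assumes flat per-token billing with no caching discounts, minimum charges, or tiering.

```python
# Rates quoted in the summary above (USD per one million tokens).
INPUT_USD_PER_MILLION = 0.10
OUTPUT_USD_PER_MILLION = 0.50

# Parameter counts quoted in the summary above.
TOTAL_PARAMS = 120e9
ACTIVE_PARAMS = 12e9


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming flat per-token billing."""
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


def active_fraction() -> float:
    """Share of the model's parameters activated during inference."""
    return ACTIVE_PARAMS / TOTAL_PARAMS


if __name__ == "__main__":
    # Hypothetical request: a prompt that fills the 1M-token context
    # window, followed by a 10,000-token response.
    cost = estimate_cost_usd(1_000_000, 10_000)
    print(f"Estimated cost: ${cost:.3f}")
    print(f"Active parameters: {active_fraction():.0%} of total")
```

Under these assumptions, a prompt that fills the entire context window costs about ten cents in input tokens, and only a tenth of the model's parameters are active during inference.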