ai-infrastructure
9 articles tagged with ai-infrastructure
Meta launches proprietary Muse Spark, abandoning open-source strategy after $14.3B rebuild
Meta launched Muse Spark on April 8, 2026, a natively multimodal reasoning model with tool-use and visual chain-of-thought capabilities. Unlike Llama, it is entirely proprietary with no open weights. The model scores 52 on AI Index v4.0 and excels on health benchmarks but represents Meta's departure from its open-source identity.
Google's TurboQuant compresses AI memory use by 6x, but won't ease DRAM shortage
Google has unveiled TurboQuant, a KV cache quantization technology that claims to reduce memory consumption during AI inference by up to 6x by compressing data from 16-bit precision to as low as 2.5 bits. While the compression technique delivers meaningful efficiency gains for inference providers, it is unlikely to resolve the DRAM shortage that has driven memory prices to record highs, as expanding context windows offset memory savings.
Multiverse Computing launches API portal for compressed AI models to reduce cloud dependence
Multiverse Computing, a Spanish startup, has launched a self-serve API portal giving developers direct access to compressed versions of models from OpenAI, Meta, DeepSeek, and Mistral AI. The move targets enterprises seeking to reduce cloud infrastructure dependence and lower compute costs through edge deployment. The company claims its HyperNova 60B 2602 model delivers faster responses at lower cost than the original OpenAI model it was derived from.
Meta acquires Moltbook, social network for AI agents, hires founders into Superintelligence Labs
Meta has acquired Moltbook, a social network designed for AI agents, bringing founders Matt Schlicht and Ben Parr into Meta Superintelligence Labs under former Scale AI CEO Alexandr Wang. The move positions Meta alongside OpenAI's OpenClaw in acquiring AI agent platforms.
Nvidia-backed Nscale raises $2B, hits $14.6B valuation with Sandberg and Clegg joining board
Nvidia-backed British AI infrastructure startup Nscale has raised $2 billion in a new funding round, bringing its valuation to $14.6 billion. The round marks a significant milestone for the infrastructure-focused startup, with Meta's former COO Sheryl Sandberg and Meta's former VP of Global Affairs Nick Clegg joining the board.
OpenAI building GitHub competitor to challenge Microsoft's code platform
OpenAI is building an internal alternative to GitHub, Microsoft's dominant code collaboration platform, according to The Information. The move marks OpenAI's entry into developer tools and could create tension with its largest investor, Microsoft.
Apple asks Google to host servers for Gemini-powered Siri upgrade
Apple has asked Google to set up servers specifically for hosting a new Gemini-powered version of Siri that meets Apple's privacy requirements, according to The Information. This represents a deeper infrastructure partnership beyond the January announcement that Google's Gemini models would power upgraded Siri features.
Nvidia invests $4 billion in photonics companies Lumentum and Coherent
Nvidia announced Monday it is investing $2 billion each into photonics companies Lumentum and Coherent to develop optical transceivers, circuit switches, and lasers for next-generation AI data centers. The technology aims to improve energy efficiency, data transfer speeds, and bandwidth in data center infrastructure.
G42 and Cerebras to deploy 8 exaflops of compute infrastructure in India
Abu Dhabi-based G42 has partnered with U.S. chipmaker Cerebras to deploy 8 exaflops of computational capacity through a new system in India. The partnership represents a significant infrastructure expansion for AI training and inference workloads in South Asia.