LLM News

Every LLM release, update, and milestone.

0
model release

Alibaba releases Qwen3.5-35B-A3B-FP8, a quantized multimodal model for efficient deployment

Alibaba's Qwen team released Qwen3.5-35B-A3B-FP8 on Hugging Face, a quantized version of their 35-billion parameter multimodal model. The FP8 quantization reduces model size and memory requirements while maintaining the base model's image-text-to-text capabilities. The model is compatible with standard Transformers endpoints and Azure deployment.

1 min readvia huggingface.co
0
product update

Meta signs multi-billion dollar TPU rental deal with Google, challenging Nvidia's chip dominance

Meta has signed a multi-billion dollar deal to rent Google's TPU (Tensor Processing Unit) chips for training its AI models, marking a significant shift away from Nvidia's dominance in AI infrastructure. The arrangement provides Meta with alternative compute capacity while signaling growing competition in the specialized AI chip market.

2 min readvia the-decoder.com
0
model releaseGoogle DeepMind

Google DeepMind releases Nano Banana 2 image model with Pro-level capabilities at faster speeds

Google DeepMind has released Nano Banana 2, an image generation model that combines advanced world knowledge and subject consistency with faster inference speeds comparable to its Flash offering. The model is positioned as production-ready with capabilities previously associated with Pro-tier performance.

0
product update

Google reveals AppFunctions: Gemini's MCP-equivalent for controlling Android apps

Google has detailed AppFunctions, a system that allows Gemini to directly control and interact with Android applications, functioning similarly to Anthropic's Model Context Protocol (MCP). The capability enables AI agents to automate tasks across the Android ecosystem by providing structured access to app functionality.

2 min readvia 9to5google.com