local-inference

3 articles tagged with local-inference

June 9, 2026
model releaseGoogle DeepMind

Google DeepMind releases Gemma 4 12B: encoder-free multimodal model runs on 16GB RAM

Google DeepMind has released Gemma 4 12B, a 12-billion parameter multimodal model that runs locally on laptops with 16GB of RAM. The model eliminates separate vision and audio encoders, processing raw inputs directly through its language model backbone under an Apache 2.0 license.

June 2, 2026
model release

H Company Ships Holo3.1 with Local Inference, Mobile Support, and 79.3% AndroidWorld Score

H Company released Holo3.1, a computer-use agent model family ranging from 0.8B to 35B parameters. The 35B-A3B variant scores 79.3% on AndroidWorld, up from 67% in Holo3. For the first time, H Company ships quantized checkpoints (FP8, Q4 GGUF, NVFP4) enabling local inference with 1.74× throughput gains and sub-4-second agent step times.

March 18, 2026
product update

Meta's Manus launches desktop app enabling AI agents to access local files and applications

Meta's recently acquired AI startup Manus launched a desktop application enabling its AI agent to directly access local files, tools, and applications on personal computers through a 'My Computer' feature. Previously cloud-only, the move positions Manus to compete with OpenClaw, the open-source AI agent that sparked recent industry momentum. Unlike OpenClaw's free, MIT-licensed offering, Manus operates as a paid subscription service.