LLM
3 articles tagged with LLM
IBM's Granite 4.1: 8B Dense Model Matches 32B MoE Performance on 15T Tokens
IBM released Granite 4.1, a family of dense decoder-only LLMs (3B, 8B, 30B parameters) trained on approximately 15 trillion tokens using a five-phase pre-training pipeline. The 8B instruct model matches or surpasses the previous Granite 4.0-H-Small (32B-A9B MoE) despite using fewer parameters and a simpler dense architecture. All models support up to 512K context windows and are released under Apache 2.0 license.
DeepSeek releases V4 model preview with agent optimization, pricing undisclosed
DeepSeek released a preview of its V4 large language model on April 24, 2026, available in 'pro' and 'flash' versions. The Hangzhou-based company claims the open-source model achieves strong performance on agent-based tasks and has been optimized for tools like Anthropic's Claude Code and OpenClaw.
OpenRouter Releases Elephant Alpha: 100B-Parameter Model with 256K Context Window and Free Pricing
OpenRouter has released Elephant Alpha, a 100B-parameter text model with a 256K context window and 32K output token limit. The model is available at no cost through OpenRouter's platform, supporting function calling, structured output, and prompt caching.