Long Context

2 articles tagged with Long Context

July 8, 2026

model releaseDeepSeek

DeepSeek Releases V4-Flash: 284B Parameter MoE Model with 1M Context Window at Q8 162GB

Unsloth has released optimized GGUF quantizations of DeepSeek-V4-Flash, a 284B parameter Mixture-of-Experts model that activates 13B parameters and supports 1 million token context windows. The Q8 quantization (UD-Q8_K_XL) runs at 162GB with claimed lossless precision, only 7GB larger than the Q4 variant.

July 8, 2026 · 2:51 PM

April 27, 2026

model release

Alibaba Qwen Releases 35B Sparse MoE Model with 262K Context and Multimodal Support

Alibaba Cloud has released Qwen3.6-35B-A3B, an open-weight sparse mixture-of-experts model with 35 billion total parameters but only 3 billion active parameters per token. The model features a 262K native context window (expandable to 1M tokens), multimodal input support, and integrated reasoning mode with preserved thinking traces.

April 27, 2026 · 3:51 AM

← Back to all news