Poolside releases Laguna XS.2, free fp8-quantized coding agent with 128K context
Poolside has released Laguna XS.2, the second-generation model in its XS size class for agentic coding workflows. The model offers 128K context window, up to 8K output tokens, and is quantized to fp8 for efficiency, available free via OpenRouter.
Poolside releases Laguna XS.2, free fp8-quantized coding agent with 128K context
Poolside has released Laguna XS.2, the second-generation model in its XS size class designed for agentic coding workflows. The model is available free on OpenRouter as of April 28, 2025.
Technical specifications
Laguna XS.2 offers a 131,072-token context window (128K) with up to 8K output tokens. The model is quantized to fp8 precision, optimizing for speed and cost efficiency in production environments.
According to Poolside, the model combines tool calling and reasoning capabilities within a compact footprint. The company describes it as part of their "efficient coding agent series."
Pricing and availability
The model is available at zero cost through OpenRouter:
- Input: $0 per million tokens
- Output: $0 per million tokens
OpenRouter routes requests across multiple providers with automatic fallbacks for uptime optimization.
Reasoning capabilities
Laguna XS.2 supports OpenRouter's reasoning parameter, allowing developers to access step-by-step thinking processes through the reasoning_details array in API responses. The model can preserve reasoning context across conversation turns when the complete reasoning_details are passed back in subsequent requests.
What this means
Poolside is positioning itself in the increasingly competitive coding agent market with a free, quantized model that prioritizes deployment efficiency over raw capability. The fp8 quantization represents a pragmatic trade-off—reduced precision for faster inference—targeting production workflows where cost and latency matter more than maximum accuracy. At 128K context, Laguna XS.2 can handle substantial codebases, though it remains to be seen how the XS size class compares to larger coding models like Claude 3.5 Sonnet or GPT-4 on complex refactoring tasks. The free tier may be a customer acquisition strategy, with Poolside likely planning premium tiers or enterprise offerings.
Related Articles
Poolside Launches Laguna M.1, Free-Tier Coding Agent Model with 128K Context Window
Poolside has released Laguna M.1, its flagship coding agent model available for free on OpenRouter. The model features a 128K context window, up to 8K output tokens, and is optimized for agentic coding workflows with tool calling and reasoning capabilities.
Nvidia releases Nemotron 3 Nano Omni: 30B-parameter multimodal model with 256K context, free on OpenRouter
Nvidia has released Nemotron 3 Nano Omni, a 30-billion-parameter multimodal model available free on OpenRouter. The model features a 256,000-token context window, accepts text, image, video, and audio inputs, and claims 2× higher throughput for video reasoning compared to separate vision and speech pipelines.
OpenAI Launches GPT Mini Latest with 400,000 Token Context Window
OpenAI released GPT Mini Latest on April 27, 2025, featuring a 400,000 token context window. The model automatically redirects to the latest version in the OpenAI GPT Mini family, allowing developers to stay current without manual updates.
Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window
Moonshot AI released Kimi Latest, a router endpoint that automatically redirects to the most recent model in the Kimi family. The model features a 262,144 token context window, though specific pricing and performance benchmarks have not been disclosed.
Comments
Loading...