model release

DeepReinforce Releases Ornith-1.0, Open-Source Agentic Coding Model in 9B to 397B Sizes

TL;DR

DeepReinforce has released Ornith-1.0, an MIT-licensed model designed for agentic coding tasks with variants ranging from 9B to 397B parameters. Built on top of Apache 2.0-licensed Gemma 4 and Qwen 3.5 base models, the company claims it achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks.

2 min read
0

DeepReinforce Releases Ornith-1.0, Open-Source Agentic Coding Model in 9B to 397B Sizes

DeepReinforce has released Ornith-1.0, an MIT-licensed model designed for agentic coding tasks with variants ranging from 9B to 397B parameters. The model is available in four configurations: 9B Dense, 31B Dense, 35B MoE (Mixture of Experts), and 397B MoE.

Technical Foundation

Ornith-1.0 is built on top of pretrained Gemma 4 and Qwen 3.5 base models, both of which carry Apache 2.0 licenses. This represents a notable shift from earlier Gemma models, which were encumbered by additional Terms of Use restrictions. The Apache 2.0 licensing on both base models makes the derivative work's MIT license legally compatible.

DeepReinforce claims the model achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks, though specific benchmark scores have not been disclosed.

Deployment and Performance

The model is available as GGUF quantized weights on Hugging Face. The 35B variant in Q4_K_M quantization weighs 20GB and can run locally via LM Studio. Early testing shows the model handles multi-step agentic workflows competently.

In documented testing against a Datasette codebase, Ornith-1.0 successfully navigated complex code search queries including "find the code that decodes the actor cookie" and "find the code that opens the insert dialog when the button is clicked." The model demonstrated the ability to chain multiple tool calls to complete these tasks.

Performance testing on consumer hardware shows the 35B quantized model running at 103 tokens per second for generation tasks.

Company Background

DeepReinforce appears to be a new entrant in the AI model space. The earliest available research from the organization is a June 2025 paper titled "CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning." No prior model releases from DeepReinforce have been documented.

Availability

All Ornith-1.0 variants are available immediately with open weights under the MIT license. GGUF quantized versions are distributed through Hugging Face for local deployment.

What This Means

Ornith-1.0 represents a significant addition to open-source agentic coding models, particularly for developers seeking local deployment options. The MIT licensing and compatibility with existing Apache 2.0 base models removes common licensing friction. However, without disclosed benchmark scores or extensive independent testing, claims of state-of-the-art performance among comparable models remain unverified. The availability of multiple size variants, including the computationally efficient MoE architectures, provides deployment flexibility across different hardware constraints.

Related Articles

model release

DeepSeek Releases V4-Pro with 1.6T Parameters, 1M Token Context at 27% Inference Cost of V3

DeepSeek has released two Mixture-of-Experts models: V4-Pro with 1.6 trillion parameters (49B activated) and V4-Flash with 284B parameters (13B activated), both supporting 1 million token context windows. V4-Pro requires only 27% of inference FLOPs and 10% of KV cache compared to V3.2 at 1M token context, trained on over 32 trillion tokens.

model release

DeepSeek Releases V4 Models: 1M Context Window, 90% Less KV Cache Than V3

DeepSeek has released two new MoE models: DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated). Both models support a one million token context window and use a hybrid attention architecture that requires only 27% of single-token inference FLOPs and 10% of KV cache compared to DeepSeek-V3.2.

model release

China's Z.ai releases GLM-5.2, open-source model matching Claude and GPT-5.5 in cybersecurity tasks

Z.ai's GLM-5.2 performs on par with Claude Opus 4.8 and OpenAI's GPT-5.5 in cybersecurity benchmarks while costing roughly half as much to run. Security evaluations from Graphistry and Semgrep confirm the open-weight model's capabilities in vulnerability discovery and cyber investigation, raising concerns about accessibility of advanced hacking tools.

model release

Alibaba Qwen Releases 35B Language World Model for Agent Environment Simulation Across 7 Domains

Alibaba's Qwen team released Qwen-AgentWorld-35B-A3B, a 35 billion parameter language world model designed for agentic environment simulation. The model covers seven domains—MCP tool calling, Search, Terminal, Software Engineering, Android, Web, and OS—in a single model with a 262,144 token context window.

Comments

Loading...