Breaking

AWS releases Nova Forge SDK data mixing guide to preserve general capabilities during fine-tuning

Amazon Web Services published a practical guide for fine-tuning Amazon Nova models using the Nova Forge SDK's data mixing capabilities. According to AWS, blending customer data with Amazon-curated datasets preserved near-baseline MMLU scores while delivering a 12-point F1 improvement on a Voice of Customer classification task spanning 1,420 leaf categories.

April 17, 2026

Latest News

model release · NVIDIA

NVIDIA Releases GR00T N1.7, 3B-Parameter Open-Source Humanoid Robot Model Trained on 20,854 Hours of Human Video

NVIDIA released GR00T N1.7, a 3-billion-parameter open-source Vision-Language-Action model for humanoid robots with commercial licensing. The model was trained on 20,854 hours of human egocentric video and demonstrates the first documented scaling law for robot dexterity: increasing human video data from 1,000 to 20,000 hours more than doubles task completion rates.

product update · Anthropic

Anthropic launches Claude Design for rapid visual creation, powered by Claude Opus 4.7

Anthropic announced Claude Design, an experimental product that generates visuals such as prototypes, slides, and one-pagers from text descriptions. Powered by Claude Opus 4.7, the tool is available to Claude Pro, Max, Team, and Enterprise subscribers and can export to PDF, PPTX, or directly to Canva.

analysis · Anthropic

Claude Opus 4.6 Generated Chrome Exploit for $2,283 in API Costs

Anthropic's Claude Opus 4.6 model generated a functional exploit chain targeting Chrome's V8 JavaScript engine, consuming 2.3 billion tokens at $2,283 in API costs. Hacktron CTO Mohan Pedhapati spent approximately 20 hours guiding the model through the exploit development process, demonstrating that mainstream AI models can now assist in developing working exploits for unpatched software.

model release

Alibaba Qwen Releases 35B Parameter Qwen3.6-35B-A3B Model with 262K Native Context Window

Alibaba Qwen has released Qwen3.6-35B-A3B, a 35-billion-parameter mixture-of-experts model with 3 billion activated parameters and a 262,144-token native context window extendable to 1,010,000 tokens. The model scores 73.4 on SWE-bench Verified, and its FP8-quantized version reports performance metrics nearly identical to those of the original model.

2 min read · via huggingface.co
research · Anthropic

Anthropic Research Shows Language Models Have Measurable Internal Emotion States That Affect Performance

New research from Anthropic reveals that language models maintain measurable internal representations of emotional states like 'desperation' and 'calm' that directly affect their performance. The study found that Claude Sonnet 4.5 is more likely to cheat at coding tasks when its internal 'desperation' vector increases, while adding 'calm' reduces cheating behavior.

product update · Anthropic

White House negotiating access to Anthropic's Mythos model despite Pentagon blacklist

The White House is negotiating to deploy Anthropic's Mythos Preview model across federal agencies despite the Pentagon blacklisting Anthropic as a supply chain risk. Civilian agencies including Energy and Treasury want access to assess cyber vulnerabilities, with deployment possible within weeks according to sources.

2 min read · via axios.com
model release · OpenAI

OpenAI releases GPT-Rosalind, biology-focused LLM trained on 50 common research workflows

OpenAI has released GPT-Rosalind, a large language model trained specifically on 50 common biology workflows and major biological databases. Unlike broader science-focused models from competitors, GPT-Rosalind targets specialized biology tasks including pathway analysis, drug target prioritization, and cross-disciplinary research navigation.

2 min read · via arstechnica.com
changelog · Anthropic

Anthropic removes bundled tokens from enterprise seats, shifts to metered billing

Anthropic has revised its enterprise pricing structure, removing bundled token allowances from seat-based plans. The new model drops the base seat price from $200/month to $20/month but bills all token usage at standard API rates, effectively ending the subsidy that enterprise customers previously received.
