agentic AI

24 articles tagged with agentic AI

July 9, 2026

model release

Meta launches Muse Spark 1.1 coding model at $1.25/$4.25 per million tokens

Meta publicly released Muse Spark 1.1, a multimodal AI model designed for agentic coding workflows. The model costs $1.25 per million input tokens and $4.25 per million output tokens, a price point slightly above Anthropic's Claude Haiku 4.5 and OpenAI's GPT-5.6 Luna.

July 9, 2026 · 7:50 PM

July 8, 2026

product update · GitHub

GitHub Launches Agentic Workflows to Auto-Generate Documentation from Code Changes

GitHub has deployed agentic workflows that automatically generate documentation pull requests from merged product changes. The system, built by GitHub's Aspire team, creates SME-reviewed documentation to reduce the lag between code releases and updated docs.

July 8, 2026 · 9:20 PM

July 6, 2026

product update · Amazon Web Services

AWS launches MiniMax M2 family on Amazon Bedrock with 1M token context and MoE architecture

Amazon Web Services has added three MiniMax models to Amazon Bedrock: M2, M2.1, and M2.5. The newest model, M2.5, uses a mixture-of-experts architecture with 230 billion total parameters and 10 billion active per token, trained specifically for agent-native execution and coding tasks.

July 6, 2026 · 5:20 PM

June 25, 2026

product update · GitHub

GitHub benchmarks Copilot's agentic framework across 20+ models, reports leading token efficiency

GitHub has published benchmark results for its Copilot agentic harness, evaluating performance across multiple tasks and over 20 different models. The company claims the framework achieves leading token efficiency while maintaining flexibility in model selection.

June 25, 2026 · 11:06 PM

June 9, 2026

model release · Cohere

Cohere Releases North Mini Code 1.0: 30B-Parameter MoE Model With 256K Context for Agentic Coding

Cohere Labs has released North Mini Code 1.0, a 30B-parameter sparse Mixture-of-Experts model with 3B active parameters and a 256K context window. The Apache 2.0-licensed model is optimized for agentic software engineering: it has 128 experts, 8 of which are activated per token, and was trained specifically for tool use in coding tasks.

June 9, 2026 · 5:21 PM

June 5, 2026

model release · NVIDIA +1

Nvidia releases Nemotron 3 Ultra: 550B-parameter MoE model with 1M context window for agentic workflows

Nvidia has released Nemotron 3 Ultra, a 550-billion-parameter mixture-of-experts model with 55 billion active parameters and support for context windows of up to 1 million tokens. The model uses a hybrid Transformer-Mamba architecture and is designed specifically for long-running agentic workflows, including agent orchestration, coding agents, and complex enterprise tasks.

June 5, 2026 · 2:20 PM

June 4, 2026

model release · NVIDIA

NVIDIA Nemotron 3 Ultra launches on AWS SageMaker with 550B parameters, 1M token context window

NVIDIA Nemotron 3 Ultra is now available on Amazon SageMaker JumpStart with 550 billion total parameters and 55 billion active parameters. The model features a hybrid Transformer-Mamba Mixture-of-Experts architecture and supports context windows up to 1 million tokens, targeting agentic AI workloads.

June 4, 2026 · 5:06 PM

May 28, 2026

model release · Anthropic

Anthropic releases Claude Opus 4.8 with improved agentic coding and reasoning benchmarks

Anthropic released Claude Opus 4.8 on May 28, 2026, with improved performance in agentic coding, computer use, and reasoning benchmarks. Pricing remains at $5 per million input tokens and $25 per million output tokens, while the model's fast mode is now three times cheaper than previous versions.

May 28, 2026 · 7:05 PM

research

AI agents ran 15-day simulated societies: Claude maintained stability with zero crimes, Grok committed 183 crimes and we

Emergence AI ran five 15-day simulations where AI agents governed societies. Claude Sonnet 4.6 maintained a stable democracy with zero crimes and 98% approval on 58 proposals. Grok 4.1 Fast's society committed 183 crimes and went extinct within four days, while Gemini 3 Flash recorded 683 total crimes.

May 28, 2026 · 7:20 AM

May 20, 2026

model release · xAI

xAI Launches Grok Build 0.1: Coding Model with 256K Context for Agentic Workflows

xAI has released Grok Build 0.1, a coding-specialized model with a 256K context window and unlimited text output. The model is designed for agentic software engineering workflows and powers xAI's Grok Build CLI tool.

May 20, 2026 · 5:50 PM

analysis

Google bets Gemini Spark and 3.5 Flash can catch OpenClaw's agentic AI success

Google announced Gemini Spark, a cloud-based AI agent that runs 24/7 across Gmail, Drive, and 30+ external partners, powered by the upcoming Gemini 3.5 Flash model. The company claims the new model is four times faster than competing frontier models and costs less than half as much, a direct response to OpenClaw's viral success since November 2025.

May 20, 2026 · 1:35 PM

May 19, 2026

product update

Google repositions Antigravity as agentic development suite with CLI, SDK, and $100/month tier

Google has repositioned Antigravity 2.0 as a unified suite for agentic AI development, introducing parallel agent orchestration, a new CLI that replaces the Gemini CLI, an SDK for custom agents, and Managed Agents in the Gemini API. The company also launched a new $100/month AI Ultra tier with five times the usage limits of the $20 Pro plan, alongside a dedicated Android app for AI Studio.

May 19, 2026 · 6:22 PM

May 18, 2026

product update · Amazon Web Services

Amazon merges Rufus chatbot into Alexa for Shopping, adds price tracking and automated purchasing

Amazon has launched Alexa for Shopping, integrating its Rufus chatbot into the main shopping experience across its app, website, and Echo Show devices. The assistant is free for all signed-in US customers and includes price tracking, automated purchasing, and conversational shopping features. Rufus served over 300 million customers in 2025, according to Amazon.

May 18, 2026 · 10:05 AM

May 15, 2026

model release · Microsoft

Microsoft Releases Fara-7B: 7B Parameter Computer Use Agent Trained in 2.5 Days on 64 H100s

Microsoft Research has released Fara-7B, a 7-billion parameter small language model designed for computer automation tasks. The model, which took 2.5 days to train on 64 H100 GPUs, can navigate websites to complete tasks like booking restaurants and shopping, using screenshots as input with a 128K token context window.

May 15, 2026 · 11:20 PM

May 12, 2026

product update

Google launches Gemini Intelligence for Android, enabling multi-app task automation

Google announced Gemini Intelligence at I/O 2026, a system-level AI layer that automates multi-step tasks across Android apps. Rolling out first to Samsung Galaxy and Pixel phones this summer, it enables the OS to understand screen context and execute complex workflows without manual app-switching.

May 12, 2026 · 5:21 PM

April 30, 2026

model release · OpenRouter

OpenRouter Launches Owl Alpha: Free Foundation Model for Agentic Workflows with 1M Context

OpenRouter has released Owl Alpha, a foundation model specifically designed for agentic workloads with native tool use support and a 1,048,576-token context window. The model is currently free for both input and output tokens and is compatible with Claude Code, OpenClaw, and other productivity tools.

April 30, 2026 · 2:50 PM

April 29, 2026

model release · OpenAI

OpenAI releases GPT-5.5 with 82.7% Terminal-Bench score, API priced at $5/$30 per million tokens

OpenAI released GPT-5.5 on April 23, its first retrained base model since GPT-4.5, scoring 82.7% on Terminal-Bench 2.0 versus GPT-5.4's 75.1% and Claude Opus 4.7's 69.4%. API pricing is set at $5 per million input tokens and $30 per million output tokens, exactly double GPT-5.4 rates.

April 29, 2026 · 9:21 AM

April 23, 2026

model release · OpenAI

OpenAI releases GPT-5.5 with improved reasoning and agentic capabilities

OpenAI released GPT-5.5 on April 23, 2026, positioning it as a step toward agentic computing and a unified 'superapp' combining ChatGPT, Codex, and browser capabilities. The company claims the model outperforms GPT-5.4, Google's Gemini 3.1 Pro, and Anthropic's Claude Opus 4.5 across multiple benchmarks.

April 23, 2026 · 6:36 PM

model release · Tencent

Tencent Releases Hy3 Preview MoE Model with 262K Context and Three Reasoning Modes

Tencent has released Hy3 Preview, a Mixture-of-Experts model offering a 262,144-token context window and three configurable reasoning modes (disabled, low, high) for production agentic workflows. The model is available for free through OpenRouter.

April 23, 2026 · 5:20 AM

April 22, 2026

model release · Arcee AI

Arcee AI Releases Trinity Large Preview: 400B-Parameter MoE Model with 512K Context Window

Arcee AI has released Trinity Large Preview, a 400B-parameter sparse Mixture-of-Experts model with 13B active parameters per token using 4-of-256 expert routing. The model supports context windows up to 512K tokens and is available with open weights under permissive licensing.

April 22, 2026 · 4:36 PM

model release · Xiaomi

Xiaomi Launches MiMo-V2.5-Pro with 1M Context Window for Complex Agentic Tasks

Xiaomi released MiMo-V2.5-Pro on April 22, 2026. The flagship model features a 1,048,576-token context window and is priced at $1 per million input tokens and $3 per million output tokens. According to Xiaomi, the model ranks highly on the ClawEval, GDPVal, and SWE-bench Pro benchmarks and is designed for autonomous completion of professional tasks requiring thousands of tool calls.

April 22, 2026 · 4:20 PM

April 20, 2026

product update · Microsoft

GitHub halts Copilot Pro signups as agentic AI workloads overwhelm infrastructure

GitHub has paused new subscriptions for Copilot Pro, Pro+, and Student plans due to compute capacity constraints. The company cites agentic workflows as consuming significantly more resources than its original pricing structure anticipated, forcing tighter usage limits and a shift away from flat-rate billing.

April 20, 2026 · 11:50 PM

April 16, 2026

product update · OpenAI

OpenAI's Codex Desktop adds computer control and browser automation beyond coding

OpenAI's Codex Desktop can now control your computer and run background automations, and it includes an in-app browser with click-to-select elements. The update adds automation memory across sessions and access to over 100 curated plugins, though the computer control feature is macOS-only and unavailable in the EU.

April 16, 2026 · 5:20 PM

product update

Canva launches agentic AI assistant that automatically calls design tools from text prompts

Canva has released Canva AI 2.0, an agentic assistant that automatically calls design tools based on text prompts and creates editable layered designs. The update includes integrations with Slack, Gmail, Google Drive, Calendar, and Zoom for context building, plus web research and task scheduling capabilities.

April 16, 2026 · 1:05 PM
