enterprise-ai
50 articles tagged with enterprise-ai
Sakana AI releases Fugu orchestration model to route tasks across multiple AI vendors
Sakana AI released Fugu, an orchestration language model that routes tasks across multiple AI providers to reduce vendor lock-in risks. The Japanese AI firm positions Fugu as a solution to enterprise dependency on single monolithic AI APIs.
GitHub details Qubot, internal Copilot-powered data analytics agent for plain language queries
GitHub has released technical details on Qubot, an internal analytics agent powered by GitHub Copilot that enables employees to query company data using natural language. The agent represents GitHub's implementation of AI-assisted data analysis for internal operations.
GitHub built Qubot, an internal data analytics agent using Copilot to query company data in natural language
GitHub has built Qubot, an internal analytics agent powered by GitHub Copilot that allows employees to query company data using natural language. The project represents GitHub's approach to building domain-specific AI agents for data analysis tasks.
Mistral Rebrands Le Chat as Vibe, Launches Agentic Work and Code Modes with VS Code Extension
Mistral has rebranded Le Chat as Vibe, launching new agentic capabilities for long-running work tasks and software development. The platform now includes Work Mode for enterprise knowledge search and document synthesis, Code Mode with GitHub integration and sandboxed execution, and a new VS Code extension. Pricing starts at $14.99/month for Pro and $24.99/user/month for Team plans.
Mistral Acquires Emmi AI, Launches Physics Simulation Models for Industrial Engineering
Mistral has acquired Emmi AI and launched a physics AI capability that reduces computational fluid dynamics and finite element simulations from hours to seconds on a single GPU. The company is deploying the technology with ASML, Airbus, Safran, and Siemens Energy for design optimization, tooling, and real-time digital twins.
Mistral AI Launches Forge for Enterprise Model Training on Proprietary Data
Mistral AI has launched Forge, a platform that allows enterprises to train custom AI models on their proprietary data including codebases, compliance policies, and operational documentation. The system supports both dense and mixture-of-experts architectures with pre-training, post-training, and reinforcement learning capabilities.
Mistral Launches AI Studio Platform for Enterprise Model Deployment and Governance
Mistral AI launched AI Studio, a production platform designed to move enterprise AI systems from prototype to deployment. The platform includes three core components: Observability for tracking model performance, an Agent Runtime built on Temporal for durable execution, and an AI Registry for asset versioning and governance.
Amazon QuickSight launches autonomous AI agents that work continuously in background
Amazon has launched autonomous agents in QuickSight (branded as Quick) that execute tasks continuously in the background while users attend meetings or focus on other work. The update includes 16 new data source integrations, an activity feed that consolidates communications across tools, and cross-system query capabilities that join data from multiple sources in real time.
Microsoft restricts Claude Fable 5 internally over 30-day data retention requirement
Microsoft has restricted internal employee access to Anthropic's newly released Claude Fable 5 model while its legal teams evaluate the company's new data retention requirements. The model requires storing prompts and outputs for 30 days to operate safety classifiers, with some content potentially retained for up to two years if flagged for policy violations.
Anthropic releases Claude Fable 5 with Mythos-class capabilities at $10/$50 per million tokens
Anthropic released Claude Fable 5, a Mythos-class model, to enterprise customers and paid subscribers two months after limiting its advanced Mythos model to select users. The new model costs $10 per million input tokens and $50 per million output tokens—twice the price of Claude Opus 4.8—and includes safeguards that block responses in high-risk areas like cybersecurity and biology.
Anthropic releases Claude Fable 5, a safety-limited version of Mythos, at $10/$50 per million tokens
Anthropic released Claude Fable 5, the first publicly available version of its Mythos model, with built-in safety restrictions that automatically block high-risk queries in cybersecurity, biology, chemistry, and related fields. The model costs $10 per million input tokens and $50 per million output tokens, double the price of Claude Opus 4.8.
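To make the quoted pricing concrete, here is a minimal sketch of a per-request cost calculation at $10 per million input tokens and $50 per million output tokens. The function name and the example token counts are hypothetical, and the sketch assumes flat list pricing with no caching or batch discounts, which the summary does not address.

```python
from decimal import Decimal

# Prices quoted in the summary above, in USD per one million tokens.
INPUT_PRICE_PER_MILLION = Decimal("10")
OUTPUT_PRICE_PER_MILLION = Decimal("50")
TOKENS_PER_MILLION = Decimal("1000000")


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of one request at the quoted list prices.

    Hypothetical helper: assumes flat pricing with no discounts.
    """
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


# Hypothetical workload: 20,000 input tokens and 2,000 output tokens.
example_cost = request_cost(20_000, 2_000)
print(f"${example_cost:.2f}")  # prints "$0.30"
```

Under these assumptions, output tokens dominate the bill quickly: the 2,000 output tokens in the example cost half as much as the 20,000 input tokens.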
Canva adds Perplexity Computer connector for autonomous design asset creation
Canva launched a connector for Perplexity Computer that enables the AI agent platform to autonomously create editable design assets based on user prompts and data. The integration is available for Perplexity Pro, Max, Enterprise Pro, and Enterprise Max subscribers.
Google expands Ask Gemini in Drive to search Gmail for Workspace and AI subscribers
Google has expanded its Ask Gemini in Drive feature to search Gmail, allowing eligible subscribers to query email threads alongside files and folders. The feature requires Google AI Pro, AI Ultra, or Workspace Business/Enterprise subscriptions.
Microsoft releases MAI-Thinking-1, its first reasoning model with 35B parameters
Microsoft released seven AI models at Build 2026, headlined by MAI-Thinking-1, its first reasoning model with 35 billion parameters. The company claims the model matches Anthropic's Claude Opus 4.6 on SWE Bench Pro coding benchmarks and beats Sonnet 4.6 in blind tests.
AWS launches hyperparameter optimization guide for Amazon Nova Forge custom model training
AWS has published a technical guide on hyperparameter optimization for Amazon Nova Forge, its platform for building custom frontier models from Amazon Nova checkpoints. The guide addresses three core challenges: catastrophic forgetting during domain specialization, learning rate calibration when mixing proprietary and curated training data, and baseline performance constraints for reinforcement fine-tuning.
Anthropic's Claude Opus 4.8 launches on AWS Bedrock in four regions
Anthropic's Claude Opus 4.8 is now available on Amazon Bedrock and Claude Platform on AWS. The model is designed for autonomous multi-stage tasks, agentic coding, and long-running workflows with reduced supervision.
Mistral acquires Emmi AI to launch physics simulation models for engineering design
Mistral has acquired Emmi AI and launched physics AI models that predict structural, thermal, and fluid dynamics behavior in seconds on a single GPU, compared to hours-to-weeks for traditional solvers. The company is targeting aerospace, automotive, semiconductor, and energy sectors with partners including ASML, Airbus, Safran, and Siemens Energy.
Mistral launches Workflows orchestration platform for production AI with durable execution and human-in-the-loop approval
Mistral has released Workflows in public preview, an orchestration layer for production AI systems built on Temporal's durable execution engine. The platform enables long-running AI processes to survive network failures, pause for human approval with a single line of code, and provides full execution history through Studio. Organizations including ASML, ABANCA, and CMA-CGM are already using Workflows for critical business automation.
Mistral OCR 3 launches at $2 per 1,000 pages with 74% win rate over previous version
Mistral AI released OCR 3, a document parsing model priced at $2 per 1,000 pages with a 50% batch API discount. The company claims a 74% overall win rate compared to Mistral OCR 2 on forms, scanned documents, complex tables, and handwriting.
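The pricing above reduces to simple arithmetic. The following is a rough sketch, assuming the 50% batch discount applies uniformly to the $2 per 1,000 pages rate; the function name and page count are hypothetical.

```python
from decimal import Decimal

# Figures quoted in the summary above.
PRICE_PER_1000_PAGES = Decimal("2")   # USD per 1,000 pages
BATCH_DISCOUNT = Decimal("0.50")      # 50% off via the batch API


def ocr_cost(pages: int, batch: bool = False) -> Decimal:
    """Return the USD cost of parsing `pages` pages.

    Hypothetical helper: assumes the batch discount is applied
    uniformly to the standard per-page rate.
    """
    cost = Decimal(pages) * PRICE_PER_1000_PAGES / Decimal(1000)
    if batch:
        cost *= Decimal(1) - BATCH_DISCOUNT
    return cost


# Hypothetical archive of 250,000 pages.
print(ocr_cost(250_000))              # standard rate: 500 USD
print(ocr_cost(250_000, batch=True))  # batch rate: 250 USD
```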
Mistral Releases Codestral 25.08 with 30% Higher Completion Acceptance, Ships Full Enterprise Coding Stack
Mistral AI released Codestral 25.08, showing 30% more accepted code completions and 10% higher retention rates. The company also shipped Devstral Small, a 24B-parameter agentic coding model scoring 53.6% on SWE-Bench Verified, alongside new embedding and IDE integration tools aimed at enterprise deployment.
Mistral AI Releases Magistral Reasoning Models: 24B Open-Source and Enterprise Versions Score 70.7% and 73.6% on AIME2024
Mistral AI has released Magistral, its first reasoning model line, in two versions: Magistral Small (24B parameters, Apache 2.0) and Magistral Medium (enterprise). Magistral Medium scored 73.6% on AIME2024 (90% with majority voting at 64 samples), while the open-source Small version achieved 70.7% (83.3% with voting).
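The "majority voting at 64 samples" figures refer to sampling the model many times on the same problem and keeping the most frequent final answer. A minimal sketch of that aggregation step follows; it is not Mistral's evaluation harness, the function name is hypothetical, and the sampling itself is left out.

```python
from collections import Counter
from typing import Sequence


def majority_vote(answers: Sequence[str]) -> str:
    """Return the most frequent answer among the sampled answers.

    Hypothetical helper: ties go to the answer that appeared first,
    which is how Counter.most_common orders equal counts.
    """
    if not answers:
        raise ValueError("at least one sampled answer is required")
    counts = Counter(answers)
    return counts.most_common(1)[0][0]


# Hypothetical final answers from five samples of one problem
# (the reported scores used 64 samples per problem).
samples = ["204", "204", "113", "204", "97"]
print(majority_vote(samples))  # prints "204"
```

Voting helps when the model's errors are scattered across many different wrong answers while its correct answer recurs, which is why the voted scores exceed the single-sample ones.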
Mistral AI launches Le Chat Enterprise with new Medium 3 model, enterprise search and agent builders
Mistral AI has launched Le Chat Enterprise, powered by its new Mistral Medium 3 model. The platform includes enterprise search across Google Drive, SharePoint, OneDrive, Gmail, and Google Calendar, no-code agent builders, custom data connectors, and hybrid deployment options including self-hosted and cloud.
Mistral Medium 3 launches at $0.4/$2 per million tokens, matching 90% of Claude 3.7 Sonnet performance
Mistral AI launched Mistral Medium 3 on May 7, 2025, priced at $0.4 per million input tokens and $2 per million output tokens. The company claims the model performs at or above 90% of Claude 3.7 Sonnet on benchmarks at a significantly lower price, and that it surpasses Llama 4 Maverick and Cohere Command A.
Mistral Launches Saba: 24B-Parameter Regional Model for Arabic and South Asian Languages
Mistral AI has released Saba, a 24B-parameter model trained specifically for Arabic and South Asian languages including Tamil. The model runs on single-GPU systems at over 150 tokens per second and is available via API or for on-premises deployment.
Mistral's Le Chat to integrate AFP newswire with 2,300 daily stories in six languages
Mistral AI announced a partnership with Agence France-Presse (AFP) to integrate the news agency's newswire into Le Chat. The integration will provide access to 2,300 daily stories in French, English, Spanish, Portuguese, German, and Arabic from AFP's network of 1,700 journalists.
AWS deploys AgentCore orchestration layer across 20+ sales agents, cutting latency 41% and saving reps 2 hours weekly
AWS has deployed Amazon Bedrock AgentCore to orchestrate more than 20 specialized AI agents across its global sales organization through a unified interface called Field Advisor. The system has processed over 120,000 prompts since launch, delivering a 41% latency reduction compared to previous infrastructure and saving large-scale sales reps up to 2 hours per week on CRM tasks.
DeepSeek cuts V4 Pro pricing by 75% to $0.003625 per million input tokens
DeepSeek has permanently reduced pricing for its V4 Pro model by 75%, bringing input token costs down to $0.003625 per million tokens from $0.0145. The move makes permanent a promotional discount that was set to expire May 31, 2026.
Google opens CodeMender API to select testers, pitching AI security tool to governments and enterprises
Google announced at I/O 2026 that it is opening API access for CodeMender, its AI agent for code security, to select expert groups. The company is positioning the tool to compete with Anthropic's Mythos Preview, which flagged unknown security vulnerabilities and secured major government and enterprise contracts.
IBM Releases Granite Embedding 311M R2 With 32K Context, 200+ Language Support
IBM released Granite Embedding 311M Multilingual R2, a 311-million parameter dense embedding model with 32,768-token context length and support for 200+ languages. The model scores 64.0 on Multilingual MTEB Retrieval (18 tasks), an 11.8-point improvement over its predecessor, and ships with ONNX and OpenVINO models for production deployment.
Anthropic's Mythos model finds tens of thousands of vulnerabilities, CEO warns of 6-12 month patching window
Anthropic CEO Dario Amodei disclosed that the company's Mythos model has uncovered tens of thousands of software vulnerabilities, including nearly 300 in Firefox alone compared to 20 found by earlier Claude models. Amodei warned of a 6-12 month window to patch these vulnerabilities before Chinese AI systems catch up in capability.
Perplexity's Mac-Native 'Personal Computer' Platform Claims $2.8B in Labor-Equivalent Work
Perplexity CEO Aravind Srinivas revealed that the company's Mac-native Personal Computer platform has performed more than $2.8B in labor-equivalent work for Pro, Max, and Enterprise subscribers since launch. The announcement follows Apple CFO Kevan Parekh citing Perplexity as an example of developers building enterprise-grade AI assistants on Mac during Apple's Q2 2026 earnings call.
Microsoft reports 20M paid Copilot users, weekly engagement now matches Outlook
Microsoft CEO Satya Nadella disclosed that M365 Copilot has reached 20 million paid enterprise seats during the company's quarterly earnings call. Weekly engagement now matches Outlook usage levels, with queries per user up 20% quarter-over-quarter.
Amazon launches Quick desktop app with persistent context tracking across Google Workspace, Microsoft 365, Zoom, and Salesforce
Amazon has released a desktop version of its Quick AI assistant that integrates with Google Workspace, Microsoft 365, Zoom, and Salesforce, storing persistent context about user activities to automate tasks. The company also split Amazon Connect into four vertical-specific products: Connect Decisions, Connect Talent, Connect Health, and Connect Customer AI.
Google rebrands Vertex AI as Gemini Enterprise Agent Platform with governance tools for managing agent fleets
Google has rebranded its Vertex AI developer platform as the Gemini Enterprise Agent Platform, introducing tools for building, deploying, governing, and monitoring large-scale AI agent deployments. The platform includes Agent Studio for low-code agent creation, Agent Gateway for security enforcement, and cryptographic identity management for each agent.
Google launches Workspace Intelligence semantic layer and TPU 8t/8i chips with 2.8x training performance
Google announced Workspace Intelligence, a semantic understanding layer that connects data across Gmail, Docs, and other Workspace apps to power Gemini features. The company also released TPU 8t chips for training (2.8x better price/performance) and TPU 8i chips for inference (80% better performance-per-dollar).
Altman criticizes Anthropic's restricted Mythos cybersecurity model as 'fear-based marketing'
OpenAI CEO Sam Altman criticized Anthropic's new cybersecurity model Mythos during a podcast appearance, calling the company's decision to restrict public access 'fear-based marketing.' Anthropic claims Mythos is too powerful to release publicly due to potential weaponization by cybercriminals.
Anthropic launches Claude Design for rapid visual creation, powered by Claude Opus 4.7
Anthropic announced Claude Design, an experimental product that generates visuals like prototypes, slides, and one-pagers from text descriptions. Powered by Claude Opus 4.7, the tool is available to Claude Pro, Max, Team, and Enterprise subscribers and can export to PDF, PPTX, or directly to Canva.
Microsoft developing local AI agent to compete with open-source OpenClaw
Microsoft is testing OpenClaw-like features for Microsoft 365 Copilot aimed at enterprise customers, the company confirmed to The Information. The agent would run continuously to complete multi-step tasks over extended periods, distinguishing it from Microsoft's existing cloud-based agents like Copilot Cowork and Copilot Tasks.
Enterprise AI gap widens as open-weight models mature into production-ready alternatives
Open-weight models from Google, Alibaba, Microsoft, and Nvidia have crossed a threshold from research projects to enterprise-grade systems. The shift reflects a growing divide: frontier models from OpenAI and Anthropic are too expensive and pose data security risks for most enterprises, while open alternatives now deliver sufficient capability at a fraction of the cost.
Anthropic exits Claude Cowork research preview with enterprise features, launches Claude Managed Agents beta
Anthropic has promoted Claude Cowork from research preview to general availability, adding six enterprise features including role-based access controls, group spend limits, and usage analytics. The company simultaneously launched Claude Managed Agents in public beta—a composable API suite for building and deploying cloud-hosted agents without custom infrastructure work.
Stability AI launches Brand Studio for enterprise image generation with brand-specific models
Stability AI has launched Brand Studio, a commercial platform designed for creative teams to generate AI images aligned with their brand identity. The platform includes Brand Central for training custom models, Producer Mode for automated visual workflows, and Curated Model Routing that selects optimal models for specific tasks.
Google launches Gemma 4 open-weights models with Apache 2.0 license to compete with Chinese LLMs
Google released Gemma 4, a new line of open-weights models available in sizes from 2 billion to 31 billion parameters, under a permissive Apache 2.0 license. The release includes multimodal capabilities, support for 140+ languages, native function calling, and a 256,000-token context window for the larger variants.
Holo3 achieves 78.85% on OSWorld benchmark with only 10B active parameters
H Company unveiled Holo3, a computer use model that scores 78.85% on the OSWorld-Verified benchmark—the highest on the leading desktop automation benchmark. The model achieves this with only 10B active parameters (122B total), positioning it as a lower-cost alternative to proprietary models like GPT 5.4 and Opus 4.6.
Google releases Gemini 3.1 Flash Live, its highest-quality audio model for real-time voice AI
Google has released Gemini 3.1 Flash Live, its highest-quality audio model designed for natural and reliable real-time voice interactions. The model scores 90.8% on ComplexFuncBench Audio and 36.1% on Scale AI's Audio MultiChallenge with thinking enabled. It's now available to developers via the Gemini Live API, enterprises through Gemini Enterprise for Customer Experience, and consumers in Search Live and Gemini Live across 200+ countries.
Mistral releases Voxtral TTS, open-source speech model for enterprise voice agents
Mistral AI released Voxtral TTS, an open-source text-to-speech model designed for enterprise voice agents and edge devices. The model supports nine languages, adapts custom voices from samples under five seconds, and achieves 90ms time-to-first-audio latency with a 6x real-time factor.
Stability AI releases Stable Audio 2.5 for enterprise sound production
Stability AI released Stable Audio 2.5, positioned as the first audio generation model built specifically for enterprise sound production. The model introduces improvements in quality and control for creating dynamic compositions adaptable to custom brand needs.
Anthropic releases Claude computer use feature to compete with OpenClaw
Anthropic announced Monday that Claude can now complete tasks on users' computers, including opening apps, navigating browsers, and filling spreadsheets, after receiving prompts from a smartphone. The feature positions Anthropic directly against OpenClaw, the viral AI agent that went mainstream this year. The capability comes with safeguards requiring Claude to request permission before accessing new applications.
Multiverse Computing launches API portal for compressed AI models to reduce cloud dependence
Multiverse Computing, a Spanish startup, has launched a self-serve API portal giving developers direct access to compressed versions of models from OpenAI, Meta, DeepSeek, and Mistral AI. The move targets enterprises seeking to reduce cloud infrastructure dependence and lower compute costs through edge deployment. The company claims its HyperNova 60B 2602 model delivers faster responses at lower cost than the original OpenAI model it was derived from.
Perplexity launches Computer for Enterprise, claims $1.6M labor savings in internal test
Perplexity made Computer for Enterprise generally available to enterprise customers on March 12, claiming an internal study of 16,000+ queries showed $1.6 million in labor cost savings and 3.2 years of equivalent work completed in four weeks. The service integrates with Gmail, Outlook, GitHub, Linear, Slack, Notion, Snowflake, Databricks, and Salesforce, orchestrating tasks across 20 frontier models with agentic internet access.
Alibaba consolidates AI under new "Token Hub" unit led by CEO Eddie Wu
Alibaba has consolidated its AI operations into a new business unit called "Alibaba Token Hub" (ATH), reporting directly to CEO Eddie Wu. The restructuring merges the Qwen research team, consumer apps, DingTalk communication platform, and Quark-branded devices to accelerate collaboration and monetization across the company.