enterprise-ai
30 articles tagged with enterprise-ai
IBM Releases Granite Embedding 311M R2 With 32K Context, 200+ Language Support
IBM released Granite Embedding 311M Multilingual R2, a 311-million-parameter dense embedding model with a 32,768-token context length and support for 200+ languages. The model scores 64.0 on Multilingual MTEB Retrieval (18 tasks), an 11.8-point improvement over its predecessor, and ships with ONNX and OpenVINO models for production deployment.
Anthropic's Mythos model finds tens of thousands of vulnerabilities, CEO warns of 6-12 month patching window
Anthropic CEO Dario Amodei disclosed that the company's Mythos model has uncovered tens of thousands of software vulnerabilities, including nearly 300 in Firefox alone, compared with 20 found by earlier Claude models. Amodei warned of a 6-12 month window to patch these vulnerabilities before Chinese AI systems catch up in capability.
Perplexity's Mac-Native 'Personal Computer' Platform Claims $2.8B in Labor-Equivalent Work
Perplexity CEO Aravind Srinivas revealed that the company's Mac-native Personal Computer platform has performed more than $2.8B in labor-equivalent work for Pro, Max, and Enterprise subscribers since launch. The announcement follows Apple CFO Kevan Parekh citing Perplexity as an example of developers building enterprise-grade AI assistants on Mac during Apple's Q2 2026 earnings call.
Microsoft reports 20M paid Copilot users, weekly engagement now matches Outlook
Microsoft CEO Satya Nadella disclosed that M365 Copilot has reached 20 million paid enterprise seats during the company's quarterly earnings call. Weekly engagement now matches Outlook usage levels, with queries per user up 20% quarter-over-quarter.
Amazon launches Quick desktop app with persistent context tracking across Google Workspace, Microsoft 365, Zoom, and Sal
Amazon has released a desktop version of its Quick AI assistant that integrates with Google Workspace, Microsoft 365, Zoom, and Salesforce, storing persistent context about user activities to automate tasks. The company also split Amazon Connect into four vertical-specific products: Connect Decisions, Connect Talent, Connect Health, and Connect Customer AI.
Google rebrands Vertex AI as Gemini Enterprise Agent Platform with governance tools for managing agent fleets
Google has rebranded its Vertex AI developer platform as the Gemini Enterprise Agent Platform, introducing tools for building, deploying, governing, and monitoring large-scale AI agent deployments. The platform includes Agent Studio for low-code agent creation, Agent Gateway for security enforcement, and cryptographic identity management for each agent.
Google launches Workspace Intelligence semantic layer and TPU 8t/8i chips with 2.8x training performance
Google announced Workspace Intelligence, a semantic understanding layer that connects data across Gmail, Docs, and other Workspace apps to power Gemini features. The company also released TPU 8t chips for training (2.8x better price/performance) and TPU 8i chips for inference (80% better performance-per-dollar).
Altman criticizes Anthropic's restricted Mythos cybersecurity model as 'fear-based marketing'
OpenAI CEO Sam Altman criticized Anthropic's new cybersecurity model Mythos during a podcast appearance, calling the company's decision to restrict public access 'fear-based marketing.' Anthropic claims Mythos is too powerful to release publicly due to potential weaponization by cybercriminals.
Anthropic launches Claude Design for rapid visual creation, powered by Claude Opus 4.7
Anthropic announced Claude Design, an experimental product that generates visuals like prototypes, slides, and one-pagers from text descriptions. Powered by Claude Opus 4.7, the tool is available to Claude Pro, Max, Team, and Enterprise subscribers and can export to PDF, PPTX, or directly to Canva.
Microsoft developing local AI agent to compete with open-source OpenClaw
Microsoft is testing OpenClaw-like features for Microsoft 365 Copilot aimed at enterprise customers, the company confirmed to The Information. The agent would run continuously to complete multi-step tasks over extended periods, distinguishing it from Microsoft's existing cloud-based agents like Copilot Cowork and Copilot Tasks.
Enterprise AI gap widens as open-weight models mature into production-ready alternatives
Open-weight models from Google, Alibaba, Microsoft, and Nvidia have crossed a threshold from research projects to enterprise-grade systems. The shift reflects a growing divide: frontier models from OpenAI and Anthropic are too expensive and pose data security risks for most enterprises, while open alternatives now deliver sufficient capability at a fraction of the cost.
Anthropic exits Claude Cowork research preview with enterprise features, launches Claude Managed Agents beta
Anthropic has promoted Claude Cowork from research preview to general availability, adding six enterprise features including role-based access controls, group spend limits, and usage analytics. The company simultaneously launched Claude Managed Agents in public beta, a composable API suite for building and deploying cloud-hosted agents without custom infrastructure work.
Stability AI launches Brand Studio for enterprise image generation with brand-specific models
Stability AI has launched Brand Studio, a commercial platform designed for creative teams to generate AI images aligned with their brand identity. The platform includes Brand Central for training custom models, Producer Mode for automated visual workflows, and Curated Model Routing that selects optimal models for specific tasks.
Google launches Gemma 4 open-weights models with Apache 2.0 license to compete with Chinese LLMs
Google released Gemma 4, a new line of open-weights models available in sizes from 2 billion to 31 billion parameters, under a permissive Apache 2.0 license. The release includes multimodal capabilities, support for 140+ languages, native function calling, and a 256,000-token context window for the larger variants.
Holo3 achieves 78.85% on OSWorld benchmark with only 10B active parameters
H Company unveiled Holo3, a computer-use model that scores 78.85% on OSWorld-Verified, the highest score on the leading desktop automation benchmark. The model achieves this with only 10B active parameters (122B total), positioning it as a lower-cost alternative to proprietary models like GPT 5.4 and Opus 4.6.
Google releases Gemini 3.1 Flash Live, its highest-quality audio model for real-time voice AI
Google has released Gemini 3.1 Flash Live, its highest-quality audio model designed for natural and reliable real-time voice interactions. The model scores 90.8% on ComplexFuncBench Audio and 36.1% on Scale AI's Audio MultiChallenge with thinking enabled. It's now available to developers via the Gemini Live API, enterprises through Gemini Enterprise for Customer Experience, and consumers in Search Live and Gemini Live across 200+ countries.
Mistral releases Voxtral TTS, open-source speech model for enterprise voice agents
Mistral AI released Voxtral TTS, an open-source text-to-speech model designed for enterprise voice agents and edge devices. The model supports nine languages, adapts custom voices from samples under five seconds, and achieves 90ms time-to-first-audio latency with a 6x real-time factor.
Stability AI releases Stable Audio 2.5 for enterprise sound production
Stability AI released Stable Audio 2.5, positioned as the first audio generation model built specifically for enterprise sound production. The model introduces improvements in quality and control for creating dynamic compositions adaptable to custom brand needs.
Anthropic releases Claude computer use feature to compete with OpenClaw
Anthropic announced Monday that Claude can now complete tasks on users' computers, including opening apps, navigating browsers, and filling spreadsheets, after receiving prompts from a smartphone. The feature positions Anthropic directly against OpenClaw, the viral AI agent that went mainstream this year. The capability comes with safeguards requiring Claude to request permission before accessing new applications.
Multiverse Computing launches API portal for compressed AI models to reduce cloud dependence
Multiverse Computing, a Spanish startup, has launched a self-serve API portal giving developers direct access to compressed versions of models from OpenAI, Meta, DeepSeek, and Mistral AI. The move targets enterprises seeking to reduce cloud infrastructure dependence and lower compute costs through edge deployment. The company claims its HyperNova 60B 2602 model delivers faster responses at lower cost than the original OpenAI model it was derived from.
Perplexity launches Computer for Enterprise, claims $1.6M labor savings in internal test
Perplexity made Computer for Enterprise generally available to enterprise customers on March 12, claiming an internal study of 16,000+ queries showed $1.6 million in labor cost savings and 3.2 years of equivalent work completed in four weeks. The service integrates with Gmail, Outlook, GitHub, Linear, Slack, Notion, Snowflake, Databricks, and Salesforce, orchestrating tasks across 20 frontier models with agentic internet access.
Alibaba consolidates AI under new "Token Hub" unit led by CEO Eddie Wu
Alibaba has consolidated its AI operations into a new business unit called "Alibaba Token Hub" (ATH), reporting directly to CEO Eddie Wu. The restructuring merges the Qwen research team, consumer apps, DingTalk communication platform, and Quark-branded devices to accelerate collaboration and monetization across the company.
OpenAI acquires Promptfoo, an AI security and testing platform
OpenAI is acquiring Promptfoo, an AI security platform that helps enterprises identify and remediate vulnerabilities in AI systems during development. Terms of the acquisition were not disclosed.
AI agent compromised McKinsey's internal platform in 2 hours using SQL injection
An AI agent deployed by security firm Codewall gained full read and write access to McKinsey's internal AI platform Lilli within two hours, without credentials or insider knowledge. The exploit used SQL injection, a decades-old attack technique, to compromise a system that more than 43,000 employees use for strategy work and client research.
OpenAI acquires Promptfoo to strengthen AI agent security capabilities
OpenAI has acquired Promptfoo, a platform for testing and evaluating AI agents. The acquisition signals frontier labs' intensifying focus on proving their technology can operate safely in critical business environments.
Tabnine launches Enterprise Context Engine to ground AI coding in production environments
Tabnine has introduced its Enterprise Context Engine, designed to give AI models the contextual understanding needed to operate safely within real production development environments. The tool addresses a gap between raw model capability and practical enterprise deployment, where understanding an organization's codebase, dependencies, and architecture is critical.
Anthropic expands Claude Cowork with enterprise app integrations and multi-step automation
Anthropic expanded Claude Cowork with integrations to Google Workspace, DocuSign, and WordPress, plus pre-built plugins for HR, design, engineering, and finance tasks. The platform can now execute multi-step workflows across Excel and PowerPoint automatically.
GitHub Copilot adds model picker to coding agent for Business and Enterprise users
GitHub has added a model picker feature to Copilot's autonomous coding agent for Business and Enterprise tier users. The feature allows teams to select which AI model powers the asynchronous background agent that handles delegated development tasks.
AIG deploys agentic AI system with orchestration layer for underwriting
American International Group (AIG) has deployed an agentic AI system with an orchestration layer, reporting faster-than-expected productivity gains in underwriting and portfolio management. The deployment demonstrates measurable improvements in throughput and workflow efficiency, according to recent investor disclosures.
Goldman Sachs deploys Claude for trade accounting and client onboarding
Goldman Sachs is deploying Anthropic's Claude model in trade accounting and client onboarding operations. The deployment reflects broader adoption of generative AI among large financial institutions seeking to improve operational efficiency in back-office processes.