product updateMicrosoft

Microsoft Copilot Researcher adds multi-model features using GPT and Claude

TL;DR

Microsoft has enabled its Copilot Researcher tool to simultaneously leverage OpenAI's GPT and Anthropic's Claude through two new features: Critique, which uses GPT responses refined by Claude, and Model Council, which displays side-by-side outputs with agreement/disagreement analysis. Both features are rolling out in the Microsoft 365 Copilot Frontier early access program.

2 min read
0

Microsoft Copilot Researcher Adds Multi-Model Architecture Using GPT and Claude

Microsoft has expanded its Copilot Researcher tool with dual new features that combine OpenAI's GPT and Anthropic's Claude simultaneously for research tasks, according to a blog post announcing the Copilot Cowork platform.

Critique Feature: Sequential Model Refinement

The Critique feature generates initial responses using GPT, then refines them through Claude. Microsoft claims this architecture "creates a powerful feedback loop that delivers higher-quality results across factual accuracy, analytical breadth, and presentation."

The company states that Researcher's process mirrors "academic and professional research settings." According to Microsoft, the upgrade scores higher than Perplexity's Deep Research models on the Deep Research Accuracy, Completeness, and Objectivity benchmark—though specific benchmark numbers were not disclosed.

Model Council: Comparative Analysis

Alternatively, users can select Model Council to receive side-by-side responses from both Anthropic and OpenAI models. The feature includes a report highlighting where the two models agree and disagree, allowing researchers to evaluate multiple perspectives on complex queries.

Microsoft designed Researcher specifically for multi-step research tasks, distinguishing it from standard Copilot. Anthropic independently operates a Research feature using multiple Claude agents for similar purposes.

Availability and Context

Both Critique and Model Council features are currently available exclusively in Microsoft 365 Copilot's Frontier program, which functions as an early access testing ground for the company's AI innovations. No general availability date has been announced.

The move reflects a broader industry trend of combining complementary AI models to offset individual model weaknesses. Neither OpenAI nor Anthropic has publicly objected to this architecture, suggesting potential partnership or licensing arrangements, though details remain undisclosed.

What this means

Microsoft is positioning multi-model research workflows as enterprise standard practice rather than single-model reliance. This approach benefits organizations requiring high-confidence outputs on complex research tasks. The lack of published benchmark numbers limits independent verification of claims versus Perplexity's comparable offering. Availability remains gated to early access users, suggesting Microsoft is gathering feedback before broader rollout.

Related Articles

product update

Microsoft expands Copilot Cowork with AI model critique feature and cross-model comparison

Microsoft is expanding Copilot Cowork availability and introducing a Critique function that enables one AI model to review another's output. The update also includes a new Researcher agent claiming best-in-class deep research performance, outperforming Perplexity by 7 points, and a Model Council feature for direct model comparison.

product update

Claude adds memory import tool to help users switch from ChatGPT and other AI services

Anthropic has launched a memory import feature for Claude that lets users transfer their stored preferences, personal details, and conversation context from other AI services like ChatGPT, Google Gemini, and Microsoft Copilot. The tool generates copy-paste instructions that extract all memories from a competing service and import them into Claude, eliminating the need to rebuild your AI profile from scratch.

model release

Microsoft releases Harrier embedding models with 32K token context, tops multilingual benchmark

Microsoft has released Harrier-OSS-v1, a family of multilingual text embedding models trained with contrastive learning and knowledge distillation. The 0.6B parameter variant achieves a 69.0 score on the Multilingual MTEB v2 benchmark with support for 32,768 token context windows and 45+ languages.

product update

OpenAI shuts down Sora and indefinitely pauses ChatGPT adult mode in March purge

OpenAI shut down two projects in March 2026: the Sora AI video app (launched September 2025, operational for six months) and indefinitely paused the planned ChatGPT adult mode. The company cited sexual dataset management and illegal content elimination as barriers to the adult feature launch.

Comments

Loading...