product updateMicrosoft

Microsoft Copilot Researcher adds multi-model features using GPT and Claude

TL;DR

Microsoft has enabled its Copilot Researcher tool to simultaneously leverage OpenAI's GPT and Anthropic's Claude through two new features: Critique, which uses GPT responses refined by Claude, and Model Council, which displays side-by-side outputs with agreement/disagreement analysis. Both features are rolling out in the Microsoft 365 Copilot Frontier early access program.

2 min read
0

Microsoft Copilot Researcher Adds Multi-Model Architecture Using GPT and Claude

Microsoft has expanded its Copilot Researcher tool with dual new features that combine OpenAI's GPT and Anthropic's Claude simultaneously for research tasks, according to a blog post announcing the Copilot Cowork platform.

Critique Feature: Sequential Model Refinement

The Critique feature generates initial responses using GPT, then refines them through Claude. Microsoft claims this architecture "creates a powerful feedback loop that delivers higher-quality results across factual accuracy, analytical breadth, and presentation."

The company states that Researcher's process mirrors "academic and professional research settings." According to Microsoft, the upgrade scores higher than Perplexity's Deep Research models on the Deep Research Accuracy, Completeness, and Objectivity benchmark—though specific benchmark numbers were not disclosed.

Model Council: Comparative Analysis

Alternatively, users can select Model Council to receive side-by-side responses from both Anthropic and OpenAI models. The feature includes a report highlighting where the two models agree and disagree, allowing researchers to evaluate multiple perspectives on complex queries.

Microsoft designed Researcher specifically for multi-step research tasks, distinguishing it from standard Copilot. Anthropic independently operates a Research feature using multiple Claude agents for similar purposes.

Availability and Context

Both Critique and Model Council features are currently available exclusively in Microsoft 365 Copilot's Frontier program, which functions as an early access testing ground for the company's AI innovations. No general availability date has been announced.

The move reflects a broader industry trend of combining complementary AI models to offset individual model weaknesses. Neither OpenAI nor Anthropic has publicly objected to this architecture, suggesting potential partnership or licensing arrangements, though details remain undisclosed.

What this means

Microsoft is positioning multi-model research workflows as enterprise standard practice rather than single-model reliance. This approach benefits organizations requiring high-confidence outputs on complex research tasks. The lack of published benchmark numbers limits independent verification of claims versus Perplexity's comparable offering. Availability remains gated to early access users, suggesting Microsoft is gathering feedback before broader rollout.

Related Articles

product update

Trump Administration Permits Anthropic's Claude Mythos 5 for 100+ US Organizations After Two-Week Ban

The Trump administration is allowing Anthropic to deploy Claude Mythos 5 to over 100 specific US government agencies and companies, two weeks after banning the cybersecurity model. Commerce Secretary Howard Lutnick approved access for organizations operating critical infrastructure, including non-American employees, though Fable 5 remains unavailable.

product update

Apple adds Google Gemini to Xcode 26.6 as third coding assistant option alongside Claude and OpenAI Codex

Apple released Xcode 26.6 on June 25, 2026, adding Google Gemini as a third AI coding assistant option for developers. The IDE now supports Gemini alongside Anthropic Claude Agent and OpenAI Codex, plus compatibility with other agents through the Agent Client Protocol.

product update

GitHub Copilot agentic harness supports 20+ models with leading token efficiency across benchmarks

GitHub published benchmark results for its Copilot agentic harness, which supports more than 20 models from providers including Anthropic, OpenAI, and others. The company claims the harness delivers leading token efficiency while maintaining flexibility across model choices.

product update

Anthropic launches Claude Tag for Slack, writes 65% of its product team's code

Anthropic released Claude Tag, a beta feature that integrates Claude into Slack for Enterprise and Team customers. The company says the tool writes 65% of its product team's code and can work proactively with ambient mode enabled.

Comments

Loading...