Microsoft Copilot Researcher adds multi-model features using GPT and Claude
Microsoft has enabled its Copilot Researcher tool to simultaneously leverage OpenAI's GPT and Anthropic's Claude through two new features: Critique, which uses GPT responses refined by Claude, and Model Council, which displays side-by-side outputs with agreement/disagreement analysis. Both features are rolling out in the Microsoft 365 Copilot Frontier early access program.
Microsoft has expanded its Copilot Researcher tool with two new features that use OpenAI's GPT and Anthropic's Claude together on research tasks, according to a blog post announcing the Copilot Cowork platform.
Critique Feature: Sequential Model Refinement
The Critique feature generates initial responses using GPT, then refines them through Claude. Microsoft claims this architecture "creates a powerful feedback loop that delivers higher-quality results across factual accuracy, analytical breadth, and presentation."
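The draft-then-refine flow described above can be sketched in a few lines of Python. This is a minimal illustration, not Microsoft's implementation: the `critique_pipeline` function, the `Model` type, the review prompt wording, and the stub models are all hypothetical, and the stubs stand in for real model calls so the example runs offline.

```python
from typing import Callable, Dict

# Hypothetical type: any function that maps a prompt string to a response string.
Model = Callable[[str], str]


def critique_pipeline(question: str, drafter: Model, reviewer: Model) -> Dict[str, str]:
    """Generate a draft with one model, then ask a second model to revise it."""
    draft = drafter(question)
    # Assumed prompt wording; the real review instructions are not public.
    review_prompt = (
        "Question:\n" + question + "\n\n"
        "Draft answer:\n" + draft + "\n\n"
        "Revise the draft for factual accuracy, breadth, and presentation."
    )
    final = reviewer(review_prompt)
    return {"draft": draft, "final": final}


# Stub models (assumptions for illustration only, not real API clients).
def stub_drafter(prompt: str) -> str:
    return "draft: " + prompt


def stub_reviewer(prompt: str) -> str:
    return "[reviewed] " + prompt


result = critique_pipeline("What is X?", stub_drafter, stub_reviewer)
print(result["draft"])
```

In practice the two callables would wrap whichever model clients are available; the point of the sketch is only that the second model sees both the question and the first model's draft.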
The company states that Researcher's process mirrors "academic and professional research settings." According to Microsoft, the upgrade scores higher than Perplexity's Deep Research models on the Deep Research Accuracy, Completeness, and Objectivity benchmark, though specific benchmark numbers were not disclosed.
Model Council: Comparative Analysis
Alternatively, users can select Model Council to receive side-by-side responses from both Anthropic and OpenAI models. The feature includes a report highlighting where the two models agree and disagree, allowing researchers to evaluate multiple perspectives on complex queries.
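A rough sketch of the side-by-side idea follows. Microsoft has not described how the agreement report is produced, so the comparison here is a deliberately naive assumption: each non-empty line of a response is treated as one claim, and claims are compared as lowercase strings. The `model_council` and `split_claims` functions and the stub models are hypothetical names used only for illustration.

```python
from typing import Callable, Dict, List, Set

# Hypothetical type: any function that maps a prompt string to a response string.
Model = Callable[[str], str]


def split_claims(text: str) -> Set[str]:
    """Naive claim extraction (an assumption): one lowercase claim per non-empty line."""
    return {line.strip().lower() for line in text.splitlines() if line.strip()}


def model_council(question: str, models: Dict[str, Model]) -> dict:
    """Collect one response per model and report shared versus unique claims."""
    if not models:
        raise ValueError("at least one model is required")
    responses = {name: model(question) for name, model in models.items()}
    claim_sets = {name: split_claims(text) for name, text in responses.items()}
    # Claims that every model produced.
    agreed = set.intersection(*claim_sets.values())
    # Claims each model produced that are not shared by all models.
    differ: Dict[str, List[str]] = {
        name: sorted(claims - agreed) for name, claims in claim_sets.items()
    }
    return {"responses": responses, "agree": sorted(agreed), "differ": differ}


# Stub models (assumptions for illustration only, not real API clients).
def stub_a(prompt: str) -> str:
    return "The sky is blue\nWater boils at 100 C at sea level"


def stub_b(prompt: str) -> str:
    return "the sky is blue\nWater freezes at 0 C at sea level"


report = model_council("Tell me two facts.", {"model_a": stub_a, "model_b": stub_b})
print(report["agree"])
print(report["differ"])
```

Exact string matching would miss paraphrased agreement, so a real system would presumably need semantic comparison; the sketch only shows the shape of the output, with responses kept side by side next to an agree/differ summary.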
Microsoft designed Researcher specifically for multi-step research tasks, distinguishing it from standard Copilot. Anthropic independently operates a Research feature using multiple Claude agents for similar purposes.
Availability and Context
Both Critique and Model Council are currently available only through Microsoft 365 Copilot's Frontier program, which serves as an early access testing ground for the company's AI features. No general availability date has been announced.
The move reflects a broader industry trend of combining complementary AI models to offset individual model weaknesses. Neither OpenAI nor Anthropic has publicly objected to this architecture, suggesting potential partnership or licensing arrangements, though details remain undisclosed.
What this means
Microsoft is positioning multi-model research workflows, rather than reliance on a single model, as standard enterprise practice. The approach is aimed at organizations that need high-confidence outputs on complex research tasks. Because no benchmark numbers were published, the claimed advantage over Perplexity's comparable offering cannot be independently verified. Availability remains gated to early access users, which suggests Microsoft is gathering feedback before a broader rollout.