performance
5 articles tagged with performance
GitHub Copilot CLI reduces unnecessary LLM handoffs through improved orchestration logic
GitHub has updated the orchestration logic in Copilot CLI to make it more selective about when to delegate tasks between language models. The changes reduce unnecessary handoffs and improve response times without introducing additional configuration settings.
Microsoft 365 Copilot gains 2x faster load times and progressive disclosure interface
Microsoft is rolling out a redesigned Microsoft 365 Copilot that loads twice as fast, according to the company. The update introduces "progressive disclosure" — showing tools and controls contextually based on prompts rather than displaying all options at once.
OpenAI investigating elevated latency across ChatGPT and API
OpenAI confirmed it is investigating elevated latency issues affecting both ChatGPT and its API as of May 27, 2026. The company is monitoring a separate issue for FEDRamp users, though a fix has been applied to that specific problem.
NVIDIA Releases Nemotron-Labs Diffusion Models With 6.4× Faster Token Generation Than Autoregressive Decoding
NVIDIA has released Nemotron-Labs Diffusion, a family of diffusion language models at 3B, 8B, and 14B scales that generate multiple tokens in parallel rather than one at a time. The 8B model achieves 6.4× higher tokens per forward pass than autoregressive models in self-speculation mode while maintaining comparable accuracy.
Anthropic reverts three system changes that degraded Claude Code performance in March and April
Anthropic confirmed three separate system changes in March and April degraded Claude Code, Claude Agent SDK, and Claude Cowork performance. The company reduced default reasoning effort from high to medium on March 4, introduced a caching bug on March 26 that cleared session data with every turn, and added restrictive word limits on April 16 that caused a 3% performance drop.