ai-research
7 articles tagged with ai-research
AI agent skills fail in real-world conditions, researchers find testing 34,000 skills
A large-scale study testing 34,198 real-world skills reveals that AI agent performance drops sharply when moving from curated benchmarks to realistic conditions. Claude Opus 4.6 saw pass rates fall from 55.4% with hand-selected skills to 38.4% in truly realistic scenarios, while weaker models like Kimi K2.5 actually performed below their no-skill baseline.
OpenAI's Brockman claims GPT reasoning models have 'line of sight' to AGI
OpenAI President Greg Brockman stated that GPT reasoning models have a 'line of sight' to AGI, calling the debate over whether text-based models can achieve general intelligence settled. The company is prioritizing this approach over multimodal world models like Sora, which Brockman views as 'a different branch of the tech tree.' The stance contradicts prominent AI researchers including Yann LeCun and Demis Hassabis, who argue LLMs alone are insufficient for human-level intelligence.
Meta's hyperagents learn to improve their own improvement mechanisms across multiple domains
Researchers at Meta, University of British Columbia, and partner institutions have developed hyperagents—AI systems that optimize both their task performance and the mechanisms controlling their self-improvement. Unlike previous self-improvement approaches locked to coding tasks, DGM-Hyperagents (DGM-H) demonstrate significant gains across four domains and can transfer improvement strategies to entirely new tasks.
Nvidia to spend $26B on open-weight AI models, filing reveals
Nvidia will invest $26 billion over the next five years to build open-weight AI models, according to a 2025 financial filing confirmed by executives. The move signals a strategic shift from chipmaker to AI frontier lab, with the company releasing Nemotron 3 Super (128B parameters) and claiming it outperforms GPT-OSS on multiple benchmarks.
Yann LeCun's AMI Labs raises $1.03B to develop world models
AMI Labs, cofounded by Turing Award winner Yann LeCun, has raised $1.03 billion at a $3.5 billion pre-money valuation. The funding will support the company's effort to develop world models, marking a major commitment to foundational AI research outside of the existing tech giants.
Apple Research Identifies 'Text-Speech Understanding Gap' Limiting LLM Speech Performance
Apple researchers have identified a fundamental limitation in speech-adapted large language models: they consistently underperform their text-based counterparts on language understanding tasks. The team terms this the 'text-speech understanding gap' and documents that speech-adapted LLMs lag behind both their original text versions and cascaded speech-to-text pipelines.
DeepMind proposes AI should assign humans busywork to maintain job skills
A new Google DeepMind research paper on AI agent delegation recommends that AI systems occasionally assign humans tasks they could easily complete themselves. The goal: preventing workforce skill atrophy as automation increases.