Breaking

ByteDance Open-Sources Bernini-R Video Diffusion Model With Semantic Planning Architecture

ByteDance released Bernini-R, an open-source video generation and editing model that combines an MLLM-based semantic planner with a DiT-based renderer. The model requires Hopper-class GPUs (H100/H800/H200) for optimal performance and supports multiple tasks including text-to-video, video editing, and reference-guided generation.

June 3, 2026

Latest News

All news →
0
model releaseMicrosoft

Microsoft releases MAI-Thinking-1 reasoning model at 35B parameters, MAI-Code-1-Flash for GitHub Copilot

Microsoft announced two new language models: MAI-Thinking-1, a 35B parameter reasoning model available to select early partners, and MAI-Code-1-Flash, a 5B parameter coding model rolling out to GitHub Copilot individual users in VS Code. Both models were trained on commercially licensed data without distillation from third-party models.

0
product updateOpenrouter

OpenRouter Launches Fusion: Multi-Model Consensus System That Runs Expert Panels in Parallel

OpenRouter has released Fusion, a multi-model routing system that processes prompts through parallel expert model panels with web search enabled, then uses a judge model to synthesize consensus, contradictions, and unique insights. Users pay the sum of all underlying model completions rather than a single model price.

0
product updateMicrosoft

Microsoft releases ASSERT, open-source framework for testing application-specific AI behavior using natural language

Microsoft released ASSERT (Adaptive Spec-driven Scoring for Evaluation and Regression Testing), an open-source framework that converts natural language descriptions of expected AI behavior into structured test cases. The tool addresses a gap in AI evaluation by testing application-specific behaviors that general benchmarks cannot capture.

0
model releaseMicrosoft

Microsoft launches MAI-Code-1 and MAI-Thinking-1 models to reduce OpenAI dependence

Microsoft announced two proprietary AI models at its Build developer conference: MAI-Code-1 for code generation and MAI-Thinking-1 for reasoning tasks. The models are designed to run on Azure infrastructure, allowing Microsoft to reduce costs from its $13 billion OpenAI investment while competing directly with Anthropic and Google.

2 min readvia cnbc.com
0
product update

Perplexity Computer adds hybrid inference to split tasks between local and cloud models

Perplexity announced that its Computer agentic system will gain hybrid inference in July 2026, automatically splitting tasks between local models for sensitive data and cloud-based frontier models for complex operations. The feature aims to balance privacy with computational power without requiring manual model selection.

0
model releaseNVIDIA

NVIDIA Releases Cosmos3-Super-Text2Image: 64B Parameter Model for Physical AI Applications

NVIDIA released Cosmos3-Super-Text2Image, a 64-billion parameter text-to-image generation model as part of its Cosmos3 collection of omnimodal world models. The model uses a Mixture-of-Transformers architecture combining autoregressive and diffusion transformers, designed for Physical AI applications including robotics and autonomous vehicles.

2 min readvia huggingface.co
0
product updateAmazon Web Services

AWS launches hyperparameter optimization guide for Amazon Nova Forge custom model training

AWS has published a technical guide on hyperparameter optimization for Amazon Nova Forge, its platform for building custom frontier models from Amazon Nova checkpoints. The guide addresses three core challenges: catastrophic forgetting during domain specialization, learning rate calibration when mixing proprietary and curated training data, and baseline performance constraints for reinforcement fine-tuning.

0
product updateAmazon Web Services

AWS demonstrates object detection using Amazon Nova 2 Lite multimodal model with no training required

AWS published a technical guide showing how Amazon Nova 2 Lite performs object detection through natural language prompts without requiring model training. The multimodal model returns bounding box coordinates in JSON format at $0.0003 per thousand input tokens and $0.0025 per thousand output tokens, with typical images costing approximately $0.00057 to process.

Latest Models

All →