Perplexity Computer adds hybrid inference to split tasks between local and cloud models
Perplexity announced that its Computer agentic system will gain hybrid inference in July 2026, automatically splitting tasks between local models for sensitive data and cloud-based frontier models for complex operations. The feature aims to balance privacy with computational power without requiring manual model selection.
Perplexity Computer adds hybrid inference to split tasks between local and cloud models
Perplexity announced that its Computer agentic system will gain the ability to automatically split tasks between on-device and cloud-based models in July 2026.
The hybrid inference feature allows Perplexity Computer to route different parts of a single task to appropriate models: sensitive data processing stays on local models running on the user's device, while complex operations requiring frontier model capabilities run on cloud servers.
How hybrid agentic inference works
According to Perplexity, a compact model running locally determines which data should remain on-device. The system then coordinates between local and cloud execution for different parts of the same task.
Perplexity positions this as solving a key problem with current AI systems: "Unlike tools that ask you to pick local or cloud up front, this happens on its own, task by task."
The company cites use cases involving financial records, health information, and personal files as scenarios where users need both privacy and powerful AI capabilities.
Technical details
Perplexity claims the hybrid approach "maximizes token efficiency" by routing work to appropriate compute resources. The announcement did not specify which local models will run on-device, what minimum hardware requirements exist, or which frontier models will handle cloud-side processing.
No pricing changes were announced for the feature, which will be available to Perplexity Computer subscribers.
Availability
The hybrid agentic inference capability will launch in July 2026 for Perplexity Computer on Mac. The company has not announced plans for other platforms.
Perplexity Computer, the company's agentic AI system, launched earlier in 2026 as a Mac-native platform for subscribers of Perplexity's paid service.
What this means
This represents a practical approach to the privacy-versus-capability tradeoff in AI applications. By automating the decision of what runs where, Perplexity aims to eliminate a manual step that many users lack the technical knowledge to handle. The success of this approach depends on the accuracy of the local model's routing decisions and the actual performance difference users experience compared to cloud-only execution. If the local routing logic proves reliable, this could become a standard architecture for privacy-sensitive AI applications.
Related Articles
Google's Gemini Spark AI agent uses personal data to plan trips, raising privacy concerns
Google's Gemini Spark, an AI agent rolling out to the company's $99/month AI Ultra plan, demonstrates advanced capabilities by mining users' Gmail, calendar, photos, and location data to create detailed trip itineraries. The agent can perform actions across apps and operate computers, though third-party services like Airbnb currently block its booking attempts.
Microsoft strips color from Copilot interface in pursuit of 'intelligence that feels present but not imposing'
Microsoft has rolled out a visual overhaul for Copilot in Microsoft 365, replacing the colorful interface with a predominantly black-and-white, text-forward design. The redesign, aimed at making the AI assistant feel "present but not imposing," includes a new adaptive prompt surface and consistent side panel placement across Word, PowerPoint, and Excel.
Microsoft releases ASSERT, open-source framework for testing application-specific AI behavior using natural language
Microsoft released ASSERT (Adaptive Spec-driven Scoring for Evaluation and Regression Testing), an open-source framework that converts natural language descriptions of expected AI behavior into structured test cases. The tool addresses a gap in AI evaluation by testing application-specific behaviors that general benchmarks cannot capture.
Google's Gemini Android overlay adds Dynamic Color theming and relocates Screen content capture
Google is rolling out interface updates to the Gemini overlay on Android. The overlay now supports Dynamic Color theming in version 17.28 of the Google app, with the Screen content capture feature relocated from the tools menu to the main carousel alongside Photos, Camera, Files, Drive, and Notebooks.
Comments
Loading...