OpenAI expands ChatGPT memory to free users, doubles storage capacity for paid tiers
OpenAI is rolling out an upgraded memory system for ChatGPT that synthesizes context more efficiently across conversations. The company reduced compute requirements by approximately 5x, enabling it to offer the memory feature to free users for the first time while doubling storage capacity for Plus and Pro subscribers.
OpenAI announced on June 4, 2026, that it is expanding ChatGPT's memory feature to free users for the first time while releasing an upgraded memory system for paid subscribers in the United States.
Compute efficiency gains enable free tier expansion
The company reduced the compute required to serve its "dreaming-based" memory system by approximately 5x, according to OpenAI's announcement. This efficiency improvement makes it practical to offer the feature to free users at scale.
Free users will receive access to the memory feature "over the coming weeks," according to OpenAI. The system automatically tracks important details and preferences across conversations, so users do not have to repeat context.
New memory architecture for paid users
Plus and Pro subscribers in the US began receiving access to a "more capable and compute-efficient memory architecture" on June 4. The upgraded system addresses what OpenAI describes as "staleness, correctness, and scalability challenges" when serving hundreds of millions of users over multi-year timeframes.
The new architecture includes a memory summary page where users can review what ChatGPT knows about them, add or update information, and provide instructions on which topics the system should reference and when.
OpenAI is also doubling memory storage capacity for Plus and Pro users, though specific storage limits were not disclosed.
Three focus areas for memory improvements
The upgraded system targets three specific capabilities:
- Carrying forward useful context across conversations
- Following preferences and constraints set by users
- Staying current with the passage of time
Users who prefer the previous saved memories experience can revert to the legacy system through settings.
What this means
The roughly 5x compute reduction is what makes the product expansion possible: free users gain a feature previously limited to paying customers, while paid tiers get double the storage capacity. The move also signals OpenAI's focus on long-term context as a competitive differentiator, particularly as memory becomes table stakes for conversational AI systems that hundreds of millions of people use. The US-only rollout of the new architecture to paid tiers suggests the company may be stress-testing it before a wider deployment.