GitHub will train Copilot models on user interaction data starting April 2026
GitHub will use Copilot interaction data from Free, Pro, and Pro+ plan users to train AI models starting April 24, 2026, unless users actively opt out. The policy does not affect Copilot Business and Enterprise customers. Data shared will include prompts, outputs, code snippets, filenames, and repository structures.
GitHub Will Train Copilot Models on User Interaction Data Starting April 2026
GitHub announced a significant change to its Copilot data policy effective April 24, 2026. Starting that date, interaction data from users on Free, Pro, and Pro+ plans will be used to train AI models unless users explicitly opt out.
What Data Will Be Collected
The data collection will include:
- User prompts and model outputs
- Code snippets
- Filenames
- Repository structures
- User feedback on model suggestions
Users who have previously opted out of data collection will retain their existing settings and will not be automatically enrolled.
Scope and Limitations
Copilot Business and Enterprise customers are exempt from this policy change. GitHub clarified that collected data can be shared with Microsoft but will not be shared with third-party AI model providers.
GitHub Chief Product Officer Mario Rodriguez stated that real-world usage data improves model quality. Internal testing with data from Microsoft employees already demonstrated higher acceptance rates for code suggestions, suggesting the approach yields measurable improvements.
Opt-Out Process
Users who wish to prevent their data from being used for training can opt out through Copilot settings under the "Privacy" section. GitHub indicated that more details are available on the GitHub blog.
What This Means
This policy represents a shift toward extracting training value from Copilot's large user base of developers. The limitation to Free, Pro, and Pro+ plans—while exempting Enterprise customers—suggests GitHub is balancing competitive advantage with enterprise customer expectations around data usage. The explicit opt-out structure (rather than opt-in) will likely result in significantly higher participation rates unless developers proactively change settings. The exclusion of third-party providers indicates Microsoft intends to keep this training data advantage internal rather than licensing it broadly, positioning GitHub Copilot improvements as a Microsoft-exclusive benefit.
Related Articles
Amazon Bedrock adds three video analysis workflows for multimodal understanding at scale
Amazon Bedrock has introduced three distinct video analysis workflows that leverage multimodal foundation models to extract insights from video content at scale. The approaches—frame-based, shot-based, and multimodal embedding—are designed for different use cases and cost-performance trade-offs, with open-source reference implementations available on GitHub.
Google's Gemini app now creates 3-minute songs with Lyria 3 Pro
Google announced Lyria 3 Pro, expanding the Gemini app's music generation capability from 30-second tracks to full 3-minute songs. The model improves structural understanding of musical composition, allowing users to prompt for specific elements like intros, verses, choruses, and bridges. Available now for Gemini subscribers with tier-based daily limits (10-50 tracks/day) and in Vertex AI, Google AI Studio, and the Gemini API for developers.
Google DeepMind launches Lyria 3 Pro with 3-minute track generation and structural awareness
Google DeepMind introduced Lyria 3 Pro, an advanced music generation model capable of creating tracks up to 3 minutes long with structural awareness of musical composition elements like intros, verses, choruses, and bridges. The model is rolling out across multiple Google products including Vertex AI, Google Vids, Gemini app, and the new ProducerAI collaborative tool.
Anthropic launches 'safer' auto mode for Claude Code to prevent unintended autonomous actions
Anthropic has launched an auto mode for Claude Code that blocks potentially dangerous autonomous actions before execution. The feature, now available as a research preview for Team plan users, acts as a middle ground between constant user oversight and unrestricted agent autonomy.
Comments
Loading...