GitHub will train Copilot models on user interaction data starting April 2026
GitHub will use Copilot interaction data from Free, Pro, and Pro+ plan users to train AI models starting April 24, 2026, unless users actively opt out. The policy does not affect Copilot Business and Enterprise customers. Data shared will include prompts, outputs, code snippets, filenames, and repository structures.
GitHub Will Train Copilot Models on User Interaction Data Starting April 2026
GitHub announced a significant change to its Copilot data policy effective April 24, 2026. Starting that date, interaction data from users on Free, Pro, and Pro+ plans will be used to train AI models unless users explicitly opt out.
What Data Will Be Collected
The data collection will include:
- User prompts and model outputs
- Code snippets
- Filenames
- Repository structures
- User feedback on model suggestions
Users who have previously opted out of data collection will retain their existing settings and will not be automatically enrolled.
Scope and Limitations
Copilot Business and Enterprise customers are exempt from this policy change. GitHub clarified that collected data can be shared with Microsoft but will not be shared with third-party AI model providers.
GitHub Chief Product Officer Mario Rodriguez stated that real-world usage data improves model quality. Internal testing with data from Microsoft employees already demonstrated higher acceptance rates for code suggestions, suggesting the approach yields measurable improvements.
Opt-Out Process
Users who wish to prevent their data from being used for training can opt out through Copilot settings under the "Privacy" section. GitHub indicated that more details are available on the GitHub blog.
What This Means
This policy represents a shift toward extracting training value from Copilot's large user base of developers. The limitation to Free, Pro, and Pro+ plans—while exempting Enterprise customers—suggests GitHub is balancing competitive advantage with enterprise customer expectations around data usage. The explicit opt-out structure (rather than opt-in) will likely result in significantly higher participation rates unless developers proactively change settings. The exclusion of third-party providers indicates Microsoft intends to keep this training data advantage internal rather than licensing it broadly, positioning GitHub Copilot improvements as a Microsoft-exclusive benefit.
Related Articles
GitHub Copilot CLI Gets Redesigned Terminal Interface in General Availability
GitHub has released the redesigned terminal interface for GitHub Copilot CLI to general availability. The update, previewed at Microsoft Build 2026, introduces a tabbed layout for working with GitHub directly from the command line.
GitHub details Qubot, internal Copilot-powered data analytics agent for plain language queries
GitHub has released technical details on Qubot, an internal analytics agent powered by GitHub Copilot that enables employees to query company data using natural language. The agent represents GitHub's implementation of AI-assisted data analysis for internal operations.
GitHub built Qubot, an internal data analytics agent using Copilot to query company data in natural language
GitHub has built Qubot, an internal analytics agent powered by GitHub Copilot that allows employees to query company data using natural language. The project represents GitHub's approach to building domain-specific AI agents for data analysis tasks.
Google expands Gemini Android overlay menu with six new tools accessible without opening app
Google has expanded the Gemini overlay plus menu on Android to include six tools: Videos, Music, Canvas, and Guided Learning join the existing Images and Personal Intelligence options. The update, rolling out in Google app version 17.32, allows users to access most Gemini features from anywhere on Android without opening the full app.
Comments
Loading...