product updateGitHub

GitHub will train Copilot models on user interaction data starting April 2026

TL;DR

GitHub will use Copilot interaction data from Free, Pro, and Pro+ plan users to train AI models starting April 24, 2026, unless users actively opt out. The policy does not affect Copilot Business and Enterprise customers. Data shared will include prompts, outputs, code snippets, filenames, and repository structures.

2 min read
0

GitHub Will Train Copilot Models on User Interaction Data Starting April 2026

GitHub announced a significant change to its Copilot data policy effective April 24, 2026. Starting that date, interaction data from users on Free, Pro, and Pro+ plans will be used to train AI models unless users explicitly opt out.

What Data Will Be Collected

The data collection will include:

  • User prompts and model outputs
  • Code snippets
  • Filenames
  • Repository structures
  • User feedback on model suggestions

Users who have previously opted out of data collection will retain their existing settings and will not be automatically enrolled.

Scope and Limitations

Copilot Business and Enterprise customers are exempt from this policy change. GitHub clarified that collected data can be shared with Microsoft but will not be shared with third-party AI model providers.

GitHub Chief Product Officer Mario Rodriguez stated that real-world usage data improves model quality. Internal testing with data from Microsoft employees already demonstrated higher acceptance rates for code suggestions, suggesting the approach yields measurable improvements.

Opt-Out Process

Users who wish to prevent their data from being used for training can opt out through Copilot settings under the "Privacy" section. GitHub indicated that more details are available on the GitHub blog.

What This Means

This policy represents a shift toward extracting training value from Copilot's large user base of developers. The limitation to Free, Pro, and Pro+ plans—while exempting Enterprise customers—suggests GitHub is balancing competitive advantage with enterprise customer expectations around data usage. The explicit opt-out structure (rather than opt-in) will likely result in significantly higher participation rates unless developers proactively change settings. The exclusion of third-party providers indicates Microsoft intends to keep this training data advantage internal rather than licensing it broadly, positioning GitHub Copilot improvements as a Microsoft-exclusive benefit.

Related Articles

product update

Amazon Bedrock adds three video analysis workflows for multimodal understanding at scale

Amazon Bedrock has introduced three distinct video analysis workflows that leverage multimodal foundation models to extract insights from video content at scale. The approaches—frame-based, shot-based, and multimodal embedding—are designed for different use cases and cost-performance trade-offs, with open-source reference implementations available on GitHub.

product update

Google's Gemini app now creates 3-minute songs with Lyria 3 Pro

Google announced Lyria 3 Pro, expanding the Gemini app's music generation capability from 30-second tracks to full 3-minute songs. The model improves structural understanding of musical composition, allowing users to prompt for specific elements like intros, verses, choruses, and bridges. Available now for Gemini subscribers with tier-based daily limits (10-50 tracks/day) and in Vertex AI, Google AI Studio, and the Gemini API for developers.

product update

Google DeepMind launches Lyria 3 Pro with 3-minute track generation and structural awareness

Google DeepMind introduced Lyria 3 Pro, an advanced music generation model capable of creating tracks up to 3 minutes long with structural awareness of musical composition elements like intros, verses, choruses, and bridges. The model is rolling out across multiple Google products including Vertex AI, Google Vids, Gemini app, and the new ProducerAI collaborative tool.

product update

Anthropic launches 'safer' auto mode for Claude Code to prevent unintended autonomous actions

Anthropic has launched an auto mode for Claude Code that blocks potentially dangerous autonomous actions before execution. The feature, now available as a research preview for Team plan users, acts as a middle ground between constant user oversight and unrestricted agent autonomy.

Comments

Loading...