OpenAI Releases Privacy Filter: 1.5B-Parameter On-Premises PII Detection Model with 128K Context
OpenAI has released Privacy Filter, a 1.5B-parameter bidirectional token classification model designed for detecting and masking personally identifiable information in text. The model processes 128,000-token sequences in a single pass and is available under Apache 2.0 license for on-premises deployment.
OpenAI Privacy Filter — Quick Specs
OpenAI Releases Privacy Filter: 1.5B-Parameter On-Premises PII Detection Model with 128K Context
OpenAI has released Privacy Filter, a 1.5B-parameter bidirectional token classification model designed for detecting and masking personally identifiable information (PII) in text. The model is available under Apache 2.0 license and can run on-premises, including in web browsers.
Technical Architecture
Privacy Filter uses 1.5 billion total parameters with 50 million active parameters through a sparse mixture-of-experts architecture. According to OpenAI, the model was first pretrained autoregressively similar to GPT-OSS, then converted into a bidirectional token classifier with supervised classification training.
The architecture includes:
- 8 transformer blocks with grouped-query attention (14 query heads, 2 KV heads)
- Sparse mixture-of-experts feed-forward blocks (128 experts total, top-4 routing)
- Bidirectional banded attention with band size 128 (effective 257-token attention window)
- 640-dimensional residual stream width
Unlike autoregressive models that generate text token-by-token, Privacy Filter labels entire input sequences in one forward pass, then decodes coherent spans using a constrained Viterbi procedure.
Detection Capabilities
The model detects 8 privacy categories:
- Account numbers
- Private addresses
- Private emails
- Private persons (names)
- Private phone numbers
- Private URLs
- Private dates
- Secrets
For token-level classification, each category expands into BIOES (Begin, Inside, Outside, End, Single) boundary tags, producing 33 total output classes per token.
Context and Performance
Privacy Filter supports a 128,000-token context window, enabling processing of long documents without chunking. The model includes runtime controls for configuring precision-recall tradeoffs through adjustable operating points that modify span detection aggressiveness.
The sequence decoder uses six transition-bias parameters controlling background persistence, span entry, continuation, closure, and boundary handoff to produce coherent span boundaries rather than per-token independent predictions.
Deployment Options
OpenAI states the model can run in web browsers via WebGPU using Transformers.js with quantization (q4), or on laptops and on-premises infrastructure. The model is available through Hugging Face Transformers with standard pipeline API support.
The Apache 2.0 license permits commercial deployment and fine-tuning on specific data distributions.
Limitations Disclosed
OpenAI explicitly states Privacy Filter is "not an anonymization, compliance, or a safety guarantee" and warns against over-reliance. The model only identifies PII matching its trained taxonomy of 8 categories, which may not cover all privacy use cases or regulatory requirements. OpenAI recommends using it as one layer in a broader privacy-by-design approach rather than a standalone solution.
What This Means
This release addresses a specific enterprise need: fast, on-premises PII detection for data sanitization workflows where cloud APIs are unsuitable due to data residency or throughput requirements. The 128K context window and single-pass labeling design prioritize throughput over the iterative accuracy of larger models. The Apache 2.0 license and small parameter count make it accessible for fine-tuning on domain-specific PII patterns, though organizations must validate it meets their specific privacy requirements rather than treating it as a compliance checkbox.
Related Articles
OpenAI launches Lockdown Mode to block prompt injection data exfiltration attacks
OpenAI has released Lockdown Mode, an optional security setting that protects against prompt injection attacks by limiting network requests and image fetching in ChatGPT. The feature is designed for users handling sensitive data and disables some ChatGPT capabilities including Deep Research and Agent Mode.
OpenAI upgrades ChatGPT memory architecture with automatic 'dreaming' synthesis, now available to free users
OpenAI is rolling out a new memory architecture for ChatGPT that automatically synthesizes information across conversations without explicit user prompts. The company announced free tier users will access memory features for the first time, while Plus and Pro users receive expanded memory capacity.
LM Studio adds iPhone-to-Mac connection for running local LLMs via Locally app
LM Studio has released LM Link, a feature connecting its Mac app with the Locally AI iOS app to let users access local LLMs running on their Macs from their iPhones. The connection uses end-to-end encryption via Tailscale mesh VPNs and will be free during the preview period.
OpenAI expands ChatGPT memory to free users, doubles storage capacity for paid tiers
OpenAI is rolling out an upgraded memory system for ChatGPT that synthesizes context more efficiently across conversations. The company reduced compute requirements by approximately 5x, enabling it to offer the memory feature to free users for the first time while doubling storage capacity for Plus and Pro subscribers.
Comments
Loading...