Meta releases SAM 3.1, adding 7x faster multi-object tracking to vision foundation model
Meta has released SAM 3.1, an update to its Segment Anything Model that adds Object Multiplex, a shared-memory approach for joint multi-object tracking. The new version achieves approximately 7x faster inference when tracking 128 objects on a single H100 GPU while improving video object segmentation (VOS) performance on 6 out of 7 benchmarks.
Meta Releases SAM 3.1 with 7x Faster Multi-Object Tracking
Meta has released SAM 3.1, an updated version of its Segment Anything Model that introduces Object Multiplex, a shared-memory architecture designed for efficient joint multi-object tracking in videos.
Key Improvements
SAM 3.1 builds on SAM 3, Meta's unified foundation model for promptable segmentation that handles both images and videos. The new version maintains backward compatibility with SAM 3's core capabilities—detecting, segmenting, and tracking objects via text or visual prompts (points, boxes, masks)—while adding significant performance gains for tracking scenarios.
The Object Multiplex approach delivers approximately 7x faster inference when tracking 128 objects on a single H100 GPU, according to Meta's claims. The company states this speedup occurs without sacrificing accuracy. Additionally, SAM 3.1 shows improved video object segmentation (VOS) performance across 6 out of 7 benchmarks tested.
Concept Vocabulary Scale
SAM 3's underlying capability to handle open-vocabulary concepts remains central to SAM 3.1. The model can exhaustively segment all instances of concepts specified via short text phrases, handling over 50x more unique concepts than existing benchmarks, according to Meta.
Deployment Details
The SAM 3.1 model checkpoints are available on Hugging Face at facebook/sam3.1. Meta notes that there is no Hugging Face Transformers integration for this release. Users must access the SAM 3 GitHub repository for installation instructions, code examples, and full documentation. The model requires users to share contact information via Meta's privacy framework before access.
As of the latest data, the model has recorded 1,865 downloads in the previous month on Hugging Face. Currently, no inference providers have deployed SAM 3.1, meaning users must run the model locally or on their own infrastructure.
What This Means
SAM 3.1 positions Meta's segmentation foundation model for production use cases in video understanding and multi-object tracking—domains where inference speed directly impacts real-world feasibility. The 7x speedup at scale (128 objects) suggests the model targets enterprise applications in video analysis, autonomous systems, and robotics rather than single-object or lightweight use cases. The lack of inference provider deployment currently limits accessibility to developers with GPU infrastructure, though the open availability of checkpoints aligns with Meta's broader strategy of releasing foundational models for research and production adoption.
Related Articles
DeepSeek Releases V4-Pro with 1.6T Parameters, 1M Token Context at 27% Inference Cost of V3
DeepSeek has released two Mixture-of-Experts models: V4-Pro with 1.6 trillion parameters (49B activated) and V4-Flash with 284B parameters (13B activated), both supporting 1 million token context windows. V4-Pro requires only 27% of inference FLOPs and 10% of KV cache compared to V3.2 at 1M token context, trained on over 32 trillion tokens.
Anthropic's Fable 5 model expected to return next week after 15-day government shutdown
The Trump administration is close to allowing Anthropic to restore access to its Fable 5 model, which has been offline for 15 days due to national security concerns. Insiders expect restrictions could be lifted as soon as next week, though Pentagon and NSA approval is still required.
OpenAI previews GPT-5.6 to select partners with three variants priced from $1 to $30 per million tokens
OpenAI has begun previewing its GPT-5.6 series to a limited group of trusted partners after government review. The release includes three variants: Sol at $5 input/$30 output per million tokens, Terra at $2.50/$15, and Luna at $1/$6.
US government allows Anthropic to release Claude Mythos 5 to 100+ institutions after two-week export control block
The US Commerce Department has partially lifted export controls on Anthropic's Claude Mythos 5 model, permitting its release to over 100 US institutions including major companies and government agencies. The restrictions, imposed two weeks ago alongside a block on Claude Fable 5, reportedly stemmed from concerns about potential jailbreaks and Chinese access.
Comments
Loading...