Llama 4 Scout

Meta AI🇺🇸 United States
active

First Llama 4 model with 10M token context and native multimodal support.

Context window10000K tokens
Input / 1M tokens$0.17
Output / 1M tokens$0.66

Version History

Llama-4-Scout-17B-16E-0921patch

Llama 4 Scout patch fixing multimodal tokenization issues and improving throughput for long-context document tasks.

Llama-4-Scout-17B-16Emajor

First MoE Llama with 10M token context and native image and video understanding.

Benchmark Scores

Full leaderboard →
94.4%
DocVQA
88.7%
MMLU
160.0 tokens_per_sec
Speed (tok/s)