Llama 4 Scout
Meta AI🇺🇸 United States
First Llama 4 model with 10M token context and native multimodal support.
Context window10000K tokens
Input / 1M tokens$0.17
Output / 1M tokens$0.66
Version History
Llama-4-Scout-17B-16E-0921patch
Llama 4 Scout patch fixing multimodal tokenization issues and improving throughput for long-context document tasks.
Llama-4-Scout-17B-16Emajor
First MoE Llama with 10M token context and native image and video understanding.
Benchmark Scores
Full leaderboard →94.4%
DocVQA
88.7%
MMLU
160.0 tokens_per_sec
Speed (tok/s)