TPSTokens Per Second
NewsModelsIDEsRankingsBenchmarksSecurity

Navigate

NewsModelsIDEsRankingsChangelogBenchmarksSecurityModel QuizWhat I MissedHistorySaved

Companies

AnthropicOpenAIDeepMindMeta AIDeepSeekxAIMistral AIPerplexity AI

Meta AI

Meta's AI division, maker of Llama

https://ai.meta.com →

News

No articles yet.

Models

Llama 4 Maverick

Meta AI

active

Llama 4's multimodal MoE model with 1M context. Matches GPT-4o on most benchmarks.

Context1000K
Input/1M$0.27

Apr 5, 2025

Open weights

Llama 4 Scout

Meta AI

active

First Llama 4 model with 10M token context and native multimodal support.

Context10000K
Input/1M$0.17

Apr 5, 2025

Open weights

Llama 3.3 70B

Meta AI

active

Meta's open-weights 70B model matching Llama 3.1 405B quality at lower cost.

Context128K
Input/1M$0.58

Dec 6, 2024

Open weights

Llama 3.2 11B

Meta AI

deprecated

Lightweight multimodal Llama. Efficient vision + text on-device model.

Context128K

Sep 25, 2024

Open weights

Llama 3.2 90B

Meta AI

deprecated

Multimodal Llama with vision. First Llama model with image understanding.

Context128K

Sep 25, 2024

Open weights

Llama 3.2 3B Instruct

Meta AI

active
Context131K
0

Sep 25, 2024

Llama 3.1 405B

Meta AI

deprecated

Meta's largest open-weights model. 405B parameters, instruction-tuned.

Context128K

Jul 23, 2024

Open weights

Top Benchmark Scores

Full leaderboard →

DocVQA

Llama 4 Maverick
94.4%

GPQA

Llama 4 Maverick
69.8%

HumanEval

Llama 4 Maverick
88.4%

MATH

Llama 4 Maverick
88.9%

MMLU

Llama 4 Maverick
91.4%

MMMU

Llama 4 Maverick
73.5%

Speed (tok/s)

Llama 4 Scout
160 tokens_per_sec

SWE-bench Verified

Llama 4 Maverick
51.2%
TPS

Tokens Per Second. The fastest LLM news on the internet — tracked automatically every 15 minutes.

Coverage

  • Latest News
  • Model Database
  • AI IDEs
  • Compare IDEs
  • Changelog
  • Benchmarks
  • Compare Models

Companies

  • Anthropic
  • OpenAI
  • Google DeepMind
  • Meta AI
  • DeepSeek
  • xAI
  • Mistral AI
  • Perplexity AI

Guides

  • All Rankings
  • Best by Use Case
  • Best Coding LLM
  • Best Cheap LLM
  • Best Reasoning LLM
  • Best AI IDE
  • Compare Models
  • Compare IDEs
  • About TPS
  • RSS Feed
  • Atom Feed

© 2026 TPS — Tokens Per Second.

The fastest LLM news. All signal, no noise.