Yi-Lightning

01.AI
active

A fast mixture-of-experts (MoE) model from 01.AI, served via API, with 40% faster inference than prior Yi models. Aimed at high-throughput workloads.

Context window: 16K tokens
Input / 1M tokens: $0.14
Output / 1M tokens: $0.14
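
As a rough illustration of how the listed prices translate into per-request cost, the sketch below applies the $0.14 per 1M token rates to a given token count. The function names are hypothetical and not part of any 01.AI SDK. Two assumptions are made: "16K" is read as 16,000 tokens (the actual limit may be 16,384), and input and output tokens are assumed to share the context window.

```python
# Prices taken from the listing, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.14
OUTPUT_PRICE_PER_M = 0.14

# Assumption: "16K" interpreted as 16,000 tokens; the real limit may be 16,384.
CONTEXT_WINDOW = 16_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the context window.

    Assumption: input and output tokens both count toward the window.
    """
    return input_tokens + output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    # Example: a 12,000-token prompt with a 2,000-token completion.
    print(f"${estimate_cost(12_000, 2_000):.6f}")
    print(fits_context(12_000, 2_000))
```

Because input and output are priced identically here, the cost depends only on the total token count; the two rates are kept separate so the sketch still works if they diverge.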