Yi-Lightning
01.AI01.AI fast MoE API model with 40% better inference speed than prior Yi models for high-throughput workloads.
Context window16K tokens
Input / 1M tokens$0.14
Output / 1M tokens$0.14
01.AI fast MoE API model with 40% better inference speed than prior Yi models for high-throughput workloads.