Version History

120B-A6B (major)

Leanstral was released as a 120B-parameter agent for formal code verification using Lean, with open weights under the Apache 2.0 license and a free API endpoint. Mistral claims it outperforms larger open-source models and delivers 85% cost savings versus Claude Sonnet on the FLTEval benchmark.

Coverage

analysis

Mistral's Leanstral code verification agent outperforms Claude Sonnet at 15% of the cost

Mistral has released Leanstral, a 120B-parameter agent for verifying code with the Lean programming language, claiming that it outperforms larger open-source models and offers significant cost advantages over Anthropic's Claude models. According to Mistral's figures, the model achieves a pass@2 score of 26.3, beating Claude Sonnet by 2.6 points, while costing $36 to run compared with Sonnet's $549.
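The comparison above reduces to simple arithmetic, which can be sketched as follows. This is only a back-of-the-envelope check on the four numbers quoted in the paragraph. The variable names are illustrative, and the sketch assumes both dollar figures cover the same benchmark run.

```python
# Figures quoted in the paragraph above; variable names are illustrative.
leanstral_pass_at_2 = 26.3   # reported pass@2 score
margin_over_sonnet = 2.6     # reported lead over Claude Sonnet, in points
leanstral_cost_usd = 36.0    # reported cost to run Leanstral
sonnet_cost_usd = 549.0      # reported cost to run Claude Sonnet

# Sonnet's score implied by the reported margin (rounded to one decimal).
sonnet_pass_at_2 = round(leanstral_pass_at_2 - margin_over_sonnet, 1)

# Leanstral's cost as a fraction of Sonnet's, and the implied savings.
cost_ratio = leanstral_cost_usd / sonnet_cost_usd
savings = 1.0 - cost_ratio

print(f"Implied Sonnet pass@2: {sonnet_pass_at_2}")
print(f"Leanstral cost as share of Sonnet: {cost_ratio:.1%}")
print(f"Implied savings: {savings:.1%}")
```

On these two dollar figures alone, the cost share comes to roughly 6.6% (about 93.4% savings). The 85% savings and "15% of the cost" figures quoted elsewhere on this page therefore presumably rest on a different cost basis or comparison that the summary does not spell out.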
