benchmark
Arcada Labs benchmark tests five AI models as autonomous X agents
Arcada Labs, an AI benchmarking startup, has created a new benchmark that pits five leading AI models against each other as autonomous social media agents on X. The test measures how well different models can operate independently on the platform.