This is a list of AI benchmarks I’m watching. Last updated March 12, 2026.
- ARC Prize Leaderboard
- Artificial Analysis LLM Leaderboard
- Bullshit Benchmark Explorer
- Design Arena Leaderboard
- EQ-Bench 3
- EQ-Bench Creative Writing
- FutureSearch Benchmarks
- Kotlin-bench Leaderboard
- lechmazur Repositories
- LiveBench
- LMArena
- Mercor APEX
- OpenRouter LLM Rankings
- SimpleBench
- SWE-bench
- Terminal-Bench
- Vending-Bench 2
- Vending-Bench Arena
- VoxelBench
- WeirdML
- Yupp Leaderboard