A ranking of models specifically for complex reasoning tasks — math, logic, multi-step problem solving, and strategic thinking.
“Excellent general reasoning with much faster response times than o3.”
“Google's 'thinking' mode is surprisingly competitive. Great for scientific reasoning.”
“Open-source reasoning powerhouse. The chain-of-thought traces are educational.”