What's more, they show a counter-intuitive scaling limit: their reasoning hard work improves with difficulty complexity up to some extent, then declines Regardless of owning an sufficient token price range. By comparing LRMs with their typical LLM counterparts underneath equivalent inference compute, we establish 3 overall performance regimes: (1) https://www.youtube.com/watch?v=snr3is5MTiU