Existing long-CoT reasoning models have achieved state-of-the-art performance in mathematical reasoning by generating reasoning trajectories with iterative self-verification and refinement. However, open-source long-CoT models…
DualDistill and Agentic-R1: How AI Combines Natural Language and Tool Use for Superior Math Problem Solving
