Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective • Paper • 2501.11110 • Published Jan 19 • 4
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning • Paper • 2505.21668 • Published May 27 • 2
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models • Paper • 2404.02575 • Published Apr 3, 2024 • 50