Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published Jul 14 • 70
Self-Training Elicits Concise Reasoning in Large Language Models Paper • 2502.20122 • Published Feb 27 • 4