Large Reasoning Models Learn Better Alignment from Flawed Thinking Paper • 2510.00938 • Published 25 days ago • 56
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT Paper • 2509.19284 • Published Sep 23 • 22
Learning to Reason as Action Abstractions with Scalable Mid-Training RL Paper • 2509.25810 • Published 27 days ago • 5