The Amazing Agent Race: Strong Tool Users, Weak Navigators Paper • 2604.10261 • Published 9 days ago • 7
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 13 days ago • 100
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 24 days ago • 489
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 18 days ago • 321
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published Mar 17 • 51
Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning Paper • 2604.01152 • Published 25 days ago • 5
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published Mar 17 • 308