Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model Paper • 2510.18855 • Published 13 days ago • 62 • 3
AgentFold: Long-Horizon Web Agents with Proactive Context Management Paper • 2510.24699 • Published 6 days ago • 63 • 4
Multi-Agent Evolve: LLM Self-Improve through Co-evolution Paper • 2510.23595 • Published 7 days ago • 8 • 2
Robust Layerwise Scaling Rules by Proper Weight Decay Tuning Paper • 2510.15262 • Published 17 days ago • 5 • 3
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation Paper • 2510.15624 • Published 17 days ago • 14 • 5
Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition Paper • 2510.15280 • Published 17 days ago • 14 • 4
A²FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning Paper • 2510.12838 • Published 21 days ago • 22 • 3
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning Paper • 2510.15444 • Published 17 days ago • 144 • 6
Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers Paper • 2509.24317 • Published Sep 29 • 8 • 2
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources Paper • 2509.21268 • Published Sep 25 • 101 • 3
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System Paper • 2509.18091 • Published Sep 22 • 33 • 3
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization Paper • 2509.13313 • Published Sep 16 • 78 • 5