RoboOmni: Proactive Robot Manipulation in Omni-modal Context Paper • 2510.23763 • Published 4 days ago • 52
PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning Paper • 2510.13809 • Published 16 days ago • 36
LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models Paper • 2510.13626 • Published 16 days ago • 43
MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance Paper • 2510.00499 • Published about 1 month ago • 18
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data Paper • 2509.15221 • Published Sep 18 • 109
WideSearch: Benchmarking Agentic Broad Info-Seeking Paper • 2508.07999 • Published Aug 11 • 109
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning Paper • 2506.23127 • Published Jun 29 • 1