PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs Paper โข 2510.09507 โข Published 24 days ago โข 10
Visual Representation Alignment for Multimodal Large Language Models Paper โข 2509.07979 โข Published Sep 9 โข 83
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model Paper โข 2509.00676 โข Published Aug 31 โข 83
A Survey of Reinforcement Learning for Large Reasoning Models Paper โข 2509.08827 โข Published Sep 10 โข 186
MolmoAct: Action Reasoning Models that can Reason in Space Paper โข 2508.07917 โข Published Aug 11 โข 43
Emerging Properties in Unified Multimodal Pretraining Paper โข 2505.14683 โข Published May 20 โข 134