VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation • Paper • 2510.14902 • Published 19 days ago • 13
Diffusion Transformers with Representation Autoencoders • Paper • 2510.11690 • Published 22 days ago • 160
OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction • Paper • 2509.26633 • Published Sep 30 • 5
Less is More: Recursive Reasoning with Tiny Networks • Paper • 2510.04871 • Published 29 days ago • 463
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer • Paper • 2509.24695 • Published Sep 29 • 43