MotionVLA: Vision-Language-Action Model for Humanoid Motion Paper • 2606.15142 • Published 18 days ago • 5
Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models Paper • 2606.03988 • Published 28 days ago • 126
electricsheepasia/asia-owid-annual-working-hours-per-worker Viewer • Updated 29 days ago • 1.44k • 71 • 1
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation Paper • 2605.25874 • Published May 25 • 103
SAM 3D Animal: Promptable Animal 3D Reconstruction from Images in the Wild Paper • 2605.07604 • Published May 8 • 4
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published May 20 • 207
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Paper • 2605.14386 • Published May 14 • 63
SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution Paper • 2605.18401 • Published May 18 • 130
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published May 13 • 274
Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation Paper • 2605.04128 • Published May 5 • 17