WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces Paper • 2606.09426 • Published 13 days ago • 101
WorldOlympiad: Can Your World Model Survive a Triathlon? Paper • 2606.11129 • Published 12 days ago • 31
MMAE: A Massive Multitask Audio Editing Benchmark Paper • 2606.07229 • Published 16 days ago • 44
Cosmos 3: Omnimodal World Models for Physical AI Paper • 2606.02800 • Published 20 days ago • 129
Streaming Communication in Multi-Agent Reasoning Paper • 2606.05158 • Published 18 days ago • 30
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 26 days ago • 143
TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction Paper • 2605.26115 • Published 27 days ago • 52
FlashAR: Efficient Post-Training Acceleration for Autoregressive Image Generation Paper • 2605.09430 • Published May 10