FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution Paper • 2510.12747 • Published Oct 14 • 37
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning Paper • 2510.08555 • Published Oct 9 • 63
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper • 2508.10881 • Published Aug 14 • 52
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training Paper • 2508.00414 • Published Aug 1 • 91
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Paper • 2507.22827 • Published Jul 30 • 98
From One to More: Contextual Part Latents for 3D Generation Paper • 2507.08772 • Published Jul 11 • 25
4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture Paper • 2507.05163 • Published Jul 7 • 41
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning Paper • 2410.21845 • Published Oct 29, 2024 • 15
Learning Humanoid Standing-up Control across Diverse Postures Paper • 2502.08378 • Published Feb 12 • 1
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction Paper • 2402.16174 • Published Feb 25, 2024 • 1
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scenes Paper • 2505.20294 • Published May 26 • 4