Running on Zero Featured 937 MMAudio — generating synchronized audio from video/text 🔊 937 Generate synchronized audio from video or text prompts
ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks Paper • 2603.27862 • Published 10 days ago • 30
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation Paper • 2603.23500 • Published 15 days ago • 35
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 17 days ago • 121
VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction Paper • 2602.13294 • Published Feb 9 • 13
VideoMaMa: Mask-Guided Video Matting via Generative Prior Paper • 2601.14255 • Published Jan 20 • 15
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation Paper • 2512.17012 • Published Dec 18, 2025 • 47