InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published 17 days ago • 95
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation Paper • 2512.17040 • Published 17 days ago • 27
Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure Paper • 2512.14336 • Published 20 days ago • 28
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 27 days ago • 116
ACG: Action Coherence Guidance for Flow-based VLA models Paper • 2510.22201 • Published Oct 25, 2025 • 36
DesignLab: Designing Slides Through Iterative Detection and Correction Paper • 2507.17202 • Published Jul 23, 2025 • 50
Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models Paper • 2506.00996 • Published Jun 1, 2025 • 39
SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation Paper • 2504.14396 • Published Apr 19, 2025 • 27
Scaling Up Personalized Aesthetic Assessment via Task Vector Customization Paper • 2407.07176 • Published Jul 9, 2024 • 6
Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model Paper • 2309.03550 • Published Sep 7, 2023 • 12
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis Paper • 2308.08157 • Published Aug 16, 2023 • 2