zyf515730395's Collections
Video Generation
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 105
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Paper • 2506.08009 • Published • 30
Seeing Voices: Generating A-Roll Video from Audio with Mirage
Paper • 2506.08279 • Published • 27
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Paper • 2506.07848 • Published • 4
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
Paper • 2506.05301 • Published • 58
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers
Paper • 2506.00830 • Published • 7
Video World Models with Long-term Spatial Memory
Paper • 2506.05284 • Published • 55
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation
Paper • 2506.04225 • Published • 28
IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation
Paper • 2506.03150 • Published • 21
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
Paper • 2504.08685 • Published • 130
Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation
Paper • 2503.24379 • Published • 76
Seedream 3.0 Technical Report
Paper • 2504.11346 • Published • 70
JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization
Paper • 2503.23377 • Published • 57
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
Paper • 2504.02542 • Published • 51
SkyReels-A2: Compose Anything in Video Diffusion Transformers
Paper • 2504.02436 • Published • 39
Long-Context Autoregressive Video Modeling with Next-Frame Prediction
Paper • 2503.19325 • Published • 73
Wan: Open and Advanced Large-Scale Video Generative Models
Paper • 2503.20314 • Published • 56
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Paper • 2503.09151 • Published • 32
ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs
Paper • 2506.18792 • Published • 30
VMoBA: Mixture-of-Block Attention for Video Diffusion Models
Paper • 2506.23858 • Published • 31
Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation
Paper • 2507.05963 • Published • 12
StreamDiT: Real-Time Streaming Text-to-Video Generation
Paper • 2507.03745 • Published • 31
Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective
Paper • 2507.08801 • Published • 30
Captain Cinema: Towards Short Movie Generation
Paper • 2507.18634 • Published • 41
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation
Paper • 2508.07981 • Published • 58
Waver: Wave Your Way to Lifelike Video Generation
Paper • 2508.15761 • Published • 36
Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models
Paper • 2508.12945 • Published • 14
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
Paper • 2509.08519 • Published • 128
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation
Paper • 2510.02283 • Published • 96
UniVideo: Unified Understanding, Generation, and Editing for Videos
Paper • 2510.08377 • Published • 71
Video-As-Prompt: Unified Semantic Control for Video Generation
Paper • 2510.20888 • Published • 45
Uniform Discrete Diffusion with Metric Path for Video Generation
Paper • 2510.24717 • Published • 40
LongLive: Real-time Interactive Long Video Generation
Paper • 2509.22622 • Published • 184
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer
Paper • 2509.24695 • Published • 44
Simulating the Visual World with Artificial Intelligence: A Roadmap
Paper • 2511.08585 • Published • 29
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 91
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation
Paper • 2512.17040 • Published • 27