sascha-kirch 's Collections Diffusion Models
updated
Instruct-Imagen: Image Generation with Multi-modal Instruction
Paper
• 2401.01952
• Published • 31
ODIN: A Single Model for 2D and 3D Perception
Paper
• 2401.02416
• Published • 13
Bigger is not Always Better: Scaling Properties of Latent Diffusion
Models
Paper
• 2404.01367
• Published • 22
Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion
Models
Paper
• 2404.02747
• Published • 13
PointInfinity: Resolution-Invariant Point Diffusion Models
Paper
• 2404.03566
• Published • 16
ControlNet++: Improving Conditional Controls with Efficient Consistency
Feedback
Paper
• 2404.07987
• Published • 48
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse
Controls to Any Diffusion Model
Paper
• 2404.09967
• Published • 21
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Paper
• 2404.14507
• Published • 23
Semantica: An Adaptable Image-Conditioned Diffusion Model
Paper
• 2405.14857
• Published • 11
Improved Distribution Matching Distillation for Fast Image Synthesis
Paper
• 2405.14867
• Published • 15
Paper
• 2405.18407
• Published • 48
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Paper
• 2403.03206
• Published • 71
Kaleido Diffusion: Improving Conditional Diffusion Models with
Autoregressive Latent Modeling
Paper
• 2405.21048
• Published • 16
4Diffusion: Multi-view Video Diffusion Model for 4D Generation
Paper
• 2405.20674
• Published • 15
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Paper
• 2406.01493
• Published • 23
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Paper
• 2406.04333
• Published • 38
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few
Steps Image Generation
Paper
• 2406.02347
• Published • 3
Step-aware Preference Optimization: Aligning Preference with Denoising
Performance at Each Step
Paper
• 2406.04314
• Published • 30
Alleviating Distortion in Image Generation via Multi-Resolution
Diffusion Models
Paper
• 2406.09416
• Published • 29
Interpreting the Weight Space of Customized Diffusion Models
Paper
• 2406.09413
• Published • 20
ExVideo: Extending Video Diffusion Models via Parameter-Efficient
Post-Tuning
Paper
• 2406.14130
• Published • 10
Paper
• 2402.09470
• Published • 13
ControlNeXt: Powerful and Efficient Control for Image and Video
Generation
Paper
• 2408.06070
• Published • 55
Paper
• 2408.07009
• Published • 62
Discrete Diffusion Modeling by Estimating the Ratios of the Data
Distribution
Paper
• 2310.16834
• Published • 5
Transfusion: Predict the Next Token and Diffuse Images with One
Multi-Modal Model
Paper
• 2408.11039
• Published • 63
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
Paper
• 2411.18613
• Published • 59
SNOOPI: Supercharged One-step Diffusion Distillation with Proper
Guidance
Paper
• 2412.02687
• Published • 113
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion
Models
Paper
• 2312.09608
• Published • 16
Audio-visual Controlled Video Diffusion with Masked Selective State
Spaces Modeling for Natural Talking Head Generation
Paper
• 2504.02542
• Published • 52