MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation Paper • 2510.18692 • Published 13 days ago • 38
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published 13 days ago • 107
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback Paper • 2510.16888 • Published 15 days ago • 18
QueST: Incentivizing LLMs to Generate Difficult Problems Paper • 2510.17715 • Published 14 days ago • 31
GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer Paper • 2510.16136 • Published 17 days ago • 2
Embody 3D: A Large-scale Multimodal Motion and Behavior Dataset Paper • 2510.16258 • Published 16 days ago • 7
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA Paper • 2510.04849 • Published 28 days ago • 110
Trace Anything: Representing Any Video in 4D via Trajectory Fields Paper • 2510.13802 • Published 19 days ago • 30
FlashWorld: High-quality 3D Scene Generation within Seconds Paper • 2510.13678 • Published 19 days ago • 70
Temporal Alignment Guidance: On-Manifold Sampling in Diffusion Models Paper • 2510.11057 • Published 21 days ago • 30
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published 21 days ago • 160
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models Paper • 2510.11341 • Published 21 days ago • 33