Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better Paper • 2506.09040 • Published Jun 10 • 34
Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs Paper • 2506.07045 • Published Jun 8 • 8
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation Paper • 2412.02259 • Published Dec 3, 2024 • 60
IDEA-Research/grounding-dino-base Zero-Shot Object Detection • 0.2B • Updated May 12, 2024 • 1.45M • 137
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO Text Generation • 47B • Updated Apr 30, 2024 • 9.19k • • 450