Video-BrowseComp: Benchmarking Agentic Video Research on Open Web Paper • 2512.23044 • Published 3 days ago • 9
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper • 2512.05103 • Published 27 days ago • 16
MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval Paper • 2509.26378 • Published Sep 30, 2025
MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos Paper • 2502.12558 • Published Feb 18, 2025
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval Paper • 2502.11431 • Published Feb 17, 2025
VideoDeepResearch: Long Video Understanding With Agentic Tool Using Paper • 2506.10821 • Published Jun 12, 2025 • 19
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published Jun 23, 2025 • 78
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published Jun 23, 2025 • 78 • 4
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published Jun 23, 2025 • 78