Reasoning Efficiency Research Collection Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs • 3 items • Updated 8 days ago • 7
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning Paper • 2510.15110 • Published 15 days ago • 15
TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control Paper • 2510.09561 • Published 21 days ago • 7
Temporal Prompting Matters: Rethinking Referring Video Object Segmentation Paper • 2510.07319 • Published 23 days ago • 2
LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models Paper • 2510.03232 • Published 28 days ago • 1
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published May 30 • 138
V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts Paper • 2509.18053 • Published Sep 22 • 3
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Paper • 2508.14041 • Published Aug 19 • 59
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation Paper • 2404.04231 • Published Apr 5, 2024 • 1
GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation Paper • 2406.12834 • Published Jun 18, 2024 • 1
Spatio-Temporal Context Prompting for Zero-Shot Action Detection Paper • 2408.15996 • Published Aug 28, 2024 • 1
ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection Paper • 2412.13174 • Published Dec 17, 2024 • 1
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models Paper • 2501.02355 • Published Jan 4 • 1
SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP Paper • 2408.10202 • Published Aug 19, 2024 • 1
AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting Paper • 2502.05176 • Published Feb 7 • 38
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning Paper • 2507.16815 • Published Jul 22 • 39
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published Mar 6 • 96