Video-Understadning - a guaguastandup Collection

Video-Understadning

updated about 1 month ago

OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding

Paper • 2512.23646 • Published Dec 29, 2025 • 15

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Paper • 2602.07845 • Published Feb 8 • 69