ViSpeak: Visual Instruction Feedback in Streaming Videos
AI & ML interests
Machine Learning, Computer Vision, Embodied AI
LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning
LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-Step Reasoning
Paper • 2509.24786 • Published • 5
iSEE-Laboratory/love-r1-train
Preview • Updated • 12
iSEE-Laboratory/love-r1-stage1
8B • Updated • 13
iSEE-Laboratory/love-r1-stage2
8B • Updated • 8
Models (6)
iSEE-Laboratory/love-r1-stage1
8B • Updated • 13
iSEE-Laboratory/love-r1-stage2
8B • Updated • 8
iSEE-Laboratory/love-r1-stage3
8B • Updated • 10
iSEE-Laboratory/llmdet_large
Zero-Shot Object Detection • 0.3B • Updated • 213k • 15
iSEE-Laboratory/llmdet_base
Zero-Shot Object Detection • 0.2B • Updated • 568k • 7
iSEE-Laboratory/llmdet_tiny
Zero-Shot Object Detection • 0.2B • Updated • 3.42k • 6