VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos Paper • 2510.19488 • Published 9 days ago • 19
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published 11 days ago • 116
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis Paper • 2505.13227 • Published May 19 • 45