Junli Wang's picture

Junli Wang PRO

ZeonLap

·

https://ZeonLap.github.io

ZeonLap

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

upvoted a paper 7 days ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

authored a paper 3 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

View all activity

Organizations

upvoted 2 papers 7 days ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published 8 days ago • 19

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published 9 days ago • 114

upvoted a paper 3 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12 • 31

upvoted 2 papers 8 months ago

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published Feb 20 • 24

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 207

upvoted a paper 9 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

upvoted a paper 10 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 285

upvoted an article 10 months ago

Article

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

By

and 1 other •

Jan 3

• 19

upvoted a paper 10 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

upvoted 3 papers 11 months ago

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Paper • 2412.09605 • Published Dec 12, 2024 • 29

Layerwise Recurrent Router for Mixture-of-Experts

Paper • 2408.06793 • Published Aug 13, 2024 • 32

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 70

upvoted a paper 12 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 50