Collections
Discover the best community collections!
Collections including paper arxiv:2506.08012 
						
					
				- 
	
	
	
Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Paper • 2502.17110 • Published • 13 - 
	
	
	
WebGames: Challenging General-Purpose Web-Browsing AI Agents
Paper • 2502.18356 • Published • 14 - 
	
	
	
VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model
Paper • 2502.18906 • Published • 12 - 
	
	
	
AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Paper • 2503.02268 • Published • 11 
- 
	
	
	
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Paper • 2412.04454 • Published • 71 - 
	
	
	
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
Paper • 2506.03143 • Published • 52 - 
	
	
	
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning
Paper • 2505.12370 • Published - 
	
	
	
UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning
Paper • 2505.12493 • Published 
- 
	
	
	
				lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 260 • 97 - 
	
	
	
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 - 
	
	
	
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 - 
	
	
	
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88 
- 
	
	
	
End-to-End Goal-Driven Web Navigation
Paper • 1602.02261 • Published - 
	
	
	
Learning Language Games through Interaction
Paper • 1606.02447 • Published - 
	
	
	
Naturalizing a Programming Language via Interactive Learning
Paper • 1704.06956 • Published - 
	
	
	
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Paper • 1802.08802 • Published • 1 
- 
	
	
	
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Paper • 2412.04454 • Published • 71 - 
	
	
	
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
Paper • 2506.03143 • Published • 52 - 
	
	
	
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning
Paper • 2505.12370 • Published - 
	
	
	
UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning
Paper • 2505.12493 • Published 
- 
	
	
	
				lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 260 • 97 - 
	
	
	
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 - 
	
	
	
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 - 
	
	
	
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88 
- 
	
	
	
Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Paper • 2502.17110 • Published • 13 - 
	
	
	
WebGames: Challenging General-Purpose Web-Browsing AI Agents
Paper • 2502.18356 • Published • 14 - 
	
	
	
VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model
Paper • 2502.18906 • Published • 12 - 
	
	
	
AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Paper • 2503.02268 • Published • 11 
- 
	
	
	
End-to-End Goal-Driven Web Navigation
Paper • 1602.02261 • Published - 
	
	
	
Learning Language Games through Interaction
Paper • 1606.02447 • Published - 
	
	
	
Naturalizing a Programming Language via Interactive Learning
Paper • 1704.06956 • Published - 
	
	
	
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Paper • 1802.08802 • Published • 1