- 
	
	
	Octopus v2: On-device language model for super agentPaper • 2404.01744 • Published • 58
- 
	
	
	Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMsPaper • 2404.05719 • Published • 82
- 
	
	
	OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer EnvironmentsPaper • 2404.07972 • Published • 50
- 
	
	
	Toward Self-Improvement of LLMs via Imagination, Searching, and CriticizingPaper • 2404.12253 • Published • 55
Shaoguang Mao
dawnmsg
		·
				AI & ML interests
None yet
		
		Organizations
None yet
daily paper selected
			
			
	
	- 
	
	
	Octopus v2: On-device language model for super agentPaper • 2404.01744 • Published • 58
- 
	
	
	Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMsPaper • 2404.05719 • Published • 82
- 
	
	
	OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer EnvironmentsPaper • 2404.07972 • Published • 50
- 
	
	
	Toward Self-Improvement of LLMs via Imagination, Searching, and CriticizingPaper • 2404.12253 • Published • 55
			models
			0
		
			
	None public yet
			datasets
			0
		
			
	None public yet




