Haotian Zhang
haotiz
		AI & ML interests
Vision and Language
		Recent Activity
						upvoted 
								a
								paper
							
						about 1 month ago
						
					
						
						
						Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents
						
						authored 
								a paper
							
						about 1 month ago
						
					
						
						
						MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid
  Vision Tokenizer
						
						authored 
								a paper
							
						about 1 month ago
						
					
						
						
						Ferret-UI 2: Mastering Universal User Interface Understanding Across
  Platforms