- 
	
	
	DreamLLM: Synergistic Multimodal Comprehension and CreationPaper • 2309.11499 • Published • 59
- 
	
	
	An Introduction to Vision-Language ModelingPaper • 2405.17247 • Published • 90
- 
	
	
	Chameleon: Mixed-Modal Early-Fusion Foundation ModelsPaper • 2405.09818 • Published • 131
- 
	
	
	No Time to Waste: Squeeze Time into Channel for Mobile Video UnderstandingPaper • 2405.08344 • Published • 15
Yiming Wu
weleen
		AI & ML interests
Computer Vision
		Recent Activity
						upvoted 
								a
								collection
							
						about 1 month ago
						
					Inference Optimized Checkpoints (with Model Optimizer)
						
						updated
								a dataset
							
						about 2 months ago
						
					
						
						
						
						weleen/take_the_banana_and_insert_into_the_bottle
						
						updated
								a model
							
						about 2 months ago
						
					
						
						
						
						weleen/grab_bread_and_put
						 
								 
								 
								
 
				