Shiwon Jeong
sebastianrcnt
AI & ML interests: None yet
Organizations: None yet
interesting
- Slamming: Training a Speech Language Model on One GPU in a Day (Paper • 2502.15814 • Published • 69)
- Small Models Struggle to Learn from Strong Reasoners (Paper • 2502.12143 • Published • 39)
- HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading (Paper • 2502.12574 • Published • 12)
- Large Language Diffusion Models (Paper • 2502.09992 • Published • 122)
models (0): None public yet
datasets (0): None public yet