huangrh9's picture

2 8 1

huangrh9

huangrh9

·

huangrh99

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

upvoted a paper 20 days ago

Visual Spatial Tuning

updated a model 28 days ago

ILLUME-MLLM/dualvitok

View all activity

Organizations

upvoted a paper 5 days ago

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Paper • 2511.20561 • Published 5 days ago • 31

upvoted a paper 20 days ago

Visual Spatial Tuning

Paper • 2511.05491 • Published 23 days ago • 49

upvoted a paper about 1 month ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27 • 172

upvoted a paper about 2 months ago

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Paper • 2510.13809 • Published Oct 15 • 36

upvoted a paper 3 months ago

RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10 • 72

upvoted a paper 8 months ago

ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement

Paper • 2504.01934 • Published Apr 2 • 22

upvoted a paper 12 months ago

ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance

Paper • 2412.06673 • Published Dec 9, 2024 • 11

upvoted a paper about 1 year ago

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Paper • 2409.18042 • Published Sep 26, 2024 • 40