Min-Hung Chen

cmhungsteve

https://minhungchen.netlify.app/

AI & ML interests

Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning

Recent Activity

upvoted a collection 6 days ago

Reasoning Efficiency Research

authored a paper 12 days ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

upvoted a paper 12 days ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

View all activity

Organizations

authored a paper 12 days ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published 16 days ago • 15

authored a paper 19 days ago

TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control

Paper • 2510.09561 • Published 22 days ago • 7

authored a paper 23 days ago

Temporal Prompting Matters: Rethinking Referring Video Object Segmentation

Paper • 2510.07319 • Published 24 days ago • 2

authored a paper 26 days ago

LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models

Paper • 2510.03232 • Published 29 days ago • 1

authored a paper about 1 month ago

V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts

Paper • 2509.18053 • Published Sep 22 • 3

authored 8 papers 2 months ago

CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models

Paper • 2501.02355 • Published Jan 4 • 1

ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection

Paper • 2412.13174 • Published Dec 17, 2024 • 1

Spatio-Temporal Context Prompting for Zero-Shot Action Detection

Paper • 2408.15996 • Published Aug 28, 2024 • 1

GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation

Paper • 2406.12834 • Published Jun 18, 2024 • 1

Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

Paper • 2404.04231 • Published Apr 5, 2024 • 1

authored a paper 3 months ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published Jul 22 • 39

authored 3 papers 9 months ago

V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models

Paper • 2502.09980 • Published Feb 14 • 5

SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP

Paper • 2408.10202 • Published Aug 19, 2024 • 1

AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting

Paper • 2502.05176 • Published Feb 7 • 38

authored a paper 10 months ago

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Paper • 2501.08326 • Published Jan 14 • 33

authored a paper 11 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 45

authored a paper about 1 year ago

Diffusion-Reward Adversarial Imitation Learning

Paper • 2405.16194 • Published May 25, 2024 • 2

Min-Hung Chen

AI & ML interests

Recent Activity

Organizations

cmhungsteve's activity