Zhihe Yang

zhyang2226

AI & ML interests

Trustworthy RL & Offline RL

Recent Activity

liked a model 13 days ago

tencent/HunyuanImage-3.0

liked a model 4 months ago

tencent/HunyuanVideo

authored a paper 4 months ago

Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key

Papers 2

arxiv:2505.12929

arxiv:2501.09695

Models 2

zhyang2226/opadpo-lora_llava-v1.5-13b

zhyang2226/opadpo-lora_llava-v1.5-7b

Datasets 0

None public yet