Xiao Hu
huxiao09
AI & ML interests
Reinforcement Learning, LLM Reasoning
Recent Activity
upvoted a paper about 17 hours ago
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
liked a model about 2 months ago
Kwai-Keye/Keye-VL-671B-A37B
upvoted a paper 5 months ago
Thyme: Think Beyond Images
Organizations
None yet