Yuxian Gu
t1101675
AI & ML interests
Efficient methods for language models
Recent Activity
upvoted
a
paper
24 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
updated
a Space
about 2 months ago
t1101675/trackio
published
a Space
about 2 months ago
t1101675/trackio