Yufeng Zhao
epsilondylan
AI & ML interests
LLM Reasoning
Recent Activity
upvoted
a
paper
9 days ago
P1: Mastering Physics Olympiads with Reinforcement Learning
upvoted
a
paper
2 months ago
FlowRL: Matching Reward Distributions for LLM Reasoning
upvoted
a
paper
3 months ago
A Survey of Reinforcement Learning for Large Reasoning Models