Yufeng Zhao's picture

13

Yufeng Zhao

epsilondylan

·

AI & ML interests

LLM Reasoning

Recent Activity

upvoted a paper 9 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

upvoted a paper 2 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

upvoted a paper 3 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

View all activity

Organizations

epsilondylan 's datasets

None public yet