Rin
hu5enpai
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
commented on
a paper
about 2 months ago
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised
Fine-Tuning and Reinforcement Learning via Dynamic Weighting