Qianzhouyi

Saputello

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

upvoted a paper about 2 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

authored a paper 7 months ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Organizations

None yet

upvoted 2 papers about 2 months ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published Apr 1 • 14

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 184