rain's picture

1 1

rain

dd12345789

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

authored a paper 3 months ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

authored a paper 3 months ago

Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models

View all activity

Organizations

None yet

authored 2 papers 3 months ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published Aug 4 • 36

Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models

Paper • 2501.04945 • Published Jan 9