- Reinforcement Learning Foundations for Deep Research Systems: A Survey — arXiv:2509.06733, published Sep 8, 2025
- Bootstrapping Language Models with DPO Implicit Rewards — arXiv:2406.09760, published Jun 14, 2024
- Aligning Crowd Feedback via Distributional Preference Reward Modeling — arXiv:2402.09764, published Feb 15, 2024