Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xiong-Hui Chen's picture
3 3 14

Xiong-Hui Chen

xionghuichen
hamzzi's profile picture SteveSHEN's profile picture ZJUPeng's profile picture
·
http://www.lamda.nju.edu.cn/chenxh/
  • xionghuichen

AI & ML interests

None yet

Organizations

Polixir's profile picture

authored a paper 3 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 306
authored a paper 5 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 185
authored 5 papers 7 months ago

Language Model Self-improvement by Reinforcement Learning Contemplation

Paper • 2305.14483 • Published May 23, 2023 • 1

AFlow: Automating Agentic Workflow Generation

Paper • 2410.10762 • Published Oct 14, 2024 • 1

Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems

Paper • 2305.04832 • Published May 3, 2023

A Survey on Model-based Reinforcement Learning

Paper • 2206.09328 • Published Jun 19, 2022

Offline Reinforcement Learning with Causal Structured World Models

Paper • 2206.01474 • Published Jun 3, 2022
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs