arxiv:2502.06155
Hangliang Ding
foreverpiano
Recent Activity
upvoted a paper 10 days ago
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
upvoted an article about 1 month ago
Proximal Policy Optimization (PPO)
upvoted a paper about 2 months ago
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning