arxiv:2505.24850
Shuyao Xu
Tim-Xu
AI & ML interests
None yet
Recent Activity
upvoted a paper 26 days ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
upvoted a paper about 2 months ago
Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference
authored a paper 5 months ago
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning