Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
18.4
TFLOPS
1
7
2
Shuyao Xu
Tim-Xu
Follow
SteveSHEN's profile picture
entropyhu's profile picture
21world's profile picture
3 followers
·
5 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
26 days ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
upvoted
a
paper
about 2 months ago
Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference
authored
a paper
5 months ago
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
View all activity
Organizations
Tim-Xu
's models
1
Sort: Recently updated
Tim-Xu/Qwen2.5-7B-kk-GRPO-s380
Updated
Mar 18