arxiv:2502.18137
Xiangchendong
Xiang-cd
AI & ML interests
pre-train models
Recent Activity
upvoted an article 1 day ago
From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate
upvoted a paper 29 days ago
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
liked a dataset 5 months ago
waltsun/MOAT
Organizations
None yet