Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tony Congqian Wang's picture
6 14 1

Tony Congqian Wang

TonyCWang

AI & ML interests

None yet

Recent Activity

upvoted an article 6 days ago
The Optimal Architecture for Small Language Models
upvoted a paper 25 days ago
TiDAR: Think in Diffusion, Talk in Autoregression
upvoted an article about 2 months ago
Why Did MiniMax M2 End Up as a Full Attention Model?
View all activity

Organizations

None yet

TonyCWang 's collections 1

Llm training
  • Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

    Paper • 2502.13063 • Published Feb 18, 2025 • 73
Llm training
  • Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

    Paper • 2502.13063 • Published Feb 18, 2025 • 73
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs