Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
min's picture
2 5 1

min

qiyang-attn
21world's profile picture
·
  • velconia

AI & ML interests

GNN, LLM, Generative Models, MultiModal, Recommendation Models

Organizations

None yet

authored 2 papers 3 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 226

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 38
authored 3 papers 5 months ago

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Paper • 2503.16057 • Published Mar 20, 2025 • 14

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

Paper • 2504.13914 • Published Apr 10, 2025 • 4

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Paper • 2508.18756 • Published Aug 26, 2025 • 36
authored a paper 11 months ago

Frac-Connections: Fractional Extension of Hyper-Connections

Paper • 2503.14125 • Published Mar 18, 2025 • 22
authored 2 papers about 1 year ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28, 2025 • 32

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 23
authored a paper over 1 year ago

Hyper-Connections

Paper • 2409.19606 • Published Sep 29, 2024 • 26
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs