Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Hongshen Xu's picture
4 2

Hongshen Xu

importpandas
·
  • importpandas

AI & ML interests

Machine Reading Comprehension, Web Information Extraction, Multi-modal Pre-training

Organizations

OpenDFM's profile picture SJTU Cross Media Language Intelligence Lab's profile picture

upvoted a collection 4 months ago

Nemotron-Pre-Training-Datasets

Collection
Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 9 days ago • 84
upvoted a paper 5 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80
upvoted a paper 8 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 82
upvoted a paper 12 months ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs