Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
bestsonny 's Collections
papers

papers

updated Sep 1
Upvote
-

  • Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

    Paper • 2508.16949 • Published Aug 23 • 22

  • Diffusion Language Models Know the Answer Before Decoding

    Paper • 2508.19982 • Published Aug 27 • 23

  • ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models

    Paper • 2508.18773 • Published Aug 26 • 15

  • Intern-S1: A Scientific Multimodal Foundation Model

    Paper • 2508.15763 • Published Aug 21 • 254

  • Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

    Paper • 2508.01191 • Published Aug 2 • 236

  • Self-Rewarding Vision-Language Model via Reasoning Decomposition

    Paper • 2508.19652 • Published Aug 27 • 84
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs