Hallowks's Collections

Agents

updated Feb 8

  • rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

    Paper • 2501.04519 • Published Jan 8 • 285

  • Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

    Paper • 2501.04682 • Published Jan 8 • 99

  • Search-o1: Agentic Search-Enhanced Large Reasoning Models

    Paper • 2501.05366 • Published Jan 9 • 102

  • Agent Laboratory: Using LLM Agents as Research Assistants

    Paper • 2501.04227 • Published Jan 8 • 94

  • URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

    Paper • 2501.04686 • Published Jan 8 • 53

  • LLM4SR: A Survey on Large Language Models for Scientific Research

    Paper • 2501.04306 • Published Jan 8 • 36

  • InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

    Paper • 2501.04575 • Published Jan 8 • 25

  • ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

    Paper • 2502.04306 • Published Feb 6 • 20