Improve model card: Add library, links, and usage example

#1
by nielsr HF Staff - opened

This PR significantly enhances the model card for sunblaze-ucb/Qwen3-14B-GRPO-MATH-1EPOCH by:

  • Updating the main heading to reflect the full model ID: sunblaze-ucb/Qwen3-14B-GRPO-MATH-1EPOCH.
  • Adding library_name: transformers to the metadata, enabling the "how to use" widget and improving discoverability.
  • Including descriptive tags such as reinforcement-learning, llm, reasoning, and math for better categorization.
  • Providing an expanded model description based on the paper abstract and project details, giving users a better understanding of the model and the underlying "Intuitor" and "RLIF" frameworks.
  • Adding explicit links to the paper, the project page, and the GitHub repository for easy access to source materials and code.
  • Including a clear Python code snippet for sample usage with the transformers library, making it easier for users to get started with inference.

Please review and merge this PR to improve the model's visibility and usability on the Hugging Face Hub.

Xuandong changed pull request status to merged

Sign up or log in to comment