Improve model card: Add library, links, and usage example

by nielsr HF Staff - opened Aug 12

←

nielsr

Aug 12

This PR significantly enhances the model card for sunblaze-ucb/Qwen3-14B-GRPO-MATH-1EPOCH by:

Updating the main heading to reflect the full model ID: sunblaze-ucb/Qwen3-14B-GRPO-MATH-1EPOCH.
Adding library_name: transformers to the metadata, enabling the "how to use" widget and improving discoverability.
Including descriptive tags such as reinforcement-learning, llm, reasoning, and math for better categorization.
Providing an expanded model description based on the paper abstract and project details, giving users a better understanding of the model and the underlying "Intuitor" and "RLIF" frameworks.
Adding explicit links to the paper, the project page, and the GitHub repository for easy access to source materials and code.
Including a clear Python code snippet for sample usage with the transformers library, making it easier for users to get started with inference.

Please review and merge this PR to improve the model's visibility and usability on the Hugging Face Hub.

Xuandong changed pull request status to merged Aug 13

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment