Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

sunblaze-ucb
/
Qwen3-14B-GRPO-MATH-1EPOCH

Text Generation
Transformers
Safetensors
English
qwen3
reinforcement-learning
llm
reasoning
math
conversational
text-generation-inference
Model card Files Files and versions
xet
Community
1
Qwen3-14B-GRPO-MATH-1EPOCH
2.07 kB
  • 2 contributors
History: 2 commits
Xuandong's picture
Xuandong
Create README.md
9493b90 verified 5 months ago
  • .gitattributes
    1.52 kB
    initial commit 5 months ago
  • README.md
    552 Bytes
    Create README.md 5 months ago