stellalisy/rethink_rlvr_reproduce-incorrect-qwen2.5_math_7b-lr5e-7-kl0.00-step50
Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
arxiv:1910.09700
Commit History
Upload Qwen2ForCausalLM
a3e350b
verified
stellalisy
committed on
Jun 13
initial commit
ff6a8bb
verified
stellalisy
committed on
Jun 13