Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
polaris-73
/
ds8b_grpo_math_gsm8k_rloo-global_step_400
like
0
Safetensors
llama
Model card
Files
Files and versions
xet
Community
main
ds8b_grpo_math_gsm8k_rloo-global_step_400
/
model-00002-of-00007.safetensors
Commit History
Upload ds8b_grpo_math_gsm8k_rloo at global_step_400
1cc5eaf
verified
polaris-73
commited on
Aug 12