ppo_sample8_critic-warm10-lr2e-6_step20_crtic / model-00007-of-00007.safetensors

Commit History

Upload Qwen2ForCausalLM
dec8fdc
verified

daixuancheng commited on