model weights are actually BF16, but config.json says FP32

#1
by CHNtentes - opened
Tongyi-MiA org

Thank you for the information. This is a bug caused by the verl checkpoint merger script; we will fix it and upload a new version.
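
In the meantime, a quick way to confirm the mismatch is to compare the dtype declared in config.json against the dtype of the tensors actually stored in the safetensors file. A minimal sketch, assuming a single-file checkpoint; the repo id and filename below are placeholders:

```python
import json

from huggingface_hub import hf_hub_download
from safetensors import safe_open

repo_id = "org/model-name"  # placeholder: replace with the actual repo id

# What config.json claims
config_path = hf_hub_download(repo_id, "config.json")
with open(config_path) as f:
    declared = json.load(f).get("torch_dtype")

# What the weights actually are (use a shard filename for sharded checkpoints)
weights_path = hf_hub_download(repo_id, "model.safetensors")
with safe_open(weights_path, framework="pt") as f:
    first_key = next(iter(f.keys()))
    actual = f.get_tensor(first_key).dtype

print(f"config.json torch_dtype: {declared}")  # e.g. "float32"
print(f"actual tensor dtype:     {actual}")    # e.g. torch.bfloat16
```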

Thanks for your reply. Another issue: why is use_cache set to false?

That also comes from the same script. We followed EasyR1 to merge the model, but something went wrong, possibly because of a version mismatch. In our evaluation, this setting was indeed false, the same as vLLM's default, and the torch dtype was set to bf16 manually.
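
Until the fixed config is uploaded, it may be easiest to override both fields at load time rather than trusting config.json. A minimal sketch with transformers, assuming a causal LM checkpoint; the repo id is a placeholder:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "org/model-name"  # placeholder: replace with the actual repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # override the float32 declared in config.json
    use_cache=True,              # re-enable the KV cache for generate() in transformers
)
```

With vLLM the equivalent is passing dtype="bfloat16" when constructing the engine; since vLLM manages the KV cache itself, the use_cache flag in config.json should not affect it.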
