model_name: mlp_drop-iter_0001192
tags:
  - qwen2.5

mlp_drop-iter_0001192

This model was uploaded from the following local checkpoint path: /mnt/nanjingcephfs/project_wx-rec-alg-bdc-exp/bwzheng/yulan/hyw/Ubiquant-Pretrain/build/wjp-share/output_mcore_qwen2.5_pretrain/checkpoint/mlp_drop_2025.10.09-20.15.34-pretrain-mcore-qwen2.5-0.5B-lr-1e-5-minlr-1e-6-bs-4-gbs-1024-seqlen-8192-pr-bf16-tp-1-pp-1-cp-1-ac-sel-do-true-sp-false-ti-1192-wi-119/huggingface/qwen2-0.5b-using-llam2-modeling
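
The run directory name in the path above appears to encode training settings as hyphen-separated key/value pairs (for example `lr-1e-5` and `seqlen-8192`). The sketch below shows one way to pull those pairs into a dictionary. The key list is an assumption read off the path itself, and the function name `parse_run_name` is hypothetical. The card does not document what each abbreviation means, so the values are returned as raw strings without interpretation.

```python
import re

# Run directory name copied verbatim from the checkpoint path in this card.
RUN_NAME = (
    "mlp_drop_2025.10.09-20.15.34-pretrain-mcore-qwen2.5-0.5B-"
    "lr-1e-5-minlr-1e-6-bs-4-gbs-1024-seqlen-8192-pr-bf16-"
    "tp-1-pp-1-cp-1-ac-sel-do-true-sp-false-ti-1192-wi-119"
)

# Assumption: these are the keys used in the name; the card does not list them.
KEYS = ("lr", "minlr", "bs", "gbs", "seqlen", "pr",
        "tp", "pp", "cp", "ac", "do", "sp", "ti", "wi")


def parse_run_name(name):
    """Return {key: raw string value} for every '-key-value' pair in name."""
    # Longest keys first so that 'minlr' is tried before 'lr'.
    alternation = "|".join(sorted(KEYS, key=len, reverse=True))
    # A value runs until the next '-key-' or the end of the string, which
    # keeps values containing hyphens (such as '1e-5') in one piece.
    pattern = re.compile(
        rf"-({alternation})-(.+?)(?=-(?:{alternation})-|$)"
    )
    return {m.group(1): m.group(2) for m in pattern.finditer(name)}


fields = parse_run_name(RUN_NAME)
for key, value in fields.items():
    print(f"{key:>7} = {value}")
```

Matching against a fixed key list, rather than splitting on every hyphen, is what keeps `1e-5` and `1e-6` from being broken apart.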