Lyric1010 commited on
Commit
cf7f635
·
verified ·
1 Parent(s): 4353c14

Add model card with tags for hidden_dropout-iter_0001192

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ model_name: hidden_dropout-iter_0001192
3
+ tags:
4
+ - qwen2.5
5
+ ---
6
+
7
+ # hidden_dropout-iter_0001192
8
+
9
+ This is a model uploaded from /mnt/nanjingcephfs/project_wx-rec-alg-bdc-exp/bwzheng/yulan/hyw/Ubiquant-Pretrain/build/wjp-share/output_mcore_qwen2.5_pretrain/checkpoint/hidden_dropout_2025.10.08-22.05.45-pretrain-mcore-qwen2.5-0.5B-lr-1e-5-minlr-1e-6-bs-4-gbs-1024-seqlen-8192-pr-bf16-tp-1-pp-1-cp-1-ac-sel-do-true-sp-false-ti-1192-wi-119/huggingface/qwen2-0.5b-using-llam2-modeling.