rwightman/timm-optim-caution

rwightman (HF Staff) committed on Dec 6, 2024

Commit d03b163 · verified · 1 Parent(s): 58638f1

Update README.md

Files changed (1)

README.md +7 -1

@@ -19,6 +19,11 @@ This is what the 'caution' addition looks like in an optimizer:
     exp_avg = exp_avg * mask
 ```
+Train args:
+```
+./distributed_train.sh 2 --dataset hfds/timm/mini-imagenet --num-classes 100 --model vit_wee_patch16_reg1_gap_256 -j 8 --epochs 200 --warmup-prefix --sched-on-updates --warmup-lr 0 --mixup .2 --model-ema --model-ema-decay 0.999 --model-ema-warmup --aa rand-m9-mstd0.5-inc1 --remode pixel --reprob 0.25 --amp --weight-decay .05 --drop 0.1 --drop-path .1 -b 288 --opt cadamw --lr 1e-3
+```
 # LaProp
@@ -74,4 +79,5 @@ This is what the 'caution' addition looks like in an optimizer:
 ![Top-1](mars/eval_top1_comparison.png)
 ## MARS Train Loss
-![Loss](mars/train_loss_comparison.png)
+![Loss](mars/train_loss_comparison.png)
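
The hunk context in the diff shows only the last line of the 'caution' step, `exp_avg = exp_avg * mask`. As a rough illustration of what that line does, here is a minimal pure-Python sketch. How `mask` is built is not visible in this diff, so the construction below is an assumption: it keeps entries where the momentum and the gradient agree in sign, then divides by the clamped mean of the mask. The function name `apply_caution` and the `min_scale` parameter are hypothetical and are not part of timm's API.

```python
def apply_caution(exp_avg, grad, min_scale=1e-3):
    """Zero momentum entries whose sign disagrees with the gradient.

    Assumed mask construction (not shown in the diff): sign agreement,
    rescaled by the clamped fraction of entries that were kept.
    Expects two non-empty sequences of floats with the same length.
    """
    # 1.0 where momentum and gradient point the same way, else 0.0
    mask = [1.0 if m * g > 0 else 0.0 for m, g in zip(exp_avg, grad)]
    # Divide by the mean of the mask, clamped from below to avoid huge scales
    scale = max(sum(mask) / len(mask), min_scale)
    mask = [v / scale for v in mask]
    # The line visible in the README diff: exp_avg = exp_avg * mask
    return [m * v for m, v in zip(exp_avg, mask)]


exp_avg = [0.5, -0.2, 0.1, 0.4]
grad = [1.0, 1.0, -1.0, 2.0]
cautious = apply_caution(exp_avg, grad)
```

In this example two of the four entries agree in sign, so the kept entries are doubled and the other two are zeroed. A real optimizer would do the same thing with tensor operations on each parameter's state rather than Python lists.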