Commits · Motif-Technologies/optimizer

Support param group with various placements (#13)

e2b41e5
unverified

wyldecat github-actions[bot] commited on 5 days ago

fix bug in fsdp

811726c

ca1207 commited on 20 days ago

Update torch-ext/optimizer/muon.py

b0230e7
unverified

TaehyunKim commited on Oct 2

Update torch-ext/optimizer/muon.py

ff2fcfb
unverified

TaehyunKim commited on Oct 2

Update muon.py

c16b438
unverified

TaehyunKim commited on Oct 2

fix assert in a2a gather scatter

3dafb3e

ca1207 commited on Sep 29

delete state in split_func

15336dc

ca1207 commited on Sep 26

change owner_params to owned_params

6943c45

ca1207 commited on Sep 26

modify pre step (overlap step) can get from arsgs

589b763

ca1207 commited on Sep 26

add doc strings + init self rank on init_assign_params

267e8a0

ca1207 commited on Sep 26

license added for flash_muon

d7cd571

ca1207 commited on Sep 25

apply pre-commit hook

fceb334

dongseokmotif commited on Sep 25

consider multi node

39c42e0

dongseokmotif commited on Sep 25

misc

35894d1

ca1207 commited on Sep 24

use inpalce op in update_g

6e9baad

ca1207 commited on Sep 24

use COMM_DTYPE instead of hardcoded dtype

2a8631f

ca1207 commited on Sep 24

apply all2all scatter gather

ff6d675

ca1207 commited on Sep 24

feat(muon_clip) : add muon clip (#6)

d65066c
unverified

dongseokmotif dongseokmotif github-actions[bot] commited on Sep 24

feat(muon) : add tuned-abc-values & blfoat16 communication

f7faa93

wyldecat commited on Sep 18

feat: update muon to receive paramgroups, not model (#4)

b0f46c7
unverified

junhyeok-motech

leejunhyeok

wyldecat commited on Sep 11

fix(muon): add update_p stage and dealloc tensors properly

99e7c0c

wyldecat commited on Sep 9

chore: add .gitignore

79fc8ba

wyldecat commited on Sep 5

applied lint

db36e39

TaehyunKimMotif commited on Sep 2

fix(optimizer): resolve bug where weight decay was multiplied by wrong lr value (#5)

671b033
verified

dongseokmotif commited on Aug 28

Support HSDP (#4)

8447fd1
verified

iamwyldecat commited on Aug 25

fix(muon): handle un-distributed env

1f13dae

iamwyldecat commited on Jun 23

refactor(muon): change argument adam_wd to weight_decay and handle params' type

02ac540

iamwyldecat commited on Jun 23

fix(muon): free tensors that are no longer needed

64757cb

iamwyldecat commited on Jun 16

chore(muon): update comment

036642a

iamwyldecat commited on Jun 16

chore(muon): clean build and update doc

febdf5b

iamwyldecat commited on Jun 16

fix(muon): delete intermediate tensors immediately to lower peak mem usage

bdd2678

iamwyldecat commited on Jun 15

chore: initial commit

8535e80

iamwyldecat commited on Jun 15

Motif-Technologies
/

optimizer

Commit History

Support param group with various placements (#13)

e2b41e5
unverified

fix bug in fsdp

811726c

Update torch-ext/optimizer/muon.py

b0230e7
unverified

Update torch-ext/optimizer/muon.py

ff2fcfb
unverified

Update muon.py

c16b438
unverified

fix assert in a2a gather scatter

3dafb3e

delete state in split_func

15336dc

change owner_params to owned_params

6943c45

modify pre step (overlap step) can get from arsgs

589b763

add doc strings + init self rank on init_assign_params

267e8a0

license added for flash_muon

d7cd571

apply pre-commit hook

fceb334

consider multi node

39c42e0

misc

35894d1

use inpalce op in update_g

6e9baad

use COMM_DTYPE instead of hardcoded dtype

2a8631f

apply all2all scatter gather

ff6d675

feat(muon_clip) : add muon clip (#6)

d65066c
unverified

feat(muon) : add tuned-abc-values & blfoat16 communication

f7faa93

feat: update muon to receive paramgroups, not model (#4)

b0f46c7
unverified

fix(muon): add update_p stage and dealloc tensors properly

99e7c0c

chore: add .gitignore

79fc8ba

applied lint

db36e39

fix(optimizer): resolve bug where weight decay was multiplied by wrong lr value (#5)

671b033
verified

Support HSDP (#4)

8447fd1
verified

fix(muon): handle un-distributed env

1f13dae

refactor(muon): change argument adam_wd to weight_decay and handle params' type

02ac540

fix(muon): free tensors that are no longer needed

64757cb

chore(muon): update comment

036642a

chore(muon): clean build and update doc

febdf5b

fix(muon): delete intermediate tensors immediately to lower peak mem usage

bdd2678

chore: initial commit

8535e80

Commit History

Support param group with various placements (#13) e2b41e5 unverified

fix bug in fsdp 811726c

Update torch-ext/optimizer/muon.py b0230e7 unverified

Update torch-ext/optimizer/muon.py ff2fcfb unverified

Update muon.py c16b438 unverified

fix assert in a2a gather scatter 3dafb3e

delete state in split_func 15336dc

change owner_params to owned_params 6943c45

modify pre step (overlap step) can get from arsgs 589b763

add doc strings + init self rank on init_assign_params 267e8a0

license added for flash_muon d7cd571

apply pre-commit hook fceb334

consider multi node 39c42e0

misc 35894d1

use inpalce op in update_g 6e9baad

use COMM_DTYPE instead of hardcoded dtype 2a8631f

apply all2all scatter gather ff6d675

feat(muon_clip) : add muon clip (#6) d65066c unverified

feat(muon) : add tuned-abc-values & blfoat16 communication f7faa93

feat: update muon to receive paramgroups, not model (#4) b0f46c7 unverified

fix(muon): add update_p stage and dealloc tensors properly 99e7c0c

chore: add .gitignore 79fc8ba

applied lint db36e39

fix(optimizer): resolve bug where weight decay was multiplied by wrong lr value (#5) 671b033 verified

Support HSDP (#4) 8447fd1 verified

fix(muon): handle un-distributed env 1f13dae

refactor(muon): change argument adam_wd to weight_decay and handle params' type 02ac540

fix(muon): free tensors that are no longer needed 64757cb

chore(muon): update comment 036642a

chore(muon): clean build and update doc febdf5b

fix(muon): delete intermediate tensors immediately to lower peak mem usage bdd2678

chore: initial commit 8535e80

Support param group with various placements (#13)

e2b41e5
unverified

fix bug in fsdp

811726c

Update torch-ext/optimizer/muon.py

b0230e7
unverified

Update torch-ext/optimizer/muon.py

ff2fcfb
unverified

Update muon.py

c16b438
unverified

fix assert in a2a gather scatter

3dafb3e

delete state in split_func

15336dc

change owner_params to owned_params

6943c45

modify pre step (overlap step) can get from arsgs

589b763

add doc strings + init self rank on init_assign_params

267e8a0

license added for flash_muon

d7cd571

apply pre-commit hook

fceb334

consider multi node

39c42e0

misc

35894d1

use inpalce op in update_g

6e9baad

use COMM_DTYPE instead of hardcoded dtype

2a8631f

apply all2all scatter gather

ff6d675

feat(muon_clip) : add muon clip (#6)

d65066c
unverified

feat(muon) : add tuned-abc-values & blfoat16 communication

f7faa93

feat: update muon to receive paramgroups, not model (#4)

b0f46c7
unverified

fix(muon): add update_p stage and dealloc tensors properly

99e7c0c

chore: add .gitignore

79fc8ba

applied lint

db36e39

fix(optimizer): resolve bug where weight decay was multiplied by wrong lr value (#5)

671b033
verified

Support HSDP (#4)

8447fd1
verified

fix(muon): handle un-distributed env

1f13dae

refactor(muon): change argument adam_wd to weight_decay and handle params' type

02ac540

fix(muon): free tensors that are no longer needed

64757cb

chore(muon): update comment

036642a

chore(muon): clean build and update doc

febdf5b

fix(muon): delete intermediate tensors immediately to lower peak mem usage

bdd2678

chore: initial commit

8535e80