drbh/yamoe
kernel
License: mit
main
yamoe/torch-ext
Commit History
bd058af  fix: improve layer for transformer integration (drbh committed on Sep 19)
cf66620  feat: align outputs and support backward method (drbh committed on Sep 4)
733f7f4  feat: impl backward experts (drbh committed on Sep 4)
281d8ba  feat: yet another moe (drbh committed on Aug 28)