Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
reaperdoesntknow
/
MoA-100M
like
0
Text Generation
Transformers
PyTorch
nvidia/Nemotron-Math-HumanReasoning
WeMake/Intelligent-Content-Understanding
English
moa_metric
mixture-of-attentions
distance-attention
metric-attention
mqa
hyperffn
router-gating
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
MoA-100M
Commit History
Upload MoAMetricLM
7a75b28
verified
reaperdoesntknow
commited on
Sep 21
Update README.md
a61b9ff
verified
reaperdoesntknow
commited on
Sep 20
Update README.md
a9467db
verified
reaperdoesntknow
commited on
Sep 20
Update README.md
784c21a
verified
reaperdoesntknow
commited on
Sep 20
Update README.md
c22b85c
verified
reaperdoesntknow
commited on
Sep 20
Upload MoAMetricLM
377cc15
verified
reaperdoesntknow
commited on
Sep 20
initial commit
b952ab3
verified
reaperdoesntknow
commited on
Sep 20