Could you release a 20B‑scale MoE version? Thank you very much.

#27
by houxiaowei - opened

Please consider releasing a 20B-scale model that can run on edge devices with around 16 GB of memory. These machines make up a very large share of the market, and 20B is a “sweet-spot” parameter size that avoids the severe hallucinations that can occur when a model is too small.

You can easily run A3B on 16 GB.
