Text Generation
GGUF
English
reasoning
thinking
uncensored
gated
mixture of experts
Mixture of Experts
8x3B
Llama 3.2 MOE
128k context
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
float32
swearing
rp
horror
mergekit
llama-3
llama-3.2
Update README.md
README.md CHANGED
@@ -56,7 +56,8 @@ models into one massive powerhouse at 18.4B parameters (equal to 24B - 8 X 3B).
 
 This model's instruction following and output generation for creative writing, prose, fiction, and role play are exceptional.
 
-This model is also "gated" and contains a master reasoning model (this can be turned on/off).
+This model is also "gated", contains a master reasoning model (this can be turned on/off), and was built at float32 (32-bit) precision;
+the quants carry the output tensor at Q8_0, with a few choice quants at f16 (16-bit) and a Q8_0 variant with an f32 (32-bit) output tensor.
 
 The "gated" structure means the "reasoning model" is reinforced by the other 7 models in the MOE during reasoning, and then during
 output generation / non-reasoning, the non-reasoning model(s) take control.
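The output-tensor precision described above can be checked directly from a GGUF file's metadata. Below is a minimal sketch using the `gguf` Python package maintained by the llama.cpp project; the file name is a placeholder, not an actual quant name from this repo:

```python
# Minimal sketch: list per-tensor quantization types in a GGUF file.
# Requires: pip install gguf
from gguf import GGUFReader

reader = GGUFReader("model-Q8_0.gguf")  # placeholder path, not a real file name

# Each tensor records its quantization type, so the output tensor's
# precision (Q8_0 / F16 / F32) can be read off directly.
for tensor in reader.tensors:
    print(tensor.name, tensor.tensor_type.name)
```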
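For running a quant locally, a plain generation call with llama-cpp-python might look like the sketch below. The file name and prompt are placeholders, `n_ctx` reflects the 128k context advertised in the tags, and whatever mechanism the card prescribes for turning the reasoning model on or off is not shown here:

```python
# Minimal sketch, assuming llama-cpp-python (pip install llama-cpp-python).
from llama_cpp import Llama

llm = Llama(
    model_path="Llama-3.2-8x3B-MOE-Q8_0.gguf",  # placeholder file name
    n_ctx=131072,      # 128k context, as advertised in the model tags
    n_gpu_layers=-1,   # offload all layers when a GPU is available
)

out = llm.create_completion(
    "Continue the scene: the lighthouse keeper heard a knock at the door.",
    max_tokens=256,
    temperature=0.8,   # creative models are typically run warmer
)
print(out["choices"][0]["text"])
```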