Pristine-Mini-8B-A1B-Base (8.6B A1.4B)

Pristine is a model series / dataset based on a pruned version of Ring-Mini-2.0 (16B A1.4B --> 8.6B A1.4B). The main motivation behind this project is that Granite 4.0 7B A1B has been pretty disappointing and with a high-performance 8B-A1B these small MoEs can be avenged.

Note: This model hasn't been further trained. Do not use it as is or create quantizations of it!

Downloads last month
115
Safetensors
Model size
9B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for qingy2024/Pristine-Mini-8B-A1B-Base

Finetuned
(1)
this model
Quantizations
1 model