This is an MXFP4_MOE quantization of the model Phi-mini-MoE-instruct.

Quantized from the F16 GGUF files at: https://huggingface.co/gabriellarson/Phi-mini-MoE-instruct-GGUF

Original model: https://huggingface.co/microsoft/Phi-mini-MoE-instruct

Format: GGUF
Model size: 8B params
Architecture: phimoe
Quantization: 4-bit
