Quantized version of yentinglin/Mistral-Small-24B-Instruct-2501-reasoning. Tested to work with llama.cpp and LM Studio

Downloads last month
10
GGUF
Model size
24B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support