We quantized mistralai/Mistral-Small-24B-Instruct-2501 to 4-bit precision using BitsAndBytes.

To use this model, you first need to install bitsandbytes:

pip install -U bitsandbytes

Then load it with AutoModelForCausalLM:

from transformers import AutoModelForCausalLM

# The 4-bit quantization config is stored in the checkpoint, so no extra arguments are needed.
model = AutoModelForCausalLM.from_pretrained("minicreeper/Mistral-Small-24B-Instruct-2501-bnb-4bit")
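
After loading, you can run a quick generation to check that the quantized weights work. The snippet below is a minimal sketch, not part of the original card: it assumes the tokenizer files are present in this repo (they typically are for bnb conversions), that a CUDA GPU is visible to bitsandbytes, and that accelerate is installed so device_map="auto" can place the layers.

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "minicreeper/Mistral-Small-24B-Instruct-2501-bnb-4bit"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" (requires accelerate) places the 4-bit weights on the available GPU(s).
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Format a single-turn chat prompt with the model's chat template and generate a short reply.
messages = [{"role": "user", "content": "Explain 4-bit quantization in one sentence."}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
outputs = model.generate(inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))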