KAT-Dev-72B-Exp-GPTQ-INT4-gs32-0.01 / generation_config.json
Shane
Upload GPTQ quantized model (group_size=32)
0f4738f verified
raw
history blame contribute delete
143 Bytes
{
"_from_model_config": true,
"bos_token_id": 151643,
"eos_token_id": 151645,
"transformers_version": "4.52.4",
"use_cache": false
}