FP8 quantization using TensorRT-LLM
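
As a rough illustration of what FP8 weight quantization involves, the sketch below rounds values to the FP8 E4M3 grid using a single per-tensor scale. It is plain Python and does not use or reproduce TensorRT-LLM. It assumes the E4M3 variant (maximum finite value 448) and per-tensor scaling, neither of which this card confirms for this checkpoint. The names `round_to_e4m3`, `quantize_per_tensor`, and `dequantize` are hypothetical helpers for this example only.

```python
import math

# Assumption: FP8 E4M3 (4 exponent bits, 3 mantissa bits), max finite value 448.
E4M3_MAX = 448.0
E4M3_MIN_NORMAL_EXP = -6  # smallest exponent of a normal E4M3 number


def round_to_e4m3(v: float) -> float:
    """Round v to the nearest value on the E4M3 grid, saturating at +/-448."""
    v = max(-E4M3_MAX, min(E4M3_MAX, v))
    if v == 0.0:
        return 0.0
    # abs(v) = m * 2**e with 0.5 <= m < 1, so floor(log2(abs(v))) == e - 1.
    _, e = math.frexp(abs(v))
    exponent = max(e - 1, E4M3_MIN_NORMAL_EXP)  # subnormals share the lowest exponent
    step = 2.0 ** (exponent - 3)                # spacing given 3 mantissa bits
    return round(v / step) * step


def quantize_per_tensor(weights):
    """Return (quantized values, scale) using one scale for the whole tensor."""
    amax = max(abs(w) for w in weights)
    scale = amax / E4M3_MAX if amax > 0 else 1.0
    return [round_to_e4m3(w / scale) for w in weights], scale


def dequantize(quantized, scale):
    """Map quantized values back to the original range."""
    return [q * scale for q in quantized]


if __name__ == "__main__":
    weights = [0.5, -1.0, 0.25, 0.3]
    q, scale = quantize_per_tensor(weights)
    restored = dequantize(q, scale)
    for w, r in zip(weights, restored):
        print(f"{w:+.4f} -> {r:+.4f} (abs error {abs(w - r):.5f})")
```

The scale maps the largest absolute weight onto 448 so the full FP8 range is used; real toolchains typically derive scales from calibration data and may also quantize activations, which this sketch leaves out.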
Model tree for JeiganS/ML2-123B-Magnum-Diamond_fp8
- Base model: mistralai/Mistral-Large-Instruct-2411
- Finetuned: Doctor-Shotgun/ML2-123B-Magnum-Diamond