Serve this model with vLLM or TensorRT-LLM.
Constraints:
- Blackwell GPUs only.
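
As a rough illustration of serving with vLLM, the sketch below assembles a `vllm serve` command line for this checkpoint. The tensor-parallel size, context length, and port are assumptions chosen for the example and are not stated in this card; adjust them to your hardware. It also assumes a vLLM build recent enough to load NVFP4 checkpoints on Blackwell GPUs. The helper name `build_vllm_command` is hypothetical.

```python
import shlex

# Repository ID taken from this model card.
MODEL_ID = "kalbon/Behemoth-X-123B-v2-NVFP4"


def build_vllm_command(model_id=MODEL_ID,
                       tensor_parallel_size=2,   # assumption: two GPUs
                       max_model_len=8192,       # assumption: 8k context
                       port=8000):               # assumption: default-style port
    """Return the argv list for launching vLLM's OpenAI-compatible server."""
    return [
        "vllm", "serve", model_id,
        "--tensor-parallel-size", str(tensor_parallel_size),
        "--max-model-len", str(max_model_len),
        "--port", str(port),
    ]


if __name__ == "__main__":
    # Print the command so it can be inspected or pasted into a shell.
    print(shlex.join(build_vllm_command()))
```

Running the script only prints the command; pass the list to `subprocess.run` or paste the printed line into a shell on a Blackwell machine to start the server.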
Model tree for kalbon/Behemoth-X-123B-v2-NVFP4:
- Base model: mistralai/Mistral-Large-Instruct-2411
- Finetuned: TheDrummer/Behemoth-X-123B-v2