For inference with sglang and kt-kernel: https://lmsys.org/blog/2025-10-22-KTransformers/
This version is packed specifically for NUMA tensor parallel = 4
- Downloads last month
- 22
Model tree for CPU-Hybrid-MoE/DeepSeek-V3-0324-CPU-NUMA4-AMXINT8
Base model
deepseek-ai/DeepSeek-V3-0324