RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16 Text Generation • 11B • Updated Sep 22, 2025 • 2.47k • 3
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8 Text Generation • 71B • Updated Sep 22, 2025 • 2.3k • 13
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8 Text Generation • 8B • Updated Feb 27, 2025 • 6.33k • 2
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w4a16 Text Generation • 2B • Updated Feb 27, 2025 • 3.52k
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8 Text Generation • 71B • Updated Feb 27, 2025 • 218 • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w4a16 Text Generation • 2B • Updated Feb 27, 2025 • 359 • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8 Text Generation • 15B • Updated Feb 27, 2025 • 4.87k • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w4a16 Text Generation • 3B • Updated Feb 27, 2025 • 1.1k • 1
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16 Text Generation • 6B • Updated Feb 27, 2025 • 1.24k • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8 Text Generation • Updated Feb 27, 2025 • 216 • 13
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8 Text Generation • 8B • Updated Feb 27, 2025 • 7.92k • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8 Text Generation • 2B • Updated Feb 27, 2025 • 9.18k • 2
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w4a16 Text Generation • 11B • Updated Feb 27, 2025 • 4.64k • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w4a16 Text Generation • 0.6B • Updated Feb 27, 2025 • 5 • 1
ISTA-DASLab/Mistral-Small-3.1-24B-Instruct-2503-GPTQ-4b-128g Image-Text-to-Text • 5B • Updated Apr 6, 2025 • 6.7k • 17
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 44.7k • 9
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic Image-Text-to-Text • 109B • Updated Sep 22, 2025 • 10.7k • 28
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16 Image-Text-to-Text • 5B • Updated Oct 29, 2025 • 2.13k • 10
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8 Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 266 • 5