-
-
-
-
-
-
Inference Providers
Active filters:
vllm
nm-testing/Meta-Llama-3.1-8B-Instruct-FP8-hf
Text Generation
•
8B
•
Updated
•
27
RedHatAI/Llama-3.2-11B-Vision-Instruct-FP8-dynamic
Text Generation
•
11B
•
Updated
•
12k
•
24
RedHatAI/Llama-3.2-1B-Instruct-FP8-dynamic
Text Generation
•
1B
•
Updated
•
115k
•
3
RedHatAI/Llama-3.2-3B-Instruct-FP8-dynamic
Text Generation
•
4B
•
Updated
•
129
•
3
RedHatAI/Llama-3.2-90B-Vision-Instruct-FP8-dynamic
Text Generation
•
89B
•
Updated
•
5.61k
•
10
soprasteria/Mixtral-8x7B-Instruct-v0.1-FP8
47B
•
Updated
RedHatAI/Phi-3.5-mini-instruct-FP8-KV
Text Generation
•
4B
•
Updated
•
33
•
2
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
•
42
•
2
RedHatAI/Qwen2.5-0.5B-quantized.w8a16
Text Generation
•
0.4B
•
Updated
RedHatAI/Qwen2.5-1.5B-quantized.w8a16
Text Generation
•
0.8B
•
Updated
RedHatAI/Qwen2.5-3B-quantized.w8a16
Text Generation
•
1B
•
Updated
RedHatAI/Qwen2.5-7B-quantized.w8a16
Text Generation
•
3B
•
Updated
•
1
RedHatAI/Qwen2.5-32B-quantized.w8a16
Text Generation
•
9B
•
Updated
•
2
RedHatAI/Qwen2.5-72B-quantized.w8a16
Text Generation
•
20B
•
Updated
RedHatAI/pixtral-12b-FP8-dynamic
Text Generation
•
13B
•
Updated
•
90
•
10
mlx-community/Ministral-8B-Instruct-2410-bf16
8B
•
Updated
•
17
•
2
mlx-community/Ministral-8B-Instruct-2410-4bit
1B
•
Updated
•
124
•
9
mlx-community/Ministral-8B-Instruct-2410-8bit
2B
•
Updated
•
25
•
2
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
Text Generation
•
71B
•
Updated
•
173
•
14
TouchNight/Ministral-8B-Instruct-2410-HF
8B
•
Updated
•
22
TouchNight/Ministral-8B-Instruct-2410-HF-Q5_K_M-GGUF
8B
•
Updated
•
4
ijohn07/Ministral-8B-Instruct-2410-HF-Q8_0-GGUF
8B
•
Updated
•
4
adriabama06/reader-lm-1.5b-AWQ
Text Generation
•
0.4B
•
Updated
•
3
•
1
sasha0552/Ministral-8B-Instruct-2410
Updated
aashish1904/Ministral-8B-Instruct-2410-HF-Q4_K_M-GGUF
8B
•
Updated
•
17
•
1
QuantFactory/TouchNight-Ministral-8B-Instruct-2410-HF-GGUF
8B
•
Updated
•
101
•
2
aashish1904/Ministral-8B-Instruct-2410-HF-Q2_K-GGUF
8B
•
Updated
•
16
•
2
GrimsenClory/Ministral-8B-Instruct-2410-Q6_K-GGUF
8B
•
Updated
•
27
QuantFactory/Ministral-8B-Instruct-2410-GGUF
8B
•
Updated
•
222
•
2
gphorvath/Ministral-8B-Instruct-2410-Q4_K_M-GGUF
8B
•
Updated
•
28