-
-
-
-
-
-
Inference Providers
Active filters:
4bit
ModelCloud/Meta-Llama-3.1-405B-Instruct-gptq-4bit
Text Generation
•
59B
•
Updated
•
2
legraphista/gemma-2-2b-it-IMat-GGUF
Text Generation
•
3B
•
Updated
•
1.21k
•
2
legraphista/gemma-2-2b-IMat-GGUF
Text Generation
•
3B
•
Updated
•
781
•
1
thesven/Mistral-7B-Instruct-v0.3-GPTQ-4bit
Text Generation
•
1B
•
Updated
•
284
•
1
legraphista/Palmyra-Fin-70B-32K-IMat-GGUF
Text Generation
•
71B
•
Updated
•
1.03k
•
10
legraphista/internlm2_5-1_8b-chat-IMat-GGUF
Text Generation
•
2B
•
Updated
•
388
legraphista/internlm2_5-20b-chat-IMat-GGUF
Text Generation
•
20B
•
Updated
•
518
legraphista/shieldgemma-2b-IMat-GGUF
Text Generation
•
3B
•
Updated
•
1.14k
legraphista/shieldgemma-9b-IMat-GGUF
Text Generation
•
9B
•
Updated
•
579
legraphista/shieldgemma-27b-IMat-GGUF
Text Generation
•
27B
•
Updated
•
908
•
1
legraphista/Qwen2-Math-1.5B-Instruct-IMat-GGUF
Text Generation
•
2B
•
Updated
•
483
legraphista/Qwen2-Math-7B-Instruct-IMat-GGUF
Text Generation
•
8B
•
Updated
•
382
•
1
legraphista/Qwen2-Math-72B-Instruct-IMat-GGUF
Text Generation
•
73B
•
Updated
•
802
ModelCloud/EXAONE-3.0-7.8B-Instruct-gptq-4bit
2B
•
Updated
•
2
•
3
legraphista/Hermes-3-Llama-3.1-8B-IMat-GGUF
Text Generation
•
8B
•
Updated
•
1.18k
•
1
legraphista/Hermes-3-Llama-3.1-70B-IMat-GGUF
Text Generation
•
71B
•
Updated
•
815
•
1
legraphista/Llama-3.1-Minitron-4B-Width-Base-GGUF
Text Generation
•
5B
•
Updated
•
314
•
13
legraphista/Minitron-4B-Base-GGUF
Text Generation
•
4B
•
Updated
•
441
legraphista/Minitron-8B-Base-GGUF
Text Generation
•
8B
•
Updated
•
112
legraphista/Llama-3.1-Storm-8B-IMat-GGUF
Text Generation
•
8B
•
Updated
•
922
legraphista/Phi-3.5-mini-instruct-IMat-GGUF
Text Generation
•
4B
•
Updated
•
876
legraphista/Mistral-NeMo-Minitron-8B-Base-IMat-GGUF
Text Generation
•
8B
•
Updated
•
518
•
1
legraphista/unsloth-Phi-3.5-mini-instruct-IMat-GGUF
Text Generation
•
4B
•
Updated
•
540
speakleash/Bielik-11B-v2.2-Instruct-MLX-4bit
Text Generation
•
2B
•
Updated
•
30
•
3
speakleash/Bielik-11B-v2.2-Instruct-Quanto-4bit
Text Generation
•
6B
•
Updated
•
14
•
3
0xroyce/Valkyrie-Llama-3.1-8B-bnb-4bit
Text Generation
•
8B
•
Updated
•
55
•
1
alexwww94/glm-4v-9b-gptq-4bit
3B
•
Updated
•
34
•
7
0xroyce/Plutus-Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation
•
8B
•
Updated
•
350
•
7
legraphista/c4ai-command-r-plus-08-2024-IMat-GGUF
Text Generation
•
104B
•
Updated
•
1.91k
•
6
jhangmez/CHATPRG-v1.2-Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation
•
8B
•
Updated
•
42
•
1