Edit Models filters

Apps

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

374

Full-text search

Active filters: 4bit

ModelCloud/Meta-Llama-3.1-405B-Instruct-gptq-4bit

Text Generation • 59B • Updated Jul 30, 2024 • 2

legraphista/gemma-2-2b-it-IMat-GGUF

Text Generation • 3B • Updated Jul 31, 2024 • 1.21k • 2

legraphista/gemma-2-2b-IMat-GGUF

Text Generation • 3B • Updated Jul 31, 2024 • 781 • 1

thesven/Mistral-7B-Instruct-v0.3-GPTQ-4bit

Text Generation • 1B • Updated Aug 2, 2024 • 284 • 1

legraphista/Palmyra-Fin-70B-32K-IMat-GGUF

Text Generation • 71B • Updated Aug 2, 2024 • 1.03k • 10

legraphista/internlm2_5-1_8b-chat-IMat-GGUF

Text Generation • 2B • Updated Aug 5, 2024 • 388

legraphista/internlm2_5-20b-chat-IMat-GGUF

Text Generation • 20B • Updated Aug 5, 2024 • 518

legraphista/shieldgemma-2b-IMat-GGUF

Text Generation • 3B • Updated Aug 5, 2024 • 1.14k

legraphista/shieldgemma-9b-IMat-GGUF

Text Generation • 9B • Updated Aug 5, 2024 • 579

legraphista/shieldgemma-27b-IMat-GGUF

Text Generation • 27B • Updated Aug 5, 2024 • 908 • 1

legraphista/Qwen2-Math-1.5B-Instruct-IMat-GGUF

Text Generation • 2B • Updated Aug 8, 2024 • 483

legraphista/Qwen2-Math-7B-Instruct-IMat-GGUF

Text Generation • 8B • Updated Aug 8, 2024 • 382 • 1

legraphista/Qwen2-Math-72B-Instruct-IMat-GGUF

Text Generation • 73B • Updated Aug 8, 2024 • 802

ModelCloud/EXAONE-3.0-7.8B-Instruct-gptq-4bit

2B • Updated Aug 9, 2024 • 2 • 3

legraphista/Hermes-3-Llama-3.1-8B-IMat-GGUF

Text Generation • 8B • Updated Aug 16, 2024 • 1.18k • 1

legraphista/Hermes-3-Llama-3.1-70B-IMat-GGUF

Text Generation • 71B • Updated Aug 16, 2024 • 815 • 1

legraphista/Llama-3.1-Minitron-4B-Width-Base-GGUF

Text Generation • 5B • Updated Aug 17, 2024 • 314 • 13

legraphista/Minitron-4B-Base-GGUF

Text Generation • 4B • Updated Aug 17, 2024 • 441

legraphista/Minitron-8B-Base-GGUF

Text Generation • 8B • Updated Aug 19, 2024 • 112

legraphista/Llama-3.1-Storm-8B-IMat-GGUF

Text Generation • 8B • Updated Aug 20, 2024 • 922

legraphista/Phi-3.5-mini-instruct-IMat-GGUF

Text Generation • 4B • Updated Aug 20, 2024 • 876

legraphista/Mistral-NeMo-Minitron-8B-Base-IMat-GGUF

Text Generation • 8B • Updated Aug 21, 2024 • 518 • 1

legraphista/unsloth-Phi-3.5-mini-instruct-IMat-GGUF

Text Generation • 4B • Updated Aug 25, 2024 • 540

speakleash/Bielik-11B-v2.2-Instruct-MLX-4bit

Text Generation • 2B • Updated Aug 28, 2024 • 30 • 3

speakleash/Bielik-11B-v2.2-Instruct-Quanto-4bit

Text Generation • 6B • Updated Oct 7, 2024 • 14 • 3

0xroyce/Valkyrie-Llama-3.1-8B-bnb-4bit

Text Generation • 8B • Updated Aug 27, 2024 • 55 • 1

alexwww94/glm-4v-9b-gptq-4bit

3B • Updated Mar 14 • 34 • 7

0xroyce/Plutus-Meta-Llama-3.1-8B-Instruct-bnb-4bit

Text Generation • 8B • Updated Jan 12 • 350 • 7

legraphista/c4ai-command-r-plus-08-2024-IMat-GGUF

Text Generation • 104B • Updated Aug 31, 2024 • 1.91k • 6

jhangmez/CHATPRG-v1.2-Meta-Llama-3.1-8B-Instruct-GGUF

Text Generation • 8B • Updated Aug 31, 2024 • 42 • 1