-
-
-
-
-
-
Inference Providers
Active filters: 8-bit
lukealonso/MiniMax-M2.5-NVFP4
130B • Updated
• 17.8k
• 33
lukealonso/MiniMax-M2.5-REAP-139B-A10B-NVFP4
80B • Updated
• 4.18k
• 15
cublya/GPT-OSS-Code-Reasoning-20B
Text Generation
• 22B • Updated
• 71
• 14
mlx-community/Qwen3-TTS-12Hz-1.7B-CustomVoice-8bit
Text-to-Speech
• 0.8B • Updated
• 2.37k
• 11
425B • Updated
• 11.1k
• 8
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
Text Generation
• 18B • Updated
• 259k
• 101
mlx-community/GLM-4.7-Flash-8bit
Text Generation
• Updated
• 40.7k
• 22
inferencerlabs/Qwen3-Coder-Next-MLX-9bit
Text Generation
• 80B • Updated
• 1.77k
• 7
MuXodious/gpt-oss-20b-RichardErkhov-heresy
Text Generation
• 22B • Updated
• 200
• 10
mlx-community/Nanbeige4.1-3B-8bit
Text Generation
• 1B • Updated
• 2.29k
• 13
tacos4me/Step-3.5-Flash-NVFP4
Text Generation
• 111B • Updated
• 1.54k
• 5
inferencerlabs/Qwen3.5-397B-A17B-MLX-9bit
Text Generation
• 396B • Updated
• 3.77k
• 6
Text Generation
• 17B • Updated
• 19.4k
• 8
GadflyII/GLM-4.7-Flash-MTP-NVFP4
Text Generation
• 19B • Updated
• 5.1k
• 3
MaziyarPanahi/Qwen3-0.6B-GGUF
Text Generation
• 0.8B • Updated
• 201k
• 10
FabioSarracino/VibeVoice-Large-Q8
Text-to-Audio
• 9B • Updated
• 2.06k
• 86
RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4
Text Generation
• 133B • Updated
• 21.2k
• 13
kldzj/gpt-oss-120b-heretic-v2
Text Generation
• 117B • Updated
• 877
• 22
justinjja/gpt-oss-120b-Derestricted-MXFP4
2B • Updated
• 916
• 4
lmstudio-community/Qwen3-Coder-Next-MLX-8bit
80B • Updated
• 762k
• 5
p-e-w/gpt-oss-20b-heretic-v3
Text Generation
• 2B • Updated
• 276
• 2
EricRollei/HunyuanImage-3.0-Instruct-Distil-INT8-v2
Text-to-Image
• 83B • Updated
• 12
• 2
MaziyarPanahi/TinyLlama-1.1B-Chat-v1.0-GGUF
Text Generation
• 1B • Updated
• 244
• 2
MaziyarPanahi/Hermes-2-Pro-Llama-3-13B-GGUF
Text Generation
• 12B • Updated
• 76
• 1
biomap-research/proteinglm-100b-int4
50B • Updated
• 189
• 11
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
• 8B • Updated
• 8.64k
• 20
Text Generation
• 15B • Updated
• 124k
• 6
roleplaiapp/oh-dcft-v3.1-claude-3-5-haiku-20241022-Q8_0-GGUF
Text Generation
• 8B • Updated
• 30
• 1
mlx-community/Phi-4-mini-instruct-8bit
Text Generation
• Updated
• 324
• 5
MaziyarPanahi/gemma-3-12b-it-GGUF
Text Generation
• 12B • Updated
• 127k
• 16