Active filters: 8-bit
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w8a8 • Text Generation • 8B • Updated • 3.73k • 2
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a8 • Text Generation • 4B • Updated • 91
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a8 • Text Generation • 14B • Updated • 16 • 2
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a8 • Text Generation • 2B • Updated • 298
Xu-Ouyang/pythia-410m-deduped-int8-step36000-GPTQ-wikitext2 • Text Generation • 0.2B • Updated
Xu-Ouyang/pythia-410m-deduped-int8-step71000-GPTQ-wikitext2 • Text Generation • 0.2B • Updated
Xu-Ouyang/pythia-410m-deduped-int8-step107000-GPTQ-wikitext2 • Text Generation • 0.2B • Updated • 1
Xu-Ouyang/pythia-410m-deduped-int8-step110000-GPTQ-wikitext2 • Text Generation • 0.2B • Updated
Xu-Ouyang/pythia-410m-deduped-int8-step143000-GPTQ-wikitext2 • Text Generation • 0.2B • Updated • 1
Xu-Ouyang/pythia-6.9b-deduped-int8-step36000-GPTQ-wikitext2 • Text Generation • 2B • Updated • 1
Xu-Ouyang/pythia-6.9b-deduped-int8-step71000-GPTQ-wikitext2 • Text Generation • 2B • Updated
Xu-Ouyang/pythia-6.9b-deduped-int8-step107000-GPTQ-wikitext2 • Text Generation • 2B • Updated
Adeptschneider/dyu_to_fr_v8.0_flan-t5-8bit • 0.2B • Updated
Xu-Ouyang/pythia-6.9b-deduped-int8-step110000-GPTQ-wikitext2 • Text Generation • 2B • Updated
Xu-Ouyang/pythia-6.9b-deduped-int8-step143000-GPTQ-wikitext2 • Text Generation • 2B • Updated
nhotin/vistral7B-legalbizai-q8-gguf • Text Generation • 7B • Updated
XelotX/WizardLM-2-8x22B-XelotX-iQuants • Text Generation • 141B • Updated • 244
RedHatAI/Qwen2-0.5B-Instruct-quantized.w8a8 • Text Generation • 0.6B • Updated • 30
RedHatAI/Qwen2-7B-Instruct-quantized.w8a8 • Text Generation • 8B • Updated • 16
PrunaAI/In2Training-FILM-7B-bnb-4bit-smashed • 4B • Updated • 4
Xu-Ouyang/pythia-160m-deduped-int8-step36000-GPTQ-wikitext2 • Text Generation • 99.5M • Updated • 1
Xu-Ouyang/pythia-160m-deduped-int8-step71000-GPTQ-wikitext2 • Text Generation • 99.5M • Updated
Xu-Ouyang/pythia-160m-deduped-int8-step107000-GPTQ-wikitext2 • Text Generation • 99.5M • Updated
Xu-Ouyang/pythia-160m-deduped-int8-step110000-GPTQ-wikitext2 • Text Generation • 99.5M • Updated
Xu-Ouyang/pythia-160m-deduped-int8-step143000-GPTQ-wikitext2 • Text Generation • 99.5M • Updated
Xu-Ouyang/pythia-1b-deduped-int8-step36000-GPTQ-wikitext2 • Text Generation • 0.4B • Updated
Xu-Ouyang/pythia-1b-deduped-int8-step71000-GPTQ-wikitext2 • Text Generation • 0.4B • Updated
Xu-Ouyang/pythia-1b-deduped-int8-step107000-GPTQ-wikitext2 • Text Generation • 0.4B • Updated
Xu-Ouyang/pythia-1b-deduped-int8-step110000-GPTQ-wikitext2 • Text Generation • 0.4B • Updated
Xu-Ouyang/pythia-1b-deduped-int8-step143000-GPTQ-wikitext2 • Text Generation • 0.4B • Updated • 1