Edit Models filters

Apps

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

arxiv: 2204.06745

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

75

Full-text search

Active filters: 2204.06745

RichardErkhov/EleutherAI_-_gpt-neox-20b-8bits

Text Generation • 21B • Updated Apr 23, 2024

stabilityai/japanese-stablelm-2-base-1_6b

Text Generation • 2B • Updated May 2, 2024 • 13

stabilityai/japanese-stablelm-2-instruct-1_6b

Text Generation • 2B • Updated Jul 10, 2024 • 14 • 27

RichardErkhov/stabilityai_-_stablelm-2-1_6b-4bits

Text Generation • 1B • Updated May 3, 2024

RichardErkhov/stabilityai_-_stablelm-2-1_6b-8bits

Text Generation • 2B • Updated May 3, 2024

RichardErkhov/stabilityai_-_stable-code-3b-4bits

Text Generation • 2B • Updated May 4, 2024

RichardErkhov/stabilityai_-_stable-code-3b-8bits

Text Generation • 3B • Updated May 4, 2024

RichardErkhov/stabilityai_-_stablelm-2-12b-4bits

Text Generation • 7B • Updated May 11, 2024 • 2

rob-x-ai/stablelm-2-12b-GGUF

12B • Updated Jun 6, 2024 • 166

QuantFactory/stable-code-3b-GGUF

Text Generation • 3B • Updated Jul 16, 2024 • 287 • 1

amd/AMD-Llama-135m

Text Generation • 0.1B • Updated Oct 9, 2024 • 6.72k • 117

amd/AMD-Llama-135m-code

Text Generation • 0.1B • Updated Oct 9, 2024 • 247 • 13

stabilityai/ar-stablelm-2-base

Text Generation • 2B • Updated Dec 6, 2024 • 17 • 6

RichardErkhov/stabilityai_-_stablelm-3b-4e1t-gguf

3B • Updated Jul 30, 2024 • 141

RichardErkhov/stabilityai_-_japanese-stablelm-3b-4e1t-instruct-gguf

3B • Updated Jul 30, 2024 • 144

RichardErkhov/stabilityai_-_japanese-stablelm-3b-4e1t-base-gguf

3B • Updated Jul 30, 2024 • 56

QuantFactory/AMD-Llama-135m-GGUF

0.1B • Updated Oct 6, 2024 • 106 • 3

QuantFactory/AMD-Llama-135m-code-GGUF

0.1B • Updated Oct 3, 2024 • 21 • 2

mav23/AMD-Llama-135m-GGUF

0.1B • Updated Oct 3, 2024 • 54

RichardErkhov/amd_-_AMD-Llama-135m-gguf

0.1B • Updated Oct 4, 2024 • 27

RichardErkhov/stabilityai_-_stablelm-3b-4e1t-4bits

2B • Updated Oct 6, 2024

RichardErkhov/stabilityai_-_stablelm-3b-4e1t-8bits

3B • Updated Oct 6, 2024

mav23/AMD-Llama-135m-code-GGUF

0.1B • Updated Oct 9, 2024 • 21

akswelh/NEOX

Updated Oct 14, 2024

mav23/gpt-neox-20b-GGUF

21B • Updated Oct 15, 2024 • 78

RichardErkhov/Upword_-_gpt-neox-20b-embeddings-gguf

21B • Updated Oct 27, 2024 • 48

RichardErkhov/stabilityai_-_stablelm-2-12b-gguf

12B • Updated Nov 1, 2024 • 529

RichardErkhov/EleutherAI_-_gpt-neox-20b-gguf

21B • Updated Nov 2, 2024 • 49

RichardErkhov/stabilityai_-_stablelm-2-1_6b-gguf

2B • Updated Nov 4, 2024 • 700

mllmTeam/PhoneLM-0.5B

Text Generation • 0.5B • Updated Nov 14, 2024 • 10