Models: 137

Active filters: long-context

mlx-community/answerdotai-ModernBERT-base-6bit

Fill-Mask • 41.3M • Updated Apr 2 • 5

mlx-community/answerdotai-ModernBERT-base-8bit

Fill-Mask • 53M • Updated Apr 2 • 10

mlx-community/answerdotai-ModernBERT-base-bf16

Fill-Mask • 0.2B • Updated Apr 2 • 17 • 1

mlx-community/answerdotai-ModernBERT-Large-Instruct-4bit

Fill-Mask • 70M • Updated Apr 2 • 8

mlx-community/answerdotai-ModernBERT-Large-Instruct-6bit

Fill-Mask • 98M • Updated Apr 2 • 10

mlx-community/answerdotai-ModernBERT-Large-Instruct-8bit

Fill-Mask • 0.1B • Updated Apr 2 • 9

mlx-community/answerdotai-ModernBERT-Large-Instruct-bf16

Fill-Mask • 0.4B • Updated Apr 2 • 12

moonshotai/Kimi-VL-A3B-Instruct

Image-Text-to-Text • 16B • Updated Jul 30 • 95.5k • 237

TheCluster/Llama-3.1-8B-UltraLong-1M-Instruct-mlx-6bit

Text Generation • 2B • Updated Apr 15 • 2

TheCluster/Llama-3.1-8B-UltraLong-4M-Instruct-mlx-6bit

Text Generation • 2B • Updated Apr 15 • 2

TheCluster/Llama-3.1-8B-UltraLong-4M-Instruct-mlx-4bit

Text Generation • 1B • Updated Apr 15 • 1

TheCluster/Llama-3.1-8B-UltraLong-1M-Instruct-mlx-4bit

Text Generation • 1B • Updated Apr 15 • 6

thomas-sounack/BioClinical-ModernBERT-large

Fill-Mask • Updated 25 days ago • 314k • 10

stefan-it/ModernBERT-large-tokenizer-fix

Fill-Mask • 0.4B • Updated Jul 16 • 4

Tongyi-Zhiwen/QwenLong-L1-32B

Text Generation • 33B • Updated Jun 9 • 1.45k • 160

KnutJaegersberg/QwenLong-L1-32B-Q8_0-GGUF

33B • Updated May 26 • 7 • 3

WaveCut/QwenLong-L1-32B-mlx-4Bit

5B • Updated May 26 • 6 • 2

WaveCut/QwenLong-L1-32B-mlx-8Bit

9B • Updated May 26 • 7 • 2

mradermacher/QwenLong-L1-32B-GGUF

33B • Updated Jul 31 • 361 • 8

mradermacher/QwenLong-L1-32B-i1-GGUF

33B • Updated Jul 11 • 870

LSX-UniWue/ModernGBERT_1B

Feature Extraction • Updated 8 days ago • 1.1k • 6

LSX-UniWue/ModernGBERT_134M

Feature Extraction • 0.2B • Updated 8 days ago • 1.04k • 5

cnfusion/QwenLong-L1-32B-mlx-4Bit

5B • Updated May 28 • 2

cnfusion/QwenLong-L1-32B-mlx-3Bit

4B • Updated May 28 • 3 • 1

Mungert/QwenLong-L1-32B-GGUF

Text Generation • 33B • Updated Sep 24 • 647 • 9

Tongyi-Zhiwen/QwenLong-L1-32B-AWQ

6B • Updated May 29 • 3 • 10

RoadToNowhere/QwenLong-L1-32B-abliterated-Q4_K_M-GGUF

33B • Updated May 31 • 4

Narutoouz/QwenLong-L1-32B-4bit-DWQ

Text Generation • 5B • Updated Jun 1 • 7

mradermacher/QwenLong-L1-32B-abliterated-GGUF

33B • Updated Jul 31 • 65 • 1

Narutoouz/GLM-4-9B-0414-4bit-DWQ

Text Generation • 1B • Updated Jun 1 • 34 • 1