Models: 18

Active filters: kernel

drbh/img2gray

Updated Aug 18, 2025 • 2

RedHatAI/quantization

Updated Jul 27, 2025 • 6

RedHatAI/moe

Updated Jul 25, 2025 • 3

kernels-community/mamba-ssm

Updated 7 days ago • 1.89k • 1

medmekk/LlamaRMSNorm-triton

Updated Mar 26, 2025

medmekk/rmsnorm

Updated Mar 27, 2025

medmekk/triton-llama-mlp

Updated Apr 10, 2025

medmekk/triton-llama-attn

Updated Mar 28, 2025

cdreetz/kwen2.5-1.5b

Text Generation • 2B • Updated Jun 4, 2025 • 2

EricB/kernels-paged-attention-metal

Updated Jun 26, 2025 • 1

cdreetz/kwen2.5-1.5b-v2

Text Generation • 2B • Updated Jul 17, 2025

medmekk/triton-flash-attn-sink-clone

Updated Jul 24, 2025 • 1

JinnP/Qwen3-8B-Kernelbook-SFT-HF

Text Generation • 8B • Updated Aug 25, 2025 • 3 • 1

mradermacher/Qwen3-8B-Kernelbook-SFT-HF-GGUF

8B • Updated Aug 25, 2025 • 108 • 1

drbh/yamoe

Updated Sep 19, 2025 • 2

gagan3012/batch_invariant_kernel

Updated Sep 11, 2025 • 3

shisa-ai/megablocks-hip

Updated Nov 3, 2025 • 1

AhmedAyman/k2-think-cuda-1505

Text Generation • Updated Oct 26, 2025 • 8