-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 28.4k • 73 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 38.5k • • 389 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 483k • 139 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 120k • • 738
Collections
Discover the best community collections!
Collections trending this week
-
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 75 -
inclusionAI/LLaDA2.0-flash
Text Generation • 103B • Updated • 428 • 58 -
inclusionAI/LLaDA2.0-mini
Text Generation • 16B • Updated • 5.15k • 48 -
inclusionAI/LLaDA2.0-flash-preview
Text Generation • 103B • Updated • 125 • 69
-
unsloth/Nemotron-3-Nano-30B-A3B-GGUF
Text Generation • 32B • Updated • 67.7k • 155 -
unsloth/GLM-4.7-GGUF
Text Generation • 358B • Updated • 23.4k • 41 -
unsloth/Qwen-Image-Edit-2511-GGUF
Image-to-Image • 20B • Updated • 33.6k • 108 -
unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF
24B • Updated • 125k • 61
-
nvidia/Nemotron-Cascade-8B
Text Generation • 8B • Updated • 1.69k • 40 -
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation • 8B • Updated • 1.13k • 25 -
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation • 15B • Updated • 2.11k • 43 -
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
Text Generation • Updated • 6
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 28.4k • 73 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 38.5k • • 389 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 483k • 139 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 120k • • 738
-
unsloth/Nemotron-3-Nano-30B-A3B-GGUF
Text Generation • 32B • Updated • 67.7k • 155 -
unsloth/GLM-4.7-GGUF
Text Generation • 358B • Updated • 23.4k • 41 -
unsloth/Qwen-Image-Edit-2511-GGUF
Image-to-Image • 20B • Updated • 33.6k • 108 -
unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF
24B • Updated • 125k • 61
-
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 75 -
inclusionAI/LLaDA2.0-flash
Text Generation • 103B • Updated • 428 • 58 -
inclusionAI/LLaDA2.0-mini
Text Generation • 16B • Updated • 5.15k • 48 -
inclusionAI/LLaDA2.0-flash-preview
Text Generation • 103B • Updated • 125 • 69
-
nvidia/Nemotron-Cascade-8B
Text Generation • 8B • Updated • 1.69k • 40 -
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation • 8B • Updated • 1.13k • 25 -
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation • 15B • Updated • 2.11k • 43 -
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
Text Generation • Updated • 6