Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 28 days ago • 462
cwm Collection Collection for Code World Model, an agentic coding model from FAIR. • 3 items • Updated Sep 24 • 17
MobileLLM-R1 Collection MobileLLM-R1, a series of sub-billion parameter reasoning models • 7 items • Updated 21 days ago • 21
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 373
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 3 items • Updated 4 days ago • 128
Llama 4 Collection Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 3 days ago • 50
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 209