TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar Paper • 2510.14972 • Published 17 days ago • 29
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published 12 days ago • 107
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 12 days ago • 101
Running on CPU Upgrade 1.01k 1.01k The Smol Training Playbook: The Secrets to Building World-Class LLMs 📝