Data-Efficient RLVR via Off-Policy Influence Guidance Paper • 2510.26491 • Published Oct 30, 2025 • 11
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 2.97k
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 206