INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats Paper • 2510.25602 • Published 13 days ago • 66
Running on CPU Upgrade 1.95k 1.95k The Smol Training Playbook: The Secrets to Building World-Class LLMs 📝 Explore loss curves for training LLMs
Running 3.46k 3.46k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
flax-sentence-embeddings/stackoverflow_mpnet-base Sentence Similarity • Updated Jul 26, 2021 • 139 • 5