FineWeb-HQ datasets Collection Collection containing FineWeb-HQ and FineWeb2-HQ quality filtered datasets. • 3 items • Updated 25 days ago
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments Paper • 2509.14233 • Published Sep 17 • 12
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments Paper • 2509.14233 • Published Sep 17 • 12
Benchmarking Optimizers for Large Language Model Pretraining Paper • 2509.01440 • Published Sep 1 • 24
Benchmarking Optimizers for Large Language Model Pretraining Paper • 2509.01440 • Published Sep 1 • 24
Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed Paper • 2406.04443 • Published Jun 6, 2024
Benchmarking Optimizers for Large Language Model Pretraining Paper • 2509.01440 • Published Sep 1 • 24
BaCaDI: Bayesian Causal Discovery with Unknown Interventions Paper • 2206.01665 • Published Jun 3, 2022 • 2
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 73
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 73
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 73
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 49
view post Post 5177 SmolVLM is now available on PocketPal — you can run it offline on your smartphone to interpret the world around you. 🌍📱And check out this real-time camera demo by @ngxson , powered by llama.cpp:https://github.com/ngxson/smolvlm-realtime-webcamhttps://x.com/pocketpal_ai See translation 4 replies · ❤️ 11 11 😎 1 1 + Reply