KletterMix: Climbing Toward High-Quality German Pretraining Data
Submitted by Ruben Härle
Artificial Intelligence & Machine Learning Lab at TU Darmstadt