Running 3.6k The Ultra-Scale Playbook π 3.6k The ultimate guide to training LLM on large GPU Clusters
shenzhi-wang/Gemma-2-9B-Chinese-Chat Text Generation β’ 9B β’ Updated Jul 4, 2024 β’ 755 β’ β’ 78