baidu/ERNIE-4.5-VL-28B-A3B-Thinking Image-Text-to-Text β’ 30B β’ Updated 11 days ago β’ 652 β’ 513
Running 3.62k The Ultra-Scale Playbook π 3.62k The ultimate guide to training LLM on large GPU Clusters
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV Visual Question Answering β’ 9B β’ Updated Oct 25, 2024 β’ 777 β’ 14
Running 86 Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks π 86 Evaluate multilingual models using FineTasks