Benchmark dataset Collection contains the dataset used to evaluate the model • 1 item • Updated 3 days ago • 1
Exprimental-x25 Collection Experiments conducted (Please Do Not use these Models) • 3 items • Updated 4 days ago • 1
view article Article AI Energy Score v2: Refreshed Leaderboard, now with Reasoning 🧠26 days ago • 9
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 7 days ago • 25
arabic datasets Collection datasets related to Arabic-tunisian dialect • 17 items • Updated Nov 22 • 3
Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance Paper • 2504.09753 • Published Apr 13 • 6
view article Article Releasing the largest multilingual open pretraining dataset Nov 13, 2024 • 104
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic Paper • 2509.01363 • Published Sep 1 • 58
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 395
Whisper Release Collection Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 146
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Jul 31 • 70