Audio Language Model Collection Open-source models, including Malaysian context and datasets. • 23 items • Updated Jun 28, 2025 • 2
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond Paper • 2408.03900 • Published Aug 7, 2024 • 10
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133
Vietnamese speech dataset Collection for any speech-related task, including but not limited to speech-to-text, text-to-speech, speech classification, and speaker verification. • 34 items • Updated Jul 8, 2025 • 37