video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models Paper • 2506.15220 • Published Jun 18 • 1
SALMONN: Towards Generic Hearing Abilities for Large Language Models Paper • 2310.13289 • Published Oct 20, 2023 • 17