Awesome papers from 臺大李宏毅 (Hung-yi Lee) Collection • Recent papers authored by Hung-yi Lee. Sorted by ID. • 8 items • Updated 3 days ago • 17
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models Paper • 2510.16917 • Published 8 days ago • 19
Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations Paper • 2510.16893 • Published 8 days ago • 17
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models Paper • 2510.06917 • Published 19 days ago • 34
Game-Time: Evaluating Temporal Dynamics in Spoken Language Models Paper • 2509.26388 • Published 27 days ago • 26
TAU: A Benchmark for Cultural Sound Understanding Beyond Semantics Paper • 2509.26329 • Published 27 days ago • 2
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Paper • 2507.02768 • Published Jul 3 • 18
STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models Paper • 2507.15375 • Published Jul 21 • 30
Mitigating Object Hallucinations via Sentence-Level Early Intervention Paper • 2507.12455 • Published Jul 16 • 7
Einstein Fields: A Neural Perspective To Computational General Relativity Paper • 2507.11589 • Published Jul 15 • 8
Evaluations of Large Audio-Language Models (LALMs) Collection • This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17 • 3
Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper • 2506.09250 • Published Jun 10 • 27
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper • 2506.06941 • Published Jun 7 • 15
Is Extending Modality The Right Path Towards Omni-Modality? Paper • 2506.01872 • Published Jun 2 • 23
Audio-Aware Large Language Models as Judges for Speaking Styles Paper • 2506.05984 • Published Jun 6 • 15
Speechless: Speech Instruction Training Without Speech for Low Resource Languages Paper • 2505.17417 • Published May 23 • 14
Reverse Preference Optimization for Complex Instruction Following Paper • 2505.22172 • Published May 28 • 6
How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective Paper • 2505.21505 • Published May 27 • 18