Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations Paper • 2510.16893 • Published Oct 19 • 17
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models Paper • 2510.16917 • Published Oct 19 • 19
STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models Paper • 2507.15375 • Published Jul 21 • 30
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models Paper • 2510.06917 • Published Oct 8 • 34
Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models Paper • 2505.17496 • Published May 23 • 2
Game-Time: Evaluating Temporal Dynamics in Spoken Language Models Paper • 2509.26388 • Published Sep 30 • 26
Evaluations of Large Audio-Language Models (LALMs) Collection This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17 • 4
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Paper • 2507.02768 • Published Jul 3 • 18