When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs • Paper • 2508.03365 • Published Aug 5, 2025
Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models • Paper • 2508.04196 • Published Aug 6, 2025