Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer

Publication
Association for the Advancement of Artificial Intelligence(AAAI) 2024