To our knowledge, the audiovisual aerial scene recognition task has not been explored before. To facilitate research in this field, we construct a new dataset named ADVANCE, with high-quality images and scene labels. In total, it contains 5075 pairs of aerial images and sounds, classified into 13 classes.