Tian et al. Audio-visual Event Localization in Unconstrained Videos. 23 Mar. 2018.