Sun, Chen, Zhou, Li, Cao, Zheng, 2021. A non-hierarchical attention network with modality dropout for textual response generation in multimodal dialogue systems.