Yu, 2020. Multimodal Aggregation Approach for Memory Vision-Voice Indoor Navigation with Meta-Learning.