Zhao, et al.. MUSE: Parallel Multi-scale Attention for Sequence to Sequence Learning. 17 Nov. 2019.