Liu, et al.. ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-attention. 23 Mar. 2022.