Zhangand Hashimoto. On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies. 12 Apr. 2021.