Schwenk. Efficient Training of Large Neural Networks for Language Modeling. IEEE, doi:10.1109/ijcnn.2004.1381158.