Wang, Yu, Yu, Dai, Tsvetkov, Cao, 2022. SimVLM: Simple Visual Language Model Pretraining with Weak Supervision.