Zhong, Yang, Zhang, Li, Codella, Li, Zhou, Dai, Yuan, Li, Gao, 2021. RegionCLIP: Region-based Language-Image Pretraining.