Sadhu, A., Chen, K., Nevatia, R., 2020. Video Object Grounding Using Semantic Roles in Language Description, in: . IEEE.. https://doi.org/10.1109/cvpr42600.2020.01043