Li, Wang, Xiao, Chua, 2022. Equivariant and Invariant Grounding for Video Question Answering.