Qu, Li, Liu, Xiong, Zhang, Chu, Wang, Qi, Song, 2022. Variational Policy Propagation for Multi-agent Reinforcement Learning.