Azizzadenesheli, et al.. Policy Gradient in Partially Observable Environments: Approximation and Convergence. 24 May 2020.