Schulte, et al.. $\mathbf{q}$- and $\mathbf{a}$-learning Methods for Estimating Optimal Dynamic Treatment Regimes. no. 4, Institute of Mathematical Statistics, 2014, doi:10.1214/13-sts450.