Bergstrom, Reppy, 2012. Nested data-parallelism on the GPU. ACM Press. https://doi.org/10.1145/2364527.2364563