Sugano, Bulling, 2016. Seeing with Humans: Gaze-Assisted Neural Image Captioning.