Mees, Hermann, Rosete-Beas, Burgard, 2021. CALVIN: A Benchmark for Language-conditioned Policy Learning for Long-horizon Robot Manipulation Tasks.