Takahashi, Noma, Asada, 2008. Efficient Behavior Learning Based on State Value Estimation of Self and Others 22. https://doi.org/10.1163/156855308x344882