An axiomatic basis for Blackwell optimality

release_kkypohgjsveuvbayawzmovev24

by Adam Jonsson

Released as an article.

2017  

Abstract

In the theory of Markov decision processes (MDPs), a Blackwell optimal policy is a policy that is optimal for every discount factor sufficiently close to one. This paper provides an axiomatic basis for Blackwell optimality in discrete-time MDPs with finitely many states and finitely many actions.
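
As a sketch of the definition stated in the abstract, in notation assumed here rather than taken from the paper: write $V_\beta^{\pi}(s)$ for the expected total $\beta$-discounted reward of policy $\pi$ started in state $s$. The symbols $\pi^*$, $\bar\beta$, and $V$ are illustrative choices.

```latex
\[
  \pi^* \text{ is Blackwell optimal}
  \iff
  \exists\, \bar\beta \in [0,1) \ \text{such that}\
  V_\beta^{\pi^*}(s) \ge V_\beta^{\pi}(s)
  \quad \text{for all states } s,\ \text{all policies } \pi,\
  \text{and all } \beta \in (\bar\beta, 1).
\]
```

The threshold $\bar\beta$ must not depend on the competing policy $\pi$ or on the state $s$; this is what "every discount factor sufficiently close to one" requires.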

Archived Files and Locations

application/pdf  145.1 kB
file_thoevrx6urezxbbgsuuk645f3y
arxiv.org (repository)
web.archive.org (webarchive)
Preserved and Accessible
Type  article
Stage   submitted
Date   2017-01-11
Version   v1
Language   en
arXiv  1701.02879v1
Work Entity
Access all versions, variants, and formats of this work (e.g., pre-prints)
Catalog Record
Revision: 30bcc72b-d68a-4c47-b13c-a58a19b83639