BibTeX
CSL-JSON
MLA
Harvard
An axiomatic basis for Blackwell optimality
release_kkypohgjsveuvbayawzmovev24
by
Adam Jonsson
Released
as a article
.
2017
Abstract
In the theory of Markov decision processes (MDPs), a Blackwell optimal policy
is a policy that is optimal for every discount factor sufficiently close to
one. This paper provides an axiomatic basis for Blackwell optimality in
discrete-time MDPs with finitely many states and finitely many actions.
In text/plain
format
Archived Files and Locations
application/pdf 145.1 kB
file_thoevrx6urezxbbgsuuk645f3y
|
arxiv.org (repository) web.archive.org (webarchive) |
Read Archived PDF
Preserved and Accessible
arXiv
1701.02879v1
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
access all versions, variants, and formats of this works (eg, pre-prints)
Cite This
Lookup Links