Equi-Reward Utility Maximizing Design in Stochastic Environments
release_xt3ng2nakje5jifor22rf6twt4
by
Sarah Keren, Luis Pineda, Avigdor Gal, Erez Karpas, Shlomo Zilberstein
2017 p4353-4360
Abstract
We present the Equi Reward Utility Maximizing Design (ER-UMD) problem for redesigning stochastic environments to maximize agent performance. ER-UMD fits well contemporary applications that require offline design of environments where robots and humans act and cooperate. To find an optimal modification sequence we present two novel solution techniques: a compilation that embeds design into a planning problem, allowing use of off-the-shelf solvers to find a solution, and a heuristic search in the modifications space, for which we present an admissible heuristic. Evaluation shows the feasibility of the approach using standard benchmarks from the probabilistic planning competition and a benchmark we created for a vacuum cleaning robot setting.
In application/xml+jats
format
Archived Files and Locations
application/pdf 521.0 kB
file_apiy5cv6njc2foczqt3g3uipa4
|
web.archive.org (webarchive) www.ijcai.org (web) |
paper-conference
Stage
published
Year 2017
access all versions, variants, and formats of this works (eg, pre-prints)
Crossref Metadata (via API)
Worldcat
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar