Software that Learns from its Own Failures release_afflzserk5fgflqcpziqktoeqe

by Martin Monperrus

Released as a article .

2015  

Abstract

All non-trivial software systems suffer from unanticipated production failures. However, those systems are passive with respect to failures and do not take advantage of them in order to improve their future behavior: they simply wait for them to happen and trigger hard-coded failure recovery strategies. Instead, I propose a new paradigm in which software systems learn from their own failures. By using an advanced monitoring system they have a constant awareness of their own state and health. They are designed in order to automatically explore alternative recovery strategies inferred from past successful and failed executions. Their recovery capabilities are assessed by self-injection of controlled failures; this process produces knowledge in prevision of future unanticipated failures.
In text/plain format

Archived Files and Locations

application/pdf  160.0 kB
file_oqolqrv7obfi5opmsad2welilu
arxiv.org (repository)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article
Stage   submitted
Date   2015-02-03
Version   v1
Language   en ?
arXiv  1502.00821v1
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 080db07a-332e-4ddf-9a29-1c9a3bacdcc9
API URL: JSON