A Chaos Engineering System for Live Analysis and Falsification of
Exception-handling in the JVM
release_q6qpdo3de5bopd5tv27sfdyrei
by
Long Zhang, Brice Morin, Philipp Haller, Benoit Baudry, Martin
Monperrus
2018
Abstract
Software systems contain resilience code to handle the failures and
unexpected events that happen in production. It is essential for developers to
understand and assess the resilience of their systems. Chaos engineering is a
technology that aims at assessing resilience and uncovering weaknesses by
actively injecting perturbations in production. In this paper, we propose a
novel design and implementation of a chaos engineering system in Java called
CHAOSMACHINE. It provides a unique and actionable analysis of
exception-handling capabilities in production, at the level of try-catch
blocks. To evaluate our approach, we have deployed CHAOSMACHINE on top of 3
large-scale and well-known Java applications totaling 630k lines of code. Our
results show that CHAOSMACHINE reveals both strengths and weaknesses of the
resilience code of a software system at the level of exception handling.
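To make the idea of perturbing a single try-catch block concrete, the following is a minimal hand-written sketch in Java. It is not the CHAOSMACHINE implementation, which this abstract does not describe in detail. The class TryCatchPerturbation, the injectFailure switch, and the fetchGreeting method are all hypothetical names introduced only for illustration. The sketch assumes that a perturbation amounts to throwing, at the start of a try block, an exception that the matching catch block handles, and then observing whether the program still produces an acceptable result.

```java
// Illustrative sketch only: all names here are hypothetical and are not
// taken from CHAOSMACHINE.
class TryCatchPerturbation {

    // Hypothetical switch: when true, the try block below fails immediately.
    static volatile boolean injectFailure = false;

    // A guarded operation with resilience code in its catch block.
    static String fetchGreeting() {
        try {
            // Perturbation point (assumption): throw an exception of the
            // type that the catch block below is written to handle.
            if (injectFailure) {
                throw new IllegalStateException("injected failure");
            }
            return "hello from primary";
        } catch (IllegalStateException e) {
            // Resilience code under test: fall back to a default value.
            return "hello from fallback";
        }
    }

    // Runs the operation under perturbation and reports whether it still
    // returned a usable value without letting an exception escape.
    static boolean survivesInjection() {
        injectFailure = true;
        try {
            String result = fetchGreeting();
            return result != null && !result.isEmpty();
        } catch (RuntimeException escaped) {
            return false;
        } finally {
            injectFailure = false; // always restore normal behavior
        }
    }

    public static void main(String[] args) {
        System.out.println("normal run: " + fetchGreeting());
        System.out.println("survives injection: " + survivesInjection());
    }
}
```

In this sketch, a try-catch block whose catch branch still yields a usable result under injection would count as a strength, and one that lets the failure escape or returns nothing usable would count as a weakness. A production system would need to inject such failures without editing the source by hand; how CHAOSMACHINE does so is not stated in this abstract.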
arXiv:1805.05246v2