Autonomic Rejuvenation of Cloud Applications as a Countermeasure to Software Anomalies

Pierangelo Di Sanzo, Dimiter R. Avresky, and Alessandro Pellegrini

Published in: Software: Practice and Experience, 2021
pdf Download PDF

Failures in computer systems can be often tracked down to software anomalies of various kinds. In many scenarios, it could be difficult, unfeasible, or unprofitable to carry out extensive debugging activity to spot the causes of anomalies and remove them. In other cases, taking corrective actions may led to undesirable service downtime. In this article, we propose an alternative approach to cope with the problem of software anomalies in cloud-based applications, and we present the design of a distributed autonomic framework that implements our approach. It exploits the elastic capabilities of cloud infrastructures, and relies on machine learning models, proactive rejuvenation techniques and a new load balancing approach. By putting together all these elements, we show that it is possible to improve both availability and performance of applications deployed over heterogeneous cloud regions and subject to frequent failures. Overall, our study demonstrates the viability of our approach, thus opening the way towards it adoption, and encouraging further studies and practical experiences to evaluate and improve it.

BibTeX Entry:

author = {Di Sanzo, Pierangelo and Avresky, Dimiter R. and Pellegrini, Alessandro},
journal = {Software: Practice and Experience},
title = {Autonomic Rejuvenation of Cloud Applications as a Countermeasure to Software Anomalies},
year = {2021},
month = jan,
volume = {51},
number = {1},
pages = {46--71},
issn = {1097-024X},
publisher = {Wiley},
series = {SPE},
doi = {10.1002/spe.2908}