Ghent University Academic Bibliography

A survey on resiliency techniques in cloud computing infrastructures and applications

Carlos Colman-Meixner, Chris Develder (UGent), Massimo Tornatore and Biswanath Mukherjee (2016) IEEE COMMUNICATIONS SURVEYS AND TUTORIALS. 18(3). p.2244-2281
abstract
Today's businesses increasingly rely on cloud computing, which brings both great opportunities and challenges. One of the critical challenges is resiliency: disruptions due to failures (either accidental or because of disasters or attacks) may entail significant revenue losses (e.g., US$ 25.5 billion in 2010 for North America). Such failures may originate at any of the major components in a cloud architecture (and propagate to others): 1) the servers hosting the application; 2) the network interconnecting them (on different scales, inside a data center, up to wide-area connections); or 3) the application itself. We comprehensively survey a large body of work focusing on resilience of cloud computing, in each (or a combination) of the server, network, and application components. First, we present the cloud computing architecture and its key concepts. We highlight both the infrastructure (servers, network) and application components. A key concept is virtualization of infrastructure (i.e., partitioning into logically separate units), and thus we detail the components in both physical and virtual layers. Before moving to the detailed resilience aspects, we provide a qualitative overview of the types of failures that may occur (from the perspective of the layered cloud architecture), and their consequences. The second major part of the paper introduces and categorizes a large number of techniques for cloud computing infrastructure resiliency. This ranges from designing and operating the facilities, servers, networks, to their integration and virtualization (e.g., also including resilience of the middleware infrastructure). The third part focuses on resilience in application design and development. We study how applications are designed, installed, and replicated to survive multiple physical failure scenarios as well as disaster failures.
type
journalArticle (original)
publication status
published
keyword
IBCN, DATA CENTER NETWORK, VIRTUAL MACHINE PLACEMENT, WDM MESH NETWORKS, HIGH-AVAILABILITY, FAULT-TOLERANCE, COMMUNICATION-NETWORKS, OPTICAL NETWORKS, SERVER INTERCONNECTION, DATACENTER NETWORKS, DATA CENTERS
journal title
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS
volume
18
issue
3
pages
2244 - 2281
Web of Science type
Article
Web of Science id
000384887100027
JCR category
COMPUTER SCIENCE, INFORMATION SYSTEMS
JCR impact factor
17.188 (2016)
JCR rank
1/146 (2016)
JCR quartile
1 (2016)
ISSN
1553-877X
DOI
10.1109/COMST.2016.2531104
language
English
UGent publication?
yes
classification
A1
id
8507254
handle
http://hdl.handle.net/1854/LU-8507254
date created
2017-02-03 08:39:01
date last changed
2017-02-13 08:56:48
@article{8507254,
  abstract     = {Today's businesses increasingly rely on cloud computing, which brings both great opportunities and challenges. One of the critical challenges is resiliency: disruptions due to failures (either accidental or because of disasters or attacks) may entail significant revenue losses (e.g., US\$ 25.5 billion in 2010 for North America). Such failures may originate at any of the major components in a cloud architecture (and propagate to others): 1) the servers hosting the application; 2) the network interconnecting them (on different scales, inside a data center, up to wide-area connections); or 3) the application itself. We comprehensively survey a large body of work focusing on resilience of cloud computing, in each (or a combination) of the server, network, and application components. First, we present the cloud computing architecture and its key concepts. We highlight both the infrastructure (servers, network) and application components. A key concept is virtualization of infrastructure (i.e., partitioning into logically separate units), and thus we detail the components in both physical and virtual layers. Before moving to the detailed resilience aspects, we provide a qualitative overview of the types of failures that may occur (from the perspective of the layered cloud architecture), and their consequences. The second major part of the paper introduces and categorizes a large number of techniques for cloud computing infrastructure resiliency. This ranges from designing and operating the facilities, servers, networks, to their integration and virtualization (e.g., also including resilience of the middleware infrastructure). The third part focuses on resilience in application design and development. We study how applications are designed, installed, and replicated to survive multiple physical failure scenarios as well as disaster failures.},
  author       = {Colman-Meixner, Carlos and Develder, Chris and Tornatore, Massimo and Mukherjee, Biswanath},
  issn         = {1553-877X},
  journal      = {IEEE COMMUNICATIONS SURVEYS AND TUTORIALS},
  keyword      = {IBCN,DATA CENTER NETWORK,VIRTUAL MACHINE PLACEMENT,WDM MESH NETWORKS,HIGH-AVAILABILITY,FAULT-TOLERANCE,COMMUNICATION-NETWORKS,OPTICAL NETWORKS,SERVER INTERCONNECTION,DATACENTER NETWORKS,DATA CENTERS},
  language     = {eng},
  number       = {3},
  pages        = {2244--2281},
  title        = {A survey on resiliency techniques in cloud computing infrastructures and applications},
  url          = {http://dx.doi.org/10.1109/COMST.2016.2531104},
  volume       = {18},
  year         = {2016},
}

Chicago
Colman-Meixner, Carlos, Chris Develder, Massimo Tornatore, and Biswanath Mukherjee. 2016. “A Survey on Resiliency Techniques in Cloud Computing Infrastructures and Applications.” IEEE Communications Surveys and Tutorials 18 (3): 2244–2281.
APA
Colman-Meixner, C., Develder, C., Tornatore, M., & Mukherjee, B. (2016). A survey on resiliency techniques in cloud computing infrastructures and applications. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 18(3), 2244–2281.
Vancouver
1. Colman-Meixner C, Develder C, Tornatore M, Mukherjee B. A survey on resiliency techniques in cloud computing infrastructures and applications. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS. 2016;18(3):2244–81.
MLA
Colman-Meixner, Carlos, Chris Develder, Massimo Tornatore, and Biswanath Mukherjee. “A Survey on Resiliency Techniques in Cloud Computing Infrastructures and Applications.” IEEE COMMUNICATIONS SURVEYS AND TUTORIALS 18.3 (2016): 2244–2281. Print.