Improving Load Balance For Data-Intensive Computing On Cloud Platforms
Keywords
Cloud Computing; Data-Intensive Computing; Hadoop; Hadoop Distributed File System; Load Balance; MapReduce; Replica Placement
Abstract
Nowadays, big data problems are ubiquitous, which in turn creates huge demand for data-intensive computing. The advent of Cloud Computing has made data-intensive computing much more accessible and affordable than ever before. One of the crucial issues that can significantly affect the performance of data-intensive applications is the load balance among cluster nodes. In this paper, we address the load balance problem in the context of Hadoop Distributed File System (HDFS), a widely used file system for data-intensive computing on Cloud platforms, and present an innovative replica placement policy for HDFS, which can perfectly balance the computing load among all cluster nodes in both homogeneous and heterogeneous cluster environments.
Publication Date
12-22-2016
Publication Title
Proceedings - 2016 IEEE International Conference on Smart Cloud, SmartCloud 2016
Number of Pages
140-145
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/SmartCloud.2016.44
Copyright Status
Unknown
Socpus ID
85011060309 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/85011060309
STARS Citation
Dai, Wei; Ibrahim, Ibrahim; and Bassiouni, Mostafa, "Improving Load Balance For Data-Intensive Computing On Cloud Platforms" (2016). Scopus Export 2015-2019. 4252.
https://stars.library.ucf.edu/scopus2015/4252