TY - GEN
T1 - A flexible analysis and prediction framework on resource usage in public clouds
AU - Lin, Chia Yu
AU - Chen, Yan Ann
AU - Tseng, Yu-Chee
AU - Wang, Li-Chun
PY - 2012/12/1
Y1 - 2012/12/1
N2 - In cloud computing environments, users can rent virtual machines (VMs) from cloud providers to execute their programs or provide network services. While using this kind of cloud services, one of the biggest problems for the users is to determine the proper number of VMs to complete the jobs considering both budget and time. In this paper, we propose a resource prediction framework (RPF), which can help users choose the minimum number of virtual machines to complete their jobs within a user specified time constraint. In order to verify the feasibility of RPF, we have done three case studies, namely parallel frequent pattern growth (FP-Growth), parallel K-means, and Particle Swarm Optimization (PSO). FP-growth, K-means and PSO are data intensive algorithms. These algorithms are typically executed repeatedly with different execution parameters to find the optimal results. When evaluating RPF by these algorithms in cloud environments, we have to modify them to parallel versions. The evaluation results indicate that RPF can successfully obtain the minimum number of VMs with acceptable errors. According to our case studies, the proposed RPF can be adopted by data intensive jobs by providing flexibility to both end users and cloud system providers.
AB - In cloud computing environments, users can rent virtual machines (VMs) from cloud providers to execute their programs or provide network services. While using this kind of cloud services, one of the biggest problems for the users is to determine the proper number of VMs to complete the jobs considering both budget and time. In this paper, we propose a resource prediction framework (RPF), which can help users choose the minimum number of virtual machines to complete their jobs within a user specified time constraint. In order to verify the feasibility of RPF, we have done three case studies, namely parallel frequent pattern growth (FP-Growth), parallel K-means, and Particle Swarm Optimization (PSO). FP-growth, K-means and PSO are data intensive algorithms. These algorithms are typically executed repeatedly with different execution parameters to find the optimal results. When evaluating RPF by these algorithms in cloud environments, we have to modify them to parallel versions. The evaluation results indicate that RPF can successfully obtain the minimum number of VMs with acceptable errors. According to our case studies, the proposed RPF can be adopted by data intensive jobs by providing flexibility to both end users and cloud system providers.
KW - MapReduce
KW - Parallel Frequent Pattern Growth
KW - Parallel K-means
KW - Particle Swarm Optimization
KW - cloud computing
KW - resource prediction
UR - http://www.scopus.com/inward/record.url?scp=84874229064&partnerID=8YFLogxK
U2 - 10.1109/CloudCom.2012.6427543
DO - 10.1109/CloudCom.2012.6427543
M3 - Conference contribution
AN - SCOPUS:84874229064
SN - 9781467345095
T3 - CloudCom 2012 - Proceedings: 2012 4th IEEE International Conference on Cloud Computing Technology and Science
SP - 309
EP - 316
BT - CloudCom 2012 - Proceedings
T2 - 2012 4th IEEE International Conference on Cloud Computing Technology and Science, CloudCom 2012
Y2 - 3 December 2012 through 6 December 2012
ER -