Authors
Xinye Cai, Haiyang Xu, Xiaoping Li, Kang Wang, Long Chen, Rubén Ruiz García, Qingfu Zhang
Publication date
2022/8/16
Journal
IEEE Transactions on Parallel and Distributed Systems
Volume
33
Issue
12
Pages
4547-4562
Publisher
IEEE
Description
In this article, we consider the dynamic allocation of bursty requests stochastically arriving at heterogeneous servers with uncertain setup times. Lower expected response time and less power consumption are desirable objectives of users and service providers respectively. However, sudden increase and decrease of cloud servers caused by bursty requests are rather challenging to get an appropriate trade-off between the two conflicting objectives which are closely related to the launched servers. The heterogeneity of the cloud servers further makes it more difficult to decide how to switch on and off servers and effectively and efficiently allocate bursty requests with balanced objectives. Based on a Markov decision process, a real-time bilevel decision-making model is constructed for unallocated requests which includes: whether to launch a server and which type of server to launch. A learn-and-deploy algorithm …
Scholar articles
X Cai, H Xu, X Li, K Wang, L Chen, RR García… - IEEE Transactions on Parallel and Distributed Systems, 2022