Author
Listed:
- Zihao Qu
(Naveen Jindal School of Management, The University of Texas at Dallas, Richardson, Texas 75080)
- Milind Dawande
(Naveen Jindal School of Management, The University of Texas at Dallas, Richardson, Texas 75080)
- Ganesh Janakiraman
(Naveen Jindal School of Management, The University of Texas at Dallas, Richardson, Texas 75080)
Abstract
Motivated by the rapid growth of the cloud cost management and optimization (CCMO) industry to support the exploding cloud-computing market, we study an infinite-horizon, stochastic optimization problem from the viewpoint of a firm that employs cloud resources to process incoming orders (or jobs) over time. We model the following vital practical features of CCMO in our problem. There are several types of resources that differ in their costs and performance attributes (e.g., processor speed, memory, storage). For each type of resource, capacity can either be reserved over the long term at a discounted price or be used on demand at a relatively higher price. Orders of several types arrive stochastically through time; orders differ in their completion-time deadlines and in their resource-specific processing-time distributions. Moreover, the progress of an order can be observed periodically, and if required, the order can be moved from one resource type to another. Penalty costs are incurred for orders not completed by their deadlines. The firm’s goal is to minimize the long-run average expected cost per period, taking into account reserved-capacity costs, on-demand capacity costs, and order-delay costs. We derive a lower bound on the optimal cost by considering a set of decoupled problems, one for each order. The solutions of these problems are then used to construct a feasible policy for the original problem and derive an upper bound on that policy’s optimality gap. Importantly, we show that our policy is asymptotically optimal ; when the demand rates of the orders are scaled by a factor θ > 0 , the policy’s optimality gap scales proportional to 1 / θ . We also report results of a comprehensive numerical study—on a test bed informed by capacity and pricing data from Amazon Web Services—to demonstrate the impressive performance of our policy.
Suggested Citation
Zihao Qu & Milind Dawande & Ganesh Janakiraman, 2024.
"Technical Note—Cloud Cost Optimization: Model, Bounds, and Asymptotics,"
Operations Research, INFORMS, vol. 72(1), pages 132-150, January.
Handle:
RePEc:inm:oropre:v:72:y:2024:i:1:p:132-150
DOI: 10.1287/opre.2022.0362
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:72:y:2024:i:1:p:132-150. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.