IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v32y2020i2p321-345.html
   My bibliography  Save this article

Scalable, Adaptable, and Fast Estimation of Transient Downtime in Virtual Infrastructures Using Convex Decomposition and Sample Path Randomization

Author

Listed:
  • Zhiling Guo

    (School of Information Systems, Singapore Management University, Singapore 178902)

  • Jin Li

    (School of Management, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, China; School of Economics and Management, Xidian University, Xi’an, Shaanxi 710071, China)

  • Ram Ramesh

    (Department of Management Science and Systems, State University of New York, Buffalo, New York 14260)

Abstract

Network function virtualization enables efficient cloud-resource planning by virtualizing network services and applications into software running on commodity servers. A cloud-service provider needs to manage and ensure service availability of a network of concurrent virtualized network functions (VNFs). The downtime distribution of a network of VNFs can be estimated using sample-path randomization on the underlying birth–death process. An integrated modeling approach for this purpose is limited by its scalability and computational load because of the high dimensionality of the integrated birth–death process. We propose a generalized convex decomposition of the integrated birth–death process, which transforms the high-dimensional multi-VNF process into a series of interlinked, low-dimensional, single-VNF processes. We theoretically show the statistical equivalence between the transition probabilities of the integrated birth–death process and those resulting from interlinking the decomposed system of processes. We further develop a decomposition algorithm that yields scalable and fast estimation of the system downtime distribution. Our algorithmic framework can be easily adapted to any logical definition of overall system availability. It can also be easily extended to various realistic VNF network configurations and characteristics including heterogeneous VNF failure distributions, effects of both node and link failures on the overall system downtime of fully or partially connected networks, and resource sharing across multiple VNFs. Our extensive computational results demonstrate the computational efficiency of the proposed algorithms while ensuring statistical consistency with the integrated-network model and the superior performance of the decomposition strategy over the integrated modeling approach.

Suggested Citation

  • Zhiling Guo & Jin Li & Ram Ramesh, 2020. "Scalable, Adaptable, and Fast Estimation of Transient Downtime in Virtual Infrastructures Using Convex Decomposition and Sample Path Randomization," INFORMS Journal on Computing, INFORMS, vol. 32(2), pages 321-345, April.
  • Handle: RePEc:inm:orijoc:v:32:y:2020:i:2:p:321-345
    DOI: 10.1287/ijoc.2019.0888
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/ijoc.2019.0888
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2019.0888?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Panos Parpas & Berk Ustun & Mort Webster & Quang Kha Tran, 2015. "Importance Sampling in Stochastic Programming: A Markov Chain Monte Carlo Approach," INFORMS Journal on Computing, INFORMS, vol. 27(2), pages 358-377, May.
    2. Mauro Passacantando & Danilo Ardagna & Anna Savi, 2016. "Service Provisioning Problem in Cloud and Multi-Cloud Systems," INFORMS Journal on Computing, INFORMS, vol. 28(2), pages 265-277, May.
    3. William H. Kaczynski & Lawrence M. Leemis & John H. Drew, 2012. "Transient Queueing Analysis," INFORMS Journal on Computing, INFORMS, vol. 24(1), pages 10-28, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bo Li & Subodha Kumar, 2022. "Managing Software‐as‐a‐Service: Pricing and operations," Production and Operations Management, Production and Operations Management Society, vol. 31(6), pages 2588-2608, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiaotie Chen & David L. Woodruff, 2024. "Distributions and bootstrap for data-based stochastic programming," Computational Management Science, Springer, vol. 21(1), pages 1-21, June.
    2. Flötteröd, G. & Osorio, C., 2017. "Stochastic network link transmission model," Transportation Research Part B: Methodological, Elsevier, vol. 102(C), pages 180-209.
    3. Oscar Dowson & Lea Kapelevich, 2021. "SDDP.jl : A Julia Package for Stochastic Dual Dynamic Programming," INFORMS Journal on Computing, INFORMS, vol. 33(1), pages 27-33, January.
    4. Simon Thevenin & Yossiri Adulyasak & Jean-François Cordeau, 2022. "Stochastic Dual Dynamic Programming for Multiechelon Lot Sizing with Component Substitution," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3151-3169, November.
    5. Wei Zhang & Kai Wang & Alexandre Jacquillat & Shuaian Wang, 2023. "Optimized Scenario Reduction: Solving Large-Scale Stochastic Programs with Quality Guarantees," INFORMS Journal on Computing, INFORMS, vol. 35(4), pages 886-908, July.
    6. Bo Li & Subodha Kumar, 2022. "Managing Software‐as‐a‐Service: Pricing and operations," Production and Operations Management, Production and Operations Management Society, vol. 31(6), pages 2588-2608, June.
    7. Chaithanya Bandi & Nikolaos Trichakis & Phebe Vayanos, 2019. "Robust Multiclass Queuing Theory for Wait Time Estimation in Resource Allocation Systems," Management Science, INFORMS, vol. 65(1), pages 152-187, January.
    8. Narayanan C. Viswanath, 2022. "Transient study of Markov models with time-dependent transition rates," Operational Research, Springer, vol. 22(3), pages 2209-2243, July.
    9. Bart P. G. Van Parys & Peyman Mohajerin Esfahani & Daniel Kuhn, 2021. "From Data to Decisions: Distributionally Robust Optimization Is Optimal," Management Science, INFORMS, vol. 67(6), pages 3387-3402, June.
    10. Hui Dong & Marvin K. Nakayama, 2017. "Quantile Estimation with Latin Hypercube Sampling," Operations Research, INFORMS, vol. 65(6), pages 1678-1695, December.
    11. Manuel A. Nunez & Xue Bai & Linna Du, 2021. "Leveraging Slack Capacity in IaaS Contract Cloud Services," Production and Operations Management, Production and Operations Management Society, vol. 30(4), pages 883-901, April.
    12. Rodica Ioana Lung & Noémi Gaskó & Mihai Alexandru Suciu, 2020. "Pareto-based evolutionary multiobjective approaches and the generalized Nash equilibrium problem," Journal of Heuristics, Springer, vol. 26(4), pages 561-584, August.
    13. Fei, Xin & Gülpınar, Nalân & Branke, Jürgen, 2019. "Efficient solution selection for two-stage stochastic programs," European Journal of Operational Research, Elsevier, vol. 277(3), pages 918-929.
    14. Tahir Ekin & Stephen Walker & Paul Damien, 2023. "Augmented simulation methods for discrete stochastic optimization with recourse," Annals of Operations Research, Springer, vol. 320(2), pages 771-793, January.
    15. Rajib L. Saha & Sumanta Singha & Subodha Kumar, 2021. "Does Congestion Always Hurt? Managing Discount Under Congestion in a Game-Theoretic Setting," Information Systems Research, INFORMS, vol. 32(4), pages 1347-1367, December.
    16. Ekblom, J. & Blomvall, J., 2020. "Importance sampling in stochastic optimization: An application to intertemporal portfolio choice," European Journal of Operational Research, Elsevier, vol. 285(1), pages 106-119.
    17. Franks, Jordan & Vihola, Matti, 2020. "Importance sampling correction versus standard averages of reversible MCMCs in terms of the asymptotic variance," Stochastic Processes and their Applications, Elsevier, vol. 130(10), pages 6157-6183.
    18. Carolina Osorio & Jana Yamani, 2017. "Analytical and Scalable Analysis of Transient Tandem Markovian Finite Capacity Queueing Networks," Transportation Science, INFORMS, vol. 51(3), pages 823-840, August.
    19. Anna Nagurney & Min Yu & Deniz Besik, 2017. "Supply chain network capacity competition with outsourcing: a variational equilibrium framework," Journal of Global Optimization, Springer, vol. 69(1), pages 231-254, September.
    20. Narum, Benjamin S. & Fairbrother, Jamie & Wallace, Stein W., 2024. "Problem-based scenario generation by decomposing output distributions," European Journal of Operational Research, Elsevier, vol. 318(1), pages 154-166.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:32:y:2020:i:2:p:321-345. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.