IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v193y2020ics0951832019300729.html
   My bibliography  Save this article

Quantitative approaches for optimization of user experience based on network resilience for wireless service provider networks

Author

Listed:
  • Kakadia, Deepak
  • Ramirez-Marquez, Dr. Jose Emmanuel

Abstract

Since the 1980′s and in particular 1996, telecom operators and recently mobile operators have been facing increasingly fierce competition, combined with flat subscriber growth and increased data usage resulting in tremendous downward pressures on profitability, forcing operators to differentiate themselves by trying to offer network services with better customer experience at lower operational costs. Wireless operators are challenged with measuring user experience which in itself is subjective, in a manner that accurately reflects the functional and emotional aspects of perceived quality and linking to Network Resiliency which characterizes the network behavior as it responds to disruptions. Current network faults and alarms only consider device failures and do not consider actual impact to user experience. For instance a failed router may not impact the users experience due to built in redundancies in the network. Studies to date, have proposed methods and models that focus on specific aspects of user experience in wired and cellular networks. However, to the best of our knowledge, there is currently very little research that connects linking poor user network experience to root cause. Previous recent work in this area focus on identifying what and where measurements to gage subscriber OoE, modeling and high level concepts, but do not address realistic challenges and approaches that can be automated to materially impact improved customer experiences at lower operational expenses. There is a gap on how operators can automatically associate poor user experience, relevant network metrics and root causes with a suitable model that can be analyzed and optimized. We propose a general framework for a solution that links these entities together, with a quantified approach to optimize user network experience by optimizing network resilience using a model that can be analyzed and optimized using machine learning methods to improve resilience and hence user experience. Results of directly applying existing machine learning algorithms for identifying root causes to network telemetry data have proven to be ineffective in practice due to the fact that existing machine learning algorithms are designed for prediction, classification and ranking not for identifying causal relationships and further complicated by the fact that these algorithms have assumptions on the data and in reality the network data distributions vary wildly during network disturbances. The proposed general framework combines existing methods for anomaly detection and machine learning algorithms, however the novel contribution centers on improving the accuracy of finding associated root causes by dynamically selecting the optimal machine learning algorithm based on the network telemetry data features that are recomputed before, during and after network disturbances. The proposed approach then allows us to automate the time consuming manual tasks of network engineers that proactively monitor key performance metrics for anomalies, correlate with other data sources to ultimately determine actionable insights to maintain a certain acceptable level of user experience by dynamically selecting the appropriate machine learning algorithm for the given data characteristics or features. We describe an example case study specific to wireless provider environment, illustrating the potential viability with results from actual wireless(approx 8 million monthly subscribers) operations data showing promising results by applying the proposed approach. The prototype implementation was able to programmatically detect anomalies, identify potential root causes using different algorithms suitable for the given data and time frame, which dramatically increased the accuracy and efficiency of the small network engineering team, and hence improved the user experience by improving network resiliency.

Suggested Citation

  • Kakadia, Deepak & Ramirez-Marquez, Dr. Jose Emmanuel, 2020. "Quantitative approaches for optimization of user experience based on network resilience for wireless service provider networks," Reliability Engineering and System Safety, Elsevier, vol. 193(C).
  • Handle: RePEc:eee:reensy:v:193:y:2020:i:c:s0951832019300729
    DOI: 10.1016/j.ress.2019.106606
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832019300729
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2019.106606?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ramirez-Marquez, Jose E. & Rocco, Claudio M. & Gebre, Bethel A. & Coit, David W. & Tortorella, Michael, 2006. "New insights on multi-state component criticality and importance," Reliability Engineering and System Safety, Elsevier, vol. 91(8), pages 894-904.
    2. Ramírez-Márquez, José E. & Jiang, Wei, 2006. "Confidence bounds for the reliability of binary capacitated two-terminal networks," Reliability Engineering and System Safety, Elsevier, vol. 91(8), pages 905-914.
    3. Sols, Alberto & Ramírez-Márquez, José E. & Verma, Dinesh & Vitoriano, Begoña, 2007. "Evaluation of full and degraded mission reliability and mission dependability for intermittently operated, multi-functional systems," Reliability Engineering and System Safety, Elsevier, vol. 92(9), pages 1274-1280.
    4. Hosseini, Seyedmohsen & Barker, Kash & Ramirez-Marquez, Jose E., 2016. "A review of definitions and measures of system resilience," Reliability Engineering and System Safety, Elsevier, vol. 145(C), pages 47-61.
    5. Ramirez-Marquez, Jose Emmanuel & Coit, David W., 2007. "Multi-state component criticality analysis for reliability improvement in multi-state systems," Reliability Engineering and System Safety, Elsevier, vol. 92(12), pages 1608-1619.
    6. Henry, Devanandham & Emmanuel Ramirez-Marquez, Jose, 2012. "Generic metrics and quantitative approaches for system resilience as a function of time," Reliability Engineering and System Safety, Elsevier, vol. 99(C), pages 114-122.
    7. Eija Kaasinen & Virpi Roto & Kristin Roloff & Kaisa Väänänen-Vainio-Mattila & Teija Vainio & Wolfgang Maehr & Dhaval Joshi & Sujan Shrestha, 2009. "User Experience of Mobile Internet: Analysis and Recommendations," International Journal of Mobile Human Computer Interaction (IJMHCI), IGI Global, vol. 1(4), pages 4-23, October.
    8. Barker, Kash & Ramirez-Marquez, Jose Emmanuel & Rocco, Claudio M., 2013. "Resilience-based network component importance measures," Reliability Engineering and System Safety, Elsevier, vol. 117(C), pages 89-97.
    9. Ramirez-Marquez, Jose E. & Coit, David W., 2007. "Optimization of system reliability in the presence of common cause failures," Reliability Engineering and System Safety, Elsevier, vol. 92(10), pages 1421-1434.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yeh, Wei-Chang, 2021. "A quick BAT for evaluating the reliability of binary-state networks," Reliability Engineering and System Safety, Elsevier, vol. 216(C).
    2. Yeh, Wei-Chang, 2023. "Novel recursive inclusion-exclusion technology based on BAT and MPs for heterogeneous-arc binary-state network reliability problems," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    3. Yeh, Wei-Chang, 2021. "Novel binary-addition tree algorithm (BAT) for binary-state network reliability problem," Reliability Engineering and System Safety, Elsevier, vol. 208(C).
    4. Yeh, Wei-Chang, 2023. "QB-II for evaluating the reliability of binary-state networks," Reliability Engineering and System Safety, Elsevier, vol. 230(C).
    5. Yeh, Wei-Chang & Tan, Shi-Yi & Forghani-elahabad, Majid & Khadiri, Mohamed El & Jiang, Yunzhi & Lin, Chen-Shiun, 2022. "New binary-addition tree algorithm for the all-multiterminal binary-state network reliability problem," Reliability Engineering and System Safety, Elsevier, vol. 224(C).
    6. Yeh, Wei-Chang & Du, Chia-Ming & Tan, Shi-Yi & Forghani-elahabad, Majid, 2023. "Application of LSTM based on the BAT-MCS for binary-state network approximated time-dependent reliability problems," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
    7. Yeh, Wei-Chang, 2022. "Novel direct algorithm for computing simultaneous all-level reliability of multistate flow networks," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
    8. Yeh, Wei-Chang & Tan, Shi-Yi & Zhu, Wenbo & Huang, Chia-Ling & Yang, Guang-yi, 2022. "Novel binary addition tree algorithm (BAT) for calculating the direct lower-bound of the highly reliable binary-state network reliability," Reliability Engineering and System Safety, Elsevier, vol. 223(C).
    9. Yeh, Wei-Chang, 2022. "Novel self-adaptive Monte Carlo simulation based on binary-addition-tree algorithm for binary-state network reliability approximation," Reliability Engineering and System Safety, Elsevier, vol. 228(C).
    10. Mohajer, Amin & Bavaghar, Maryam & Farrokhi, Hamid, 2020. "Mobility-aware load Balancing for Reliable Self-Organization Networks: Multi-agent Deep Reinforcement Learning," Reliability Engineering and System Safety, Elsevier, vol. 202(C).
    11. Yeh, Wei-Chang, 2021. "Novel Algorithm for Computing All-Pairs Homogeneity-Arc Binary-State Undirected Network Reliability," Reliability Engineering and System Safety, Elsevier, vol. 216(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Baroud, Hiba & Barker, Kash, 2018. "A Bayesian kernel approach to modeling resilience-based network component importance," Reliability Engineering and System Safety, Elsevier, vol. 170(C), pages 10-19.
    2. Uday, Payuna & Chandrahasa, Rakshit & Marais, Karen, 2019. "System Importance Measures: Definitions and Application to System-of-Systems Analysis," Reliability Engineering and System Safety, Elsevier, vol. 191(C).
    3. Trucco, Paolo & Petrenj, Boris, 2023. "Characterisation of resilience metrics in full-scale applications to interdependent infrastructure systems," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
    4. Baroud, Hiba & Barker, Kash & Ramirez-Marquez, Jose E. & Rocco S., Claudio M., 2014. "Importance measures for inland waterway network resilience," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 62(C), pages 55-67.
    5. Poulin, Craig & Kane, Michael B., 2021. "Infrastructure resilience curves: Performance measures and summary metrics," Reliability Engineering and System Safety, Elsevier, vol. 216(C).
    6. Zhao, S. & Liu, X. & Zhuo, Y., 2017. "Hybrid Hidden Markov Models for resilience metrics in a dynamic infrastructure system," Reliability Engineering and System Safety, Elsevier, vol. 164(C), pages 84-97.
    7. Claudio M Rocco & Kash Barker & Jose Moronta & Jose E Ramirez-Marquez, 2018. "Community detection and resilience in multi-source, multi-terminal networks," Journal of Risk and Reliability, , vol. 232(6), pages 616-626, December.
    8. Ramirez-Marquez, Jose E. & Rocco, Claudio M. & Barker, Kash & Moronta, Jose, 2018. "Quantifying the resilience of community structures in networks," Reliability Engineering and System Safety, Elsevier, vol. 169(C), pages 466-474.
    9. Gonçalves, L.A.P.J. & Ribeiro, P.J.G., 2020. "Resilience of urban transportation systems. Concept, characteristics, and methods," Journal of Transport Geography, Elsevier, vol. 85(C).
    10. MacKenzie, Cameron A. & Hu, Chao, 2019. "Decision making under uncertainty for design of resilient engineered systems," Reliability Engineering and System Safety, Elsevier, vol. 192(C).
    11. Almoghathawi, Yasser & Barker, Kash & Albert, Laura A., 2019. "Resilience-driven restoration model for interdependent infrastructure networks," Reliability Engineering and System Safety, Elsevier, vol. 185(C), pages 12-23.
    12. Zhang, Chao & Xu, Xin & Dui, Hongyan, 2020. "Resilience Measure of Network Systems by Node and Edge Indicators," Reliability Engineering and System Safety, Elsevier, vol. 202(C).
    13. Xu, Zhaoping & Ramirez-Marquez, Jose Emmanuel & Liu, Yu & Xiahou, Tangfan, 2020. "A new resilience-based component importance measure for multi-state networks," Reliability Engineering and System Safety, Elsevier, vol. 193(C).
    14. David C. Lane & Birgit Kopainsky & Hugo Herrera, 2017. "From Metaphor to Practice: Operationalizing the Analysis of Resilience Using System Dynamics Modelling," Systems Research and Behavioral Science, Wiley Blackwell, vol. 34(4), pages 444-462, July.
    15. Patriarca, Riccardo & De Paolis, Alessandro & Costantino, Francesco & Di Gravio, Giulio, 2021. "Simulation model for simple yet robust resilience assessment metrics for engineered systems," Reliability Engineering and System Safety, Elsevier, vol. 209(C).
    16. Das, Laya & Munikoti, Sai & Natarajan, Balasubramaniam & Srinivasan, Babji, 2020. "Measuring smart grid resilience: Methods, challenges and opportunities," Renewable and Sustainable Energy Reviews, Elsevier, vol. 130(C).
    17. Hossain, Niamat Ullah Ibne & Jaradat, Raed & Hosseini, Seyedmohsen & Marufuzzaman, Mohammad & Buchanan, Randy K., 2019. "A framework for modeling and assessing system resilience using a Bayesian network: A case study of an interdependent electrical infrastructure system," International Journal of Critical Infrastructure Protection, Elsevier, vol. 25(C), pages 62-83.
    18. Kuttler, Emma & Ghorbani-Renani, Nafiseh & Barker, Kash & González, Andrés D. & Johansson, Jonas, 2024. "Protection-interdiction-restoration for resilient multi-commodity networks," Reliability Engineering and System Safety, Elsevier, vol. 242(C).
    19. Caputo, Antonio C. & Kalemi, Bledar & Paolacci, Fabrizio & Corritore, Daniele, 2020. "Computing resilience of process plants under Na-Tech events: Methodology and application to sesmic loading scenarios," Reliability Engineering and System Safety, Elsevier, vol. 195(C).
    20. Adel Mottahedi & Farhang Sereshki & Mohammad Ataei & Ali Nouri Qarahasanlou & Abbas Barabadi, 2021. "The Resilience of Critical Infrastructure Systems: A Systematic Literature Review," Energies, MDPI, vol. 14(6), pages 1-32, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:193:y:2020:i:c:s0951832019300729. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.