
The research of ARIMA, GM(1,1), and LSTM models for prediction of TB cases in China

Author

Listed:
  • Daren Zhao
  • Huiwu Zhang
  • Qing Cao
  • Zhiyi Wang
  • Sizhang He
  • Minghua Zhou
  • Ruihua Zhang

Abstract

Background and objective: Tuberculosis (TB) is a public health problem in China that not only endangers the population’s health but also affects economic and social development. Accurate prediction is needed to give policymakers early warning and to support effective precautionary measures. In this study, ARIMA, GM(1,1), and LSTM models were constructed and compared. The results showed that the LSTM was the optimal model, achieving satisfactory performance in predicting TB cases in mainland China.

Methods: Data on tuberculosis cases in mainland China were extracted from the website of the National Health Commission of the People’s Republic of China. Based on the characteristics of the TB data and the sample requirements, we built ARIMA, GM(1,1), and LSTM models to predict the prevalence trend of TB. The mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) were used to evaluate model fitting and prediction accuracy.

Results: There were 3,021,995 tuberculosis cases in mainland China from January 2018 to December 2020, and overall TB cases showed a downward trend. The optimal ARIMA model was ARIMA(0,1,0) × (0,1,0)₁₂. The equation of the GM(1,1) model was $X(k+1) = -10057053.55\,e^{-0.01k} + 10153178.55$; the mean square deviation ratio C was 0.49, and the small error probability P was 0.94. The LSTM model consisted of an input layer, a hidden layer, and an output layer, with 60 epochs and a learning rate of 0.01. The MAE, RMSE, and MAPE values of the LSTM model were smaller than those of the GM(1,1) and ARIMA models.

Conclusions: Our findings showed that the LSTM model was the optimal model, with higher accuracy than the ARIMA and GM(1,1) models. Its prediction results can serve as a predictive tool for TB prevention measures in mainland China.
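
The GM(1,1) equation and the error measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' code. It assumes the textbook GM(1,1) formulation, in which the fitted curve has the form X(k+1) = (x(1) - b/a) e^(-ak) + b/a, so the abstract's equation would correspond to a ≈ 0.01 and b/a ≈ 10153178.55. The function names (gm11_fit, gm11_predict, posterior_checks) and the synthetic series are hypothetical, and the C and P checks use population standard deviations, a convention that varies between references.

```python
import math


def gm11_fit(x0):
    """Estimate the GM(1,1) parameters (a, b) by ordinary least squares."""
    # Accumulated series (1-AGO): x1[k] = x0[0] + ... + x0[k]
    x1, total = [], 0.0
    for v in x0:
        total += v
        x1.append(total)
    # Background values: mean of consecutive accumulated values
    z = [0.5 * (x1[k] + x1[k - 1]) for k in range(1, len(x0))]
    y = list(x0[1:])
    # Fit y = slope * z + intercept; the grey equation is x0[k] = -a*z[k] + b
    m = len(z)
    sz, sy = sum(z), sum(y)
    szz = sum(v * v for v in z)
    szy = sum(zv * yv for zv, yv in zip(z, y))
    slope = (m * szy - sz * sy) / (m * szz - sz * sz)
    intercept = (sy - slope * sz) / m
    return -slope, intercept


def gm11_predict(x0, a, b, steps=0):
    """Return fitted values for x0 plus `steps` out-of-sample forecasts."""
    n = len(x0) + steps
    # Time response of the accumulated series, X(k+1) for k = 0, 1, ...
    x1_hat = [(x0[0] - b / a) * math.exp(-a * k) + b / a for k in range(n)]
    # Difference the accumulated fit to get back to the original scale
    return [x0[0]] + [x1_hat[k] - x1_hat[k - 1] for k in range(1, n)]


def mae(actual, predicted):
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


def mape(actual, predicted):
    """MAPE in percent; assumes no actual value is zero."""
    return 100.0 * sum(
        abs(a - p) / abs(a) for a, p in zip(actual, predicted)
    ) / len(actual)


def posterior_checks(actual, fitted):
    """Return (C, P): residual-to-data std ratio and small error probability."""
    n = len(actual)
    resid = [a - f for a, f in zip(actual, fitted)]
    mean_x, mean_e = sum(actual) / n, sum(resid) / n
    s1 = math.sqrt(sum((v - mean_x) ** 2 for v in actual) / n)
    s2 = math.sqrt(sum((e - mean_e) ** 2 for e in resid) / n)
    p = sum(abs(e - mean_e) < 0.6745 * s1 for e in resid) / n
    return s2 / s1, p


if __name__ == "__main__":
    # Synthetic, smoothly declining series; NOT the TB data from the paper
    series = [100 * 0.9 ** k for k in range(12)]
    a, b = gm11_fit(series)
    fitted = gm11_predict(series, a, b)
    print("a, b:", a, b)
    print("MAE, RMSE, MAPE:", mae(series, fitted), rmse(series, fitted),
          mape(series, fitted))
    print("C, P:", posterior_checks(series, fitted))
```

The same three error functions could be applied to the fitted or forecast values of any of the compared models, which is how a side-by-side comparison like the one reported in the abstract is usually set up.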

Suggested Citation

  • Daren Zhao & Huiwu Zhang & Qing Cao & Zhiyi Wang & Sizhang He & Minghua Zhou & Ruihua Zhang, 2022. "The research of ARIMA, GM(1,1), and LSTM models for prediction of TB cases in China," PLOS ONE, Public Library of Science, vol. 17(2), pages 1-18, February.
  • Handle: RePEc:plo:pone00:0262734
    DOI: 10.1371/journal.pone.0262734

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0262734
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0262734&type=printable
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rui Zhang & Hejia Song & Qiulan Chen & Yu Wang & Songwang Wang & Yonghong Li, 2022. "Comparison of ARIMA and LSTM for prediction of hemorrhagic fever at different time scales in China," PLOS ONE, Public Library of Science, vol. 17(1), pages 1-14, January.
    2. Luo, Xilin & Duan, Huiming & Xu, Kai, 2021. "A novel grey model based on traditional Richards model and its application in COVID-19," Chaos, Solitons & Fractals, Elsevier, vol. 142(C).
    3. Gaetano Perone, 2022. "Comparison of ARIMA, ETS, NNAR, TBATS and hybrid models to forecast the second wave of COVID-19 hospitalizations in Italy," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 23(6), pages 917-940, August.
    4. Atif Maqbool Khan & Magdalena Osińska, 2021. "How to Predict Energy Consumption in BRICS Countries?," Energies, MDPI, vol. 14(10), pages 1-21, May.
    5. Naimoli, Antonio, 2022. "Modelling the persistence of Covid-19 positivity rate in Italy," Socio-Economic Planning Sciences, Elsevier, vol. 82(PA).
    6. Méndez-Gordillo, Alma Rosa & Cadenas, Erasmo, 2021. "Wind speed forecasting by the extraction of the multifractal patterns of time series through the multiplicative cascade technique," Chaos, Solitons & Fractals, Elsevier, vol. 143(C).
    7. Jiang, P. & Liu, X., 2016. "Hidden Markov model for municipal waste generation forecasting under uncertainties," European Journal of Operational Research, Elsevier, vol. 250(2), pages 639-651.
    8. You-Shyang Chen & Arun Kumar Sangaiah & Yu-Pei Lin, 2024. "Hyperautomation on fuzzy data dredging on four advanced industrial forecasting models to support sustainable business management," Annals of Operations Research, Springer, vol. 342(1), pages 215-264, November.
    9. Shen, Meng & Li, Xiang & Lu, Yujie & Cui, Qingbin & Wei, Yi-Ming, 2021. "Personality-based normative feedback intervention for energy conservation," Energy Economics, Elsevier, vol. 104(C).
    10. Gaetano Perone, 2020. "An ARIMA model to forecast the spread and the final size of COVID-2019 epidemic in Italy," Health, Econometrics and Data Group (HEDG) Working Papers 20/07, HEDG, c/o Department of Economics, University of York.
    11. Yanbin Li & Zhen Li, 2019. "Forecasting of Coal Demand in China Based on Support Vector Machine Optimized by the Improved Gravitational Search Algorithm," Energies, MDPI, vol. 12(12), pages 1-20, June.
    12. Wudi Wei & Junjun Jiang & Hao Liang & Lian Gao & Bingyu Liang & Jiegang Huang & Ning Zang & Yanyan Liao & Jun Yu & Jingzhen Lai & Fengxiang Qin & Jinming Su & Li Ye & Hui Chen, 2016. "Application of a Combined Model with Autoregressive Integrated Moving Average (ARIMA) and Generalized Regression Neural Network (GRNN) in Forecasting Hepatitis Incidence in Heng County, China," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-13, June.
    13. Indy Man Kit Ho & Anthony Weldon & Jason Tze Ho Yong & Candy Tze Tim Lam & Jaime Sampaio, 2023. "Using Machine Learning Algorithms to Pool Data from Meta-Analysis for the Prediction of Countermovement Jump Improvement," IJERPH, MDPI, vol. 20(10), pages 1-15, May.
    14. Claudiu-Ionuţ Popîrlan & Irina-Valentina Tudor & Constantin-Cristian Dinu & Gabriel Stoian & Cristina Popîrlan & Daniela Dănciulescu, 2021. "Hybrid Model for Unemployment Impact on Social Life," Mathematics, MDPI, vol. 9(18), pages 1-19, September.
    15. Singh, Sarbjit & Parmar, Kulwinder Singh & Makkhan, Sidhu Jitendra Singh & Kaur, Jatinder & Peshoria, Shruti & Kumar, Jatinder, 2020. "Study of ARIMA and least square support vector machine (LS-SVM) models for the prediction of SARS-CoV-2 confirmed cases in the most affected countries," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    16. Changjun Huang & Lv Zhou & Fenliang Liu & Yuanzhi Cao & Zhong Liu & Yun Xue, 2023. "Deformation Prediction of Dam Based on Optimized Grey Verhulst Model," Mathematics, MDPI, vol. 11(7), pages 1-15, April.
    17. Batistela, Cristiane M. & Correa, Diego P.F. & Bueno, Átila M & Piqueira, José Roberto C., 2021. "SIRSi compartmental model for COVID-19 pandemic with immunity loss," Chaos, Solitons & Fractals, Elsevier, vol. 142(C).
    18. Charu Arora & Poras Khetarpal & Saket Gupta & Nuzhat Fatema & Hasmat Malik & Asyraf Afthanorhan, 2023. "Mathematical Modelling to Predict the Effect of Vaccination on Delay and Rise of COVID-19 Cases Management," Mathematics, MDPI, vol. 11(4), pages 1-15, February.
    19. Rasheed, Jawad & Jamil, Akhtar & Hameed, Alaa Ali & Aftab, Usman & Aftab, Javaria & Shah, Syed Attique & Draheim, Dirk, 2020. "A survey on artificial intelligence approaches in supporting frontline workers and decision makers for the COVID-19 pandemic," Chaos, Solitons & Fractals, Elsevier, vol. 141(C).
    20. Dinesh K. Sharma & H. S. Hota & Kate Brown & Richa Handa, 2022. "Integration of genetic algorithm with artificial neural network for stock market forecasting," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(2), pages 828-841, June.
