IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0241686.html
   My bibliography  Save this article

Evaluation of classification and forecasting methods on time series gene expression data

Author

Listed:
  • Nafis Irtiza Tripto
  • Mohimenul Kabir
  • Md Shamsuzzoha Bayzid
  • Atif Rahman

Abstract

Time series gene expression data is widely used to study different dynamic biological processes. Although gene expression datasets share many of the characteristics of time series data from other domains, most of the analyses in this field do not fully leverage the time-ordered nature of the data and focus on clustering the genes based on their expression values. Other domains, such as financial stock and weather prediction, utilize time series data for forecasting purposes. Moreover, many studies have been conducted to classify generic time series data based on trend, seasonality, and other patterns. Therefore, an assessment of these approaches on gene expression data would be of great interest to evaluate their adequacy in this domain. Here, we perform a comprehensive evaluation of different traditional unsupervised and supervised machine learning approaches as well as deep learning based techniques for time series gene expression classification and forecasting on five real datasets. In addition, we propose deep learning based methods for both classification and forecasting, and compare their performances with the state-of-the-art methods. We find that deep learning based methods generally outperform traditional approaches for time series classification. Experiments also suggest that supervised classification on gene expression is more effective than clustering when labels are available. In time series gene expression forecasting, we observe that an autoregressive statistical approach has the best performance for short term forecasting, whereas deep learning based methods are better suited for long term forecasting.

Suggested Citation

  • Nafis Irtiza Tripto & Mohimenul Kabir & Md Shamsuzzoha Bayzid & Atif Rahman, 2020. "Evaluation of classification and forecasting methods on time series gene expression data," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-17, November.
  • Handle: RePEc:plo:pone00:0241686
    DOI: 10.1371/journal.pone.0241686
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0241686
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0241686&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0241686?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sean J. Taylor & Benjamin Letham, 2018. "Forecasting at Scale," The American Statistician, Taylor & Francis Journals, vol. 72(1), pages 37-45, January.
    2. Bryan, Jenny, 2004. "Problems in gene clustering based on gene expression data," Journal of Multivariate Analysis, Elsevier, vol. 90(1), pages 44-66, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jorge-Eusebio Velasco-López & Ramón-Alberto Carrasco & Jesús Serrano-Guerrero & Francisco Chiclana, 2024. "Profiling Social Sentiment in Times of Health Emergencies with Information from Social Networks and Official Statistics," Mathematics, MDPI, vol. 12(6), pages 1-23, March.
    2. Fadaki, Masih & Asadikia, Atie, 2024. "Augmenting Monte Carlo Tree Search for managing service level agreements," International Journal of Production Economics, Elsevier, vol. 271(C).
    3. Michael Vössing & Niklas Kühl & Matteo Lind & Gerhard Satzger, 2022. "Designing Transparency for Effective Human-AI Collaboration," Information Systems Frontiers, Springer, vol. 24(3), pages 877-895, June.
    4. Maghsoodi, Abtin Ijadi, 2023. "Cryptocurrency portfolio allocation using a novel hybrid and predictive big data decision support system," Omega, Elsevier, vol. 115(C).
    5. Miroslav Navratil & Andrea Kolkova, 2019. "Decomposition and Forecasting Time Series in the Business Economy Using Prophet Forecasting Model," Central European Business Review, Prague University of Economics and Business, vol. 2019(4), pages 26-39.
    6. Fadi Kahwash & Basel Barakat & Ahmad Taha & Qammer H. Abbasi & Muhammad Ali Imran, 2021. "Optimising Electrical Power Supply Sustainability Using a Grid-Connected Hybrid Renewable Energy System—An NHS Hospital Case Study," Energies, MDPI, vol. 14(21), pages 1-23, October.
    7. Lorenzo Menculini & Andrea Marini & Massimiliano Proietti & Alberto Garinei & Alessio Bozza & Cecilia Moretti & Marcello Marconi, 2021. "Comparing Prophet and Deep Learning to ARIMA in Forecasting Wholesale Food Prices," Forecasting, MDPI, vol. 3(3), pages 1-19, September.
    8. Romero-Fiances, Irene & Livera, Andreas & Theristis, Marios & Makrides, George & Stein, Joshua S. & Nofuentes, Gustavo & de la Casa, Juan & Georghiou, George E., 2022. "Impact of duration and missing data on the long-term photovoltaic degradation rate estimation," Renewable Energy, Elsevier, vol. 181(C), pages 738-748.
    9. Zhewei Huang & Yawen Yi, 2024. "Short-Term Load Forecasting for Regional Smart Energy Systems Based on Two-Stage Feature Extraction and Hybrid Inverted Transformer," Sustainability, MDPI, vol. 16(17), pages 1-25, September.
    10. Winita Sulandari & Yudho Yudhanto & Sri Subanti & Crisma Devika Setiawan & Riskhia Hapsari & Paulo Canas Rodrigues, 2023. "Comparing the Simple to Complex Automatic Methods with the Ensemble Approach in Forecasting Electrical Time Series Data," Energies, MDPI, vol. 16(22), pages 1-16, November.
    11. Ashish Shrestha & Bishal Ghimire & Francisco Gonzalez-Longatt, 2021. "A Bayesian Model to Forecast the Time Series Kinetic Energy Data for a Power System," Energies, MDPI, vol. 14(11), pages 1-15, June.
    12. Nik Dawson & Sacha Molitorisz & Marian-Andrei Rizoiu & Peter Fray, 2020. "Layoffs, Inequity and COVID-19: A Longitudinal Study of the Journalism Jobs Crisis in Australia from 2012 to 2020," Papers 2008.12459, arXiv.org, revised Feb 2021.
    13. Luyao Zhang & Fan Zhang, 2023. "Understand Waiting Time in Transaction Fee Mechanism: An Interdisciplinary Perspective," Papers 2305.02552, arXiv.org.
    14. Md Jamal Ahmed Shohan & Md Omar Faruque & Simon Y. Foo, 2022. "Forecasting of Electric Load Using a Hybrid LSTM-Neural Prophet Model," Energies, MDPI, vol. 15(6), pages 1-18, March.
    15. Ángel Cuevas & Ramiro Ledo & Enrique M. Quilis, 2021. "Seasonal adjustment of the Spanish sales daily data," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 12(4), pages 687-708, December.
    16. Sprangers, Olivier & Schelter, Sebastian & de Rijke, Maarten, 2023. "Parameter-efficient deep probabilistic forecasting," International Journal of Forecasting, Elsevier, vol. 39(1), pages 332-345.
    17. Aleksandr N. Grekov & Elena V. Vyshkvarkova & Aleksandr S. Mavrin, 2024. "Forecasting and Anomaly Detection in BEWS: Comparative Study of Theta, Croston, and Prophet Algorithms," Forecasting, MDPI, vol. 6(2), pages 1-14, May.
    18. Odin Foldvik Eikeland & Filippo Maria Bianchi & Harry Apostoleris & Morten Hansen & Yu-Cheng Chiou & Matteo Chiesa, 2021. "Predicting Energy Demand in Semi-Remote Arctic Locations," Energies, MDPI, vol. 14(4), pages 1-17, February.
    19. Jayesh Thaker & Robert Höller, 2022. "A Comparative Study of Time Series Forecasting of Solar Energy Based on Irradiance Classification," Energies, MDPI, vol. 15(8), pages 1-26, April.
    20. Srinka Basu & Sugata Sen, 2023. "COVID 19 Pandemic, Socio-Economic Behaviour and Infection Characteristics: An Inter-Country Predictive Study Using Deep Learning," Computational Economics, Springer;Society for Computational Economics, vol. 61(2), pages 645-676, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0241686. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.