IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0248896.html
   My bibliography  Save this article

Adaptive dimensionality reduction for neural network-based online principal component analysis

Author

Listed:
  • Nico Migenda
  • Ralf Möller
  • Wolfram Schenck

Abstract

“Principal Component Analysis” (PCA) is an established linear technique for dimensionality reduction. It performs an orthonormal transformation to replace possibly correlated variables with a smaller set of linearly independent variables, the so-called principal components, which capture a large portion of the data variance. The problem of finding the optimal number of principal components has been widely studied for offline PCA. However, when working with streaming data, the optimal number changes continuously. This requires to update both the principal components and the dimensionality in every timestep. While the continuous update of the principal components is widely studied, the available algorithms for dimensionality adjustment are limited to an increment of one in neural network-based and incremental PCA. Therefore, existing approaches cannot account for abrupt changes in the presented data. The contribution of this work is to enable in neural network-based PCA the continuous dimensionality adjustment by an arbitrary number without the necessity to learn all principal components. A novel algorithm is presented that utilizes several PCA characteristics to adaptivly update the optimal number of principal components for neural network-based PCA. A precise estimation of the required dimensionality reduces the computational effort while ensuring that the desired amount of variance is kept. The computational complexity of the proposed algorithm is investigated and it is benchmarked in an experimental study against other neural network-based and incremental PCA approaches where it produces highly competitive results.

Suggested Citation

  • Nico Migenda & Ralf Möller & Wolfram Schenck, 2021. "Adaptive dimensionality reduction for neural network-based online principal component analysis," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-32, March.
  • Handle: RePEc:plo:pone00:0248896
    DOI: 10.1371/journal.pone.0248896
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0248896
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0248896&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0248896?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Louis Guttman, 1954. "Some necessary conditions for common-factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 19(2), pages 149-161, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Christopher K. Wikle & Abhirup Datta & Bhava Vyasa Hari & Edward L. Boone & Indranil Sahoo & Indulekha Kavila & Stefano Castruccio & Susan J. Simmons & Wesley S. Burr & Won Chang, 2023. "An illustration of model agnostic explainability methods applied to environmental data," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. GUO-FITOUSSI, Liang, 2013. "A Comparison of the Finite Sample Properties of Selection Rules of Factor Numbers in Large Datasets," MPRA Paper 50005, University Library of Munich, Germany.
    2. Louis Guttman, 1955. "A generalized simplex for factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 20(3), pages 173-192, September.
    3. Anos-Casero, Paloma & Udomsaph, Charles, 2009. "What drives firm productivity growth ?," Policy Research Working Paper Series 4841, The World Bank.
    4. Gerhard Derflinger, 1984. "A loss function for alpha factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 49(3), pages 325-330, September.
    5. Taiki Yoshida & Yohei Otaka & Shin Kitamura & Kazuki Ushizawa & Masashi Kumagai & Yuto Kurihara & Jun Yaeda & Rieko Osu, 2022. "Development and validation of new evaluation scale for measuring stroke patients’ motivation for rehabilitation in rehabilitation wards," PLOS ONE, Public Library of Science, vol. 17(3), pages 1-17, March.
    6. Kate Elizabeth Gannon, Mike Hulme, 2017. "Geoengineering at the ‘edge of the world’: exploring perceptions of ocean fertilization through the Haida Salmon Restoration Corporation," GRI Working Papers 280, Grantham Research Institute on Climate Change and the Environment.
    7. Trucíos, Carlos & Mazzeu, João H.G. & Hotta, Luiz K. & Valls Pereira, Pedro L. & Hallin, Marc, 2021. "Robustness and the general dynamic factor model with infinite-dimensional space: Identification, estimation, and forecasting," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1520-1534.
    8. Yoo, Sun-Young & Vonk, M. Elizabeth, 2012. "The development and initial validation of the Immigrant Parental Stress Inventory (IPSI) in a sample of Korean immigrant parents," Children and Youth Services Review, Elsevier, vol. 34(5), pages 989-998.
    9. Oscar Claveria & Enric Monte & Salvador Torra, 2017. "A new approach for the quantification of qualitative measures of economic expectations," Quality & Quantity: International Journal of Methodology, Springer, vol. 51(6), pages 2685-2706, November.
    10. Kevin Handtke & Lisa Richter-Beuschel & Susanne Bögeholz, 2022. "Self-Efficacy Beliefs of Teaching ESD: A Theory-Driven Instrument and the Effectiveness of ESD in German Teacher Education," Sustainability, MDPI, vol. 14(11), pages 1-32, May.
    11. Horstmann, Felix, 2017. "Measuring the shopper's attitude toward the point of sale display: Scale development and validation," Journal of Retailing and Consumer Services, Elsevier, vol. 36(C), pages 112-123.
    12. Jaung, Wanggi & Putzel, Louis & Bull, Gary Q. & Kozak, Robert & Markum,, 2016. "Certification of forest watershed services: A Q methodology analysis of opportunities and challenges in Lombok, Indonesia," Ecosystem Services, Elsevier, vol. 22(PA), pages 51-59.
    13. Chun, JongSerl & Kim, Jinyung & Lee, Serim, 2023. "Development of a cyberbullying victimization scale for adolescents in South Korea," Children and Youth Services Review, Elsevier, vol. 144(C).
    14. Marconi, Gabriele, 2014. "European higher education policies and the problem of estimating a complex model with a small cross-section," MPRA Paper 87600, University Library of Munich, Germany.
    15. Philip DuBois, 1960. "An analysis of Guttman's simplex," Psychometrika, Springer;The Psychometric Society, vol. 25(2), pages 173-182, June.
    16. Trucíos, Carlos & Hotta, Luiz K. & Valls Pereira, Pedro L., 2019. "On the robustness of the principal volatility components," Journal of Empirical Finance, Elsevier, vol. 52(C), pages 201-219.
    17. Rue, Lisa A. & Estrada, Samantha & Floren, Michael & MacKinnon, Krista, 2016. "Formative evaluation: Developing measures for online family mental health recovery education," Evaluation and Program Planning, Elsevier, vol. 55(C), pages 27-34.
    18. Ana Calderon & Knut Arne Hooper Storeide & Cecilie Elvejord & Helene Amundsen Nissen-Lie & Randi Ulberg & Hanne-Sofie Johnsen Dahl, 2022. "Examining Psychotherapeutic Processes with Depressed Adolescents: A Comparative Study of Two Psychodynamic Therapies," IJERPH, MDPI, vol. 19(24), pages 1-20, December.
    19. Ard H. J. den Reijer & Pieter W. Otter & Jan P. A. M. Jacobs, 2024. "An heuristic scree plot criterion for the number of factors," Statistical Papers, Springer, vol. 65(6), pages 3991-4000, August.
    20. Meisam Moghimbeygi & Anahita Nodehi, 2022. "Multinomial Principal Component Logistic Regression on Shape Data," Journal of Classification, Springer;The Classification Society, vol. 39(3), pages 578-599, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0248896. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.