IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i23p3074-d691113.html
   My bibliography  Save this article

Categorical Functional Data Analysis. The cfda R Package

Author

Listed:
  • Cristian Preda

    (UMR CNRS 8524—Laboratoire Paul Painlevé, University of Lille, 59000 Lille, France
    Institute of Statistics and Applied Mathematics of the Romanian Academy, 050711 Bucharest, Romania
    Inria Lille Nord-Europe, MODAL, 59655 Villeneuve d’Ascq, France)

  • Quentin Grimonprez

    (DiagRAMS Technologies, 59000 Lille, France)

  • Vincent Vandewalle

    (Inria Lille Nord-Europe, MODAL, 59655 Villeneuve d’Ascq, France
    Biostatistics Department, University of Lille, CHU Lille—ULR 2694 METRICS, 59000 Lille, France)

Abstract

Categorical functional data represented by paths of a stochastic jump process with continuous time and a finite set of states are considered. As an extension of the multiple correspondence analysis to an infinite set of variables, optimal encodings of states over time are approximated using an arbitrary finite basis of functions. This allows dimension reduction, optimal representation, and visualisation of data in lower dimensional spaces. The methodology is implemented in the cfda R package and is illustrated using a real data set in the clustering framework.

Suggested Citation

  • Cristian Preda & Quentin Grimonprez & Vincent Vandewalle, 2021. "Categorical Functional Data Analysis. The cfda R Package," Mathematics, MDPI, vol. 9(23), pages 1-31, November.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:23:p:3074-:d:691113
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/23/3074/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/23/3074/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Deville, J. -C. & Saporta, G., 1983. "Correspondence analysis, with an extension towards nominal time series," Journal of Econometrics, Elsevier, vol. 22(1-2), pages 169-189.
    2. Jackson, Christopher, 2011. "Multi-State Models for Panel Data: The msm Package for R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 38(i08).
    3. Scholz, Michael, 2016. "R Package clickstream: Analyzing Clickstream Data with Markov Chains," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i04).
    4. Hervé Cardot & Guillaume Lecuelle & Pascal Schlich & Michel Visalli, 2019. "Estimating finite mixtures of semi‐Markov chains: an application to the segmentation of temporal sensory data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 68(5), pages 1281-1303, November.
    5. Melnykov, Volodymyr, 2016. "ClickClust: An R Package for Model-Based Clustering of Categorical Sequences," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i09).
    6. Byron J. Idrovo-Aguirre & Francisco J. Lozano & Javier E. Contreras-Reyes, 2021. "Prosperity or Real Estate Bubble? Exuberance Probability Index of Real Housing Prices in Chile," IJFS, MDPI, vol. 9(3), pages 1-24, September.
    7. Nath, Ravinder & Pavur, Robert, 1985. "A new statistic in the one-way multivariate analysis of variance," Computational Statistics & Data Analysis, Elsevier, vol. 2(4), pages 297-315, February.
    8. Dai, Wenlin & Mrkvička, Tomáš & Sun, Ying & Genton, Marc G., 2020. "Functional outlier detection and taxonomy by sequential transformations," Computational Statistics & Data Analysis, Elsevier, vol. 149(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qiu, Qinjing & Kawai, Reiichiro, 2022. "A decoupling principle for Markov-modulated chains," Statistics & Probability Letters, Elsevier, vol. 182(C).
    2. Vernon T. Farewell & Li Su & Christopher Jackson, 2019. "Partially hidden multi-state modelling of a prolonged disease state defined by a composite outcome," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 25(4), pages 696-711, October.
    3. Gaffney, Edward & McCann, Fergal, 2019. "The cyclicality in SICR: mortgage modelling under IFRS 9," ESRB Working Paper Series 92, European Systemic Risk Board.
    4. Cindy Frascolla & Guillaume Lecuelle & Pascal Schlich & Hervé Cardot, 2022. "Two sample tests for Semi-Markov processes with parametric sojourn time distributions: an application in sensory analysis," Computational Statistics, Springer, vol. 37(5), pages 2553-2580, November.
    5. Biagini, Francesca & Groll, Andreas & Widenmann, Jan, 2013. "Intensity-based premium evaluation for unemployment insurance products," Insurance: Mathematics and Economics, Elsevier, vol. 53(1), pages 302-316.
    6. Sharples, Linda D., 2018. "The role of statistics in the era of big data: Electronic health records for healthcare research," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 105-110.
    7. Alex Bottle & Chiara Maria Ventura & Kumar Dharmarajan & Paul Aylin & Francesca Ieva & Anna Maria Paganoni, 2018. "Regional variation in hospitalisation and mortality in heart failure: comparison of England and Lombardy using multistate modelling," Health Care Management Science, Springer, vol. 21(2), pages 292-304, June.
    8. Wildhaber, Mark L. & Albers, Janice L. & Green, Nicholas S. & Moran, Edward H., 2017. "A fully-stochasticized, age-structured population model for population viability analysis of fish: Lower Missouri River endangered pallid sturgeon example," Ecological Modelling, Elsevier, vol. 359(C), pages 434-448.
    9. Budhi Surya, 2021. "A new class of conditional Markov jump processes with regime switching and path dependence: properties and maximum likelihood estimation," Papers 2107.07026, arXiv.org.
    10. Tsiropoulos, Vasilis, 2018. "A Vulnerability Analysis for Mortgaged Irish Households," Financial Stability Notes 02-18, Central Bank of Ireland.
    11. João Costa Freitas & Alberto Adrego Pinto & Óscar Felgueiras, 2024. "Game Theory for Predicting Stocks’ Closing Prices," Mathematics, MDPI, vol. 12(17), pages 1-49, August.
    12. Sarrias, Mauricio, 2021. "A two recursive equation model to correct for endogeneity in latent class binary probit models," Journal of choice modelling, Elsevier, vol. 40(C).
    13. Emanuel Krebs & Jeong E. Min & Elizabeth Evans & Libo Li & Lei Liu & David Huang & Darren Urada & Thomas Kerr & Yih-Ing Hser & Bohdan Nosyk, 2017. "Estimating State Transitions for Opioid Use Disorders," Medical Decision Making, , vol. 37(5), pages 483-497, July.
    14. Guglielmo D'Amico & Stefania Scocchera & Loriano Storchi, 2021. "Randentropy: a software to measure inequality in random systems," Papers 2103.09107, arXiv.org.
    15. Wood, Kevin A. & Hilton, Geoff M. & Newth, Julia L. & Rees, Eileen C., 2019. "Seasonal variation in energy gain explains patterns of resource use by avian herbivores in an agricultural landscape: Insights from a mechanistic model," Ecological Modelling, Elsevier, vol. 409(C), pages 1-1.
    16. Sy-Miin Chow & Guangjian Zhang, 2013. "Nonlinear Regime-Switching State-Space (RSSS) Models," Psychometrika, Springer;The Psychometric Society, vol. 78(4), pages 740-768, October.
    17. Moritz Herrmann & Fabian Scheipl, 2021. "A Geometric Perspective on Functional Outlier Detection," Stats, MDPI, vol. 4(4), pages 1-41, November.
    18. Michael Crowther & Paul Lambert, 2016. "Multistate survival analysis in Stata," United Kingdom Stata Users' Group Meetings 2016 02, Stata Users Group.
    19. Pauhofová, Iveta & Želinský, Tomáš, 2017. "On the Regional Convergence of Income at District Level in Slovakia," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 65(10), pages 918-934.
    20. Shanshan Qin & Zhenni Tan & Yuehua Wu, 2024. "On robust estimation of hidden semi-Markov regime-switching models," Annals of Operations Research, Springer, vol. 338(2), pages 1049-1081, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:23:p:3074-:d:691113. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.