Printed from https://ideas.repec.org/a/taf/japsta/v43y2016i8p1419-1435.html

Preprocessing of centred logratio transformed density functions using smoothing splines

Authors
  • J. Machalová
  • K. Hron
  • G.S. Monti

Abstract

With large-scale database systems, statistical analysis of data that occur in the form of probability distributions becomes an important task in exploratory data analysis. Nevertheless, due to the specific properties of density functions, the proper statistical treatment of these data still represents a challenging task in functional data analysis. Namely, the usual metric does not fully account for the relative character of the information carried by density functions; instead, their geometrical features are captured by Bayes spaces of measures. The easiest way of expressing density functions in an L2 space is to use the centred logratio transformation, even though this results in functional data with a constant integral constraint that needs to be taken into account in further analysis. While the theoretical background for a reasonable analysis of density functions is already provided comprehensively by Bayes spaces themselves, preprocessing issues still need to be developed. The aim of this paper is to introduce optimal smoothing splines for centred logratio transformed density functions that take all their specific features into account and provide a concise methodology for reasonable preprocessing of raw (discretized) distributional observations. Theoretical developments are illustrated with a real-world data set from official statistics and with a simulation study.
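
As a rough illustration of the transformation the abstract refers to, the sketch below applies a discretized centred logratio (clr) transformation to a density sampled on a grid and checks the integral constraint numerically. It is not the smoothing spline method of the paper. The helper names (`trapezoid`, `clr`, `clr_inverse`), the grid, and the example density are all assumptions made for illustration, and the clr is taken in its usual form, the log-density minus its mean over the interval, under which the constant in the integral constraint is zero.

```python
import numpy as np

def trapezoid(y, x):
    """Trapezoidal rule, written out explicitly (hypothetical helper)."""
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)

def clr(density, grid):
    """Discretized clr: log-density minus its mean value over [a, b]."""
    log_f = np.log(density)
    mean_log = trapezoid(log_f, grid) / (grid[-1] - grid[0])
    return log_f - mean_log

def clr_inverse(clr_values, grid):
    """Map clr values back to a density that integrates to one."""
    g = np.exp(clr_values)
    return g / trapezoid(g, grid)

# Illustrative strictly positive density on [0, 1] (made-up example data).
grid = np.linspace(0.0, 1.0, 201)
raw = np.exp(-0.5 * ((grid - 0.4) / 0.15) ** 2) + 0.05
density = raw / trapezoid(raw, grid)

z = clr(density, grid)
zero_integral = trapezoid(z, grid)   # vanishes up to rounding error
recovered = clr_inverse(z, grid)     # reproduces the input density
```

Any smoother applied to `z` would have to preserve the zero integral, which is the constraint the abstract says needs to be taken into account in further analysis.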

Suggested Citation

  • J. Machalová & K. Hron & G.S. Monti, 2016. "Preprocessing of centred logratio transformed density functions using smoothing splines," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(8), pages 1419-1435, June.
  • Handle: RePEc:taf:japsta:v:43:y:2016:i:8:p:1419-1435
    DOI: 10.1080/02664763.2015.1103706

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/02664763.2015.1103706
    Download Restriction: Access to full text is restricted to subscribers.


    References listed on IDEAS

    1. Delicado, P., 2011. "Dimensionality reduction when data are density functions," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 401-420, January.

    Citations

    Cited by:

    1. Dominique Guegan & Matteo Iacopini, 2018. "Nonparametric forecasting of multivariate probability density functions," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01821815, HAL.
    2. Karel Hron & Jitka Machalová & Alessandra Menafoglio, 2023. "Bivariate densities in Bayes spaces: orthogonal decomposition and spline representation," Statistical Papers, Springer, vol. 64(5), pages 1629-1667, October.
    3. Jitka Machalová & Renáta Talská & Karel Hron & Aleš Gába, 2021. "Compositional splines for representation of density functions," Computational Statistics, Springer, vol. 36(2), pages 1031-1064, June.
    4. Dominique Guégan & Matteo Iacopini, 2018. "Nonparametric forecasting of multivariate probability density functions," Documents de travail du Centre d'Economie de la Sorbonne 18012, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
    5. Thomas-Agnan, Christine & Simioni, Michel & Trinh, Thi-Huong, 2023. "Discrete and Smooth Scalar-on-Density Compositional Regression for Assessing the Impact of Climate Change on Rice Yield in Vietnam," TSE Working Papers 23-1410, Toulouse School of Economics (TSE), revised Apr 2024.
    6. Matteo Iacopini & Dominique Guégan, 2018. "Nonparametric Forecasting of Multivariate Probability Density Functions," Working Papers 2018:15, Department of Economics, University of Venice "Ca' Foscari".
    7. Thomas-Agnan, Christine & Mondon, Camille & Trinh, Thi-Huong & Ruiz-Gazen, Anne, 2024. "ICS for complex data with application to outlier detection for density data objects," TSE Working Papers 24_1585, Toulouse School of Economics (TSE).
    8. Dominique Guegan & Matteo Iacopini, 2018. "Nonparametric forecasting of multivariate probability density functions," Post-Print halshs-01821815, HAL.
    9. Talská, R. & Menafoglio, A. & Machalová, J. & Hron, K. & Fišerová, E., 2018. "Compositional regression with functional response," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 66-85.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tadao Hoshino, 2024. "Functional Spatial Autoregressive Models," Papers 2402.14763, arXiv.org, revised Oct 2024.
    2. Berrendero, José R. & Cuevas, Antonio & Pateiro-López, Beatriz, 2016. "Shape classification based on interpoint distance distributions," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 237-247.
    3. Hron, K. & Menafoglio, A. & Templ, M. & Hrůzová, K. & Filzmoser, P., 2016. "Simplicial principal component analysis for density functions in Bayes spaces," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 330-350.
    4. Karel Hron & Jitka Machalová & Alessandra Menafoglio, 2023. "Bivariate densities in Bayes spaces: orthogonal decomposition and spline representation," Statistical Papers, Springer, vol. 64(5), pages 1629-1667, October.
    5. Martínez-Camblor, Pablo & Corral, Norberto, 2011. "Repeated measures analysis for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3244-3256, December.
    6. ARATA Yoshiyuki, 2017. "A Functional Linear Regression Model in the Space of Probability Density Functions," Discussion papers 17015, Research Institute of Economy, Trade and Industry (RIETI).
    7. Seo, Won-Ki & Beare, Brendan K., 2019. "Cointegrated linear processes in Bayes Hilbert space," Statistics & Probability Letters, Elsevier, vol. 147(C), pages 90-95.
    8. Petersen, Alexander & Zhang, Chao & Kokoszka, Piotr, 2022. "Modeling Probability Density Functions as Data Objects," Econometrics and Statistics, Elsevier, vol. 21(C), pages 159-178.
    9. Kokoszka, Piotr & Miao, Hong & Petersen, Alexander & Shang, Han Lin, 2019. "Forecasting of density functions with an application to cross-sectional and intraday returns," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1304-1317.
    10. Epifanio, Irene & Ventura-Campos, Noelia, 2011. "Functional data analysis in shape analysis," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2758-2773, September.
    11. Talská, R. & Menafoglio, A. & Machalová, J. & Hron, K. & Fišerová, E., 2018. "Compositional regression with functional response," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 66-85.
    12. Delicado, Pedro & Vieu, Philippe, 2015. "Optimal level sets for bivariate density representation," Journal of Multivariate Analysis, Elsevier, vol. 140(C), pages 1-18.
    13. Menafoglio, Alessandra & Petris, Giovanni, 2016. "Kriging for Hilbert-space valued random fields: The operatorial point of view," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 84-94.
    14. Berrendero, J.R. & Justel, A. & Svarc, M., 2011. "Principal components for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2619-2634, September.
    15. Jitka Machalová & Renáta Talská & Karel Hron & Aleš Gába, 2021. "Compositional splines for representation of density functions," Computational Statistics, Springer, vol. 36(2), pages 1031-1064, June.
    16. Won-Ki Seo, 2020. "Functional Principal Component Analysis for Cointegrated Functional Time Series," Papers 2011.12781, arXiv.org, revised Apr 2023.
    17. Zhang, Zhen & Müller, Hans-Georg, 2011. "Functional density synchronization," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2234-2249, July.
    18. Bongiorno, Enea G. & Goia, Aldo, 2019. "Describing the concentration of income populations by functional principal component analysis on Lorenz curves," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 10-24.
    19. Angela Montanari & Daniela Calò, 2013. "Model-based clustering of probability density functions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 301-319, September.
    20. S. Barahona & P. Centella & X. Gual-Arnau & M. V. Ibáñez & A. Simó, 2020. "Supervised classification of geometrical objects by integrating currents and functional data analysis," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 637-660, September.
