IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v186y2021ics0047259x21000610.html
   My bibliography  Save this article

Sequential estimation of Spearman rank correlation using Hermite series estimators

Author

Listed:
  • Stephanou, Michael
  • Varughese, Melvin

Abstract

In this article we describe a new Hermite series based sequential estimator for the Spearman rank correlation coefficient and provide algorithms applicable in both the stationary and non-stationary settings. To treat the non-stationary setting, we introduce a novel, exponentially weighted estimator for the Spearman rank correlation, which allows the local nonparametric correlation of a bivariate data stream to be tracked. To the best of our knowledge this is the first algorithm to be proposed for estimating a time varying Spearman rank correlation that does not rely on a moving window approach. We explore the practical effectiveness of the Hermite series based estimators through real data and simulation studies demonstrating good practical performance. The simulation studies in particular reveal competitive performance compared to an existing algorithm. The potential applications of this work are manifold. The Hermite series based Spearman rank correlation estimator can be applied to fast and robust online calculation of correlation which may vary over time. Possible machine learning applications include, amongst others, fast feature selection and hierarchical clustering on massive data sets.

Suggested Citation

  • Stephanou, Michael & Varughese, Melvin, 2021. "Sequential estimation of Spearman rank correlation using Hermite series estimators," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
  • Handle: RePEc:eee:jmvana:v:186:y:2021:i:c:s0047259x21000610
    DOI: 10.1016/j.jmva.2021.104783
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X21000610
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2021.104783?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Michael Stephanou & Melvin Varughese, 2021. "On the properties of hermite series based distribution function estimators," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(4), pages 535-559, May.
    2. Christophe Croux & Catherine Dehon, 2010. "Influence functions of the Spearman and Kendall correlation measures," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 19(4), pages 497-515, November.
    3. Greblicki, Wlodzimierz & Pawlak, Miroslaw, 1985. "Pointwise consistency of the hermite series density estimate," Statistics & Probability Letters, Elsevier, vol. 3(2), pages 65-69, April.
    4. Greblicki, W?odzimierz & Pawlak, Miros?aw, 1984. "Hermite series estimates of a probability density and its derivatives," Journal of Multivariate Analysis, Elsevier, vol. 15(2), pages 174-182, October.
    5. E. Liebscher, 1990. "Hermite series estimators for probability densities," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 37(1), pages 321-343, December.
    6. Asma Jmaei & Yousri Slaoui & Wassima Dellagi, 2017. "Recursive distribution estimator defined by stochastic approximation method using Bernstein polynomials," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 29(4), pages 792-805, October.
    7. Philippe Pébay & Timothy B. Terriberry & Hemanth Kolla & Janine Bennett, 2016. "Numerically stable, scalable formulas for parallel and online computation of higher-order multivariate central moments with arbitrary weights," Computational Statistics, Springer, vol. 31(4), pages 1305-1325, December.
    8. Aït-Sahalia, Yacine & Fan, Jianqing & Xiu, Dacheng, 2010. "High-Frequency Covariance Estimates With Noisy and Asynchronous Financial Data," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1504-1517.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael Stephanou & Melvin Varughese, 2021. "On the properties of hermite series based distribution function estimators," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(4), pages 535-559, May.
    2. Foster, Joshua, 2022. "Semi-nonparametric estimation of secret reserve prices in auctions," Economics Letters, Elsevier, vol. 220(C).
    3. repec:cte:wsrepe:es142416 is not listed on IDEAS
    4. Harry-Paul Vander Elst & David Veredas, 2014. "Disentangled Jump-Robust Realized Covariances and Correlations with Non-Synchronous Prices," Working Papers ECARES ECARES 2014-35, ULB -- Universite Libre de Bruxelles.
    5. Katerina Papagiannouli, 2022. "A Lepskiĭ-type stopping rule for the covariance estimation of multi-dimensional Lévy processes," Statistical Inference for Stochastic Processes, Springer, vol. 25(3), pages 505-535, October.
    6. Altmeyer, Randolf & Bibinger, Markus, 2015. "Functional stable limit theorems for quasi-efficient spectral covolatility estimators," Stochastic Processes and their Applications, Elsevier, vol. 125(12), pages 4556-4600.
    7. Peter Reinhard Hansen & Guillaume Horel & Asger Lunde & Ilya Archakov, 2015. "A Markov Chain Estimator of Multivariate Volatility from High Frequency Data," CREATES Research Papers 2015-19, Department of Economics and Business Economics, Aarhus University.
    8. Liao, Yin & Anderson, Heather M., 2019. "Testing for cojumps in high-frequency financial data: An approach based on first-high-low-last prices," Journal of Banking & Finance, Elsevier, vol. 99(C), pages 252-274.
    9. Markus Bibinger & Nikolaus Hautsch & Peter Malec & Markus Reiss, 2019. "Estimating the Spot Covariation of Asset Prices—Statistical Theory and Empirical Evidence," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 37(3), pages 419-435, July.
    10. Li, Degui, 2024. "Estimation of Large Dynamic Covariance Matrices: A Selective Review," Econometrics and Statistics, Elsevier, vol. 29(C), pages 16-30.
    11. Fulvio Corsi & Stefano Peluso & Francesco Audrino, 2015. "Missing in Asynchronicity: A Kalman‐em Approach for Multivariate Realized Covariance Estimation," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(3), pages 377-397, April.
    12. Yiqi Liu & Qiang Liu & Zhi Liu & Deng Ding, 2017. "Determining the integrated volatility via limit order books with multiple records," Quantitative Finance, Taylor & Francis Journals, vol. 17(11), pages 1697-1714, November.
    13. Haugom, Erik & Lien, Gudbrand & Veka, Steinar & Westgaard, Sjur, 2014. "Covariance estimation using high-frequency data: Sensitivities of estimation methods," Economic Modelling, Elsevier, vol. 43(C), pages 416-425.
    14. Michael Pfarrhofer, 2024. "Forecasts with Bayesian vector autoregressions under real time conditions," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(3), pages 771-801, April.
    15. Bollerslev, Tim & Patton, Andrew J. & Quaedvlieg, Rogier, 2018. "Modeling and forecasting (un)reliable realized covariances for more reliable financial decisions," Journal of Econometrics, Elsevier, vol. 207(1), pages 71-91.
    16. Alvarez, Agustín & Boente, Graciela & Kudraszow, Nadia, 2019. "Robust sieve estimators for functional canonical correlation analysis," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 46-62.
    17. Ruijun Bu & Degui Li & Oliver Linton & Hanchao Wang, 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Working Papers 202212, University of Liverpool, Department of Economics.
    18. Clinet, Simon & Potiron, Yoann, 2018. "Efficient asymptotic variance reduction when estimating volatility in high frequency data," Journal of Econometrics, Elsevier, vol. 206(1), pages 103-142.
    19. Andrzej Chmielowiec, 2021. "Algorithm for error-free determination of the variance of all contiguous subsequences and fixed-length contiguous subsequences for a sequence of industrial measurement data," Computational Statistics, Springer, vol. 36(4), pages 2813-2840, December.
    20. Ariane Hanebeck & Bernhard Klar, 2021. "Smooth distribution function estimation for lifetime distributions using Szasz–Mirakyan operators," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(6), pages 1229-1247, December.
    21. Xinyu Song, 2019. "Large Volatility Matrix Prediction with High-Frequency Data," Papers 1907.01196, arXiv.org, revised Sep 2019.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:186:y:2021:i:c:s0047259x21000610. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.