IDEAS home Printed from https://ideas.repec.org/p/zbw/irtgdp/2021003.html
   My bibliography  Save this paper

K-expectiles clustering

Author

Listed:
  • Wang, Bingling
  • Li, Yingxing
  • Härdle, Wolfgang

Abstract

K-means clustering is one of the most widely-used partitioning algorithm in cluster analysis due to its simplicity and computational efficiency, but it may not provide ideal clustering results when applying to data with non-spherically shaped clusters. By considering the asymmetrically weighted distance, We propose the K-expectile clustering and search the clusters via a greedy algorithm that minimizes the within cluster τ -variance. We provide algorithms based on two schemes: the fixed τ clustering, and the adaptive τ clustering. Validated by simulation results, our method has enhanced performance on data with asymmetric shaped clusters or clusters with a complicated structure. Applications of our method show that the fixed τ clustering can bring some flexibility on segmentation with a decent accuracy, while the adaptive τ clustering may yield better performance.

Suggested Citation

  • Wang, Bingling & Li, Yingxing & Härdle, Wolfgang, 2021. "K-expectiles clustering," IRTG 1792 Discussion Papers 2021-003, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
  • Handle: RePEc:zbw:irtgdp:2021003
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/233511/1/1755349009.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Kuan, Chung-Ming & Yeh, Jin-Huei & Hsu, Yu-Chin, 2009. "Assessing value at risk with CARE, the Conditional Autoregressive Expectile models," Journal of Econometrics, Elsevier, vol. 150(2), pages 261-270, June.
    2. Johanna F. Ziegel, 2016. "Coherence And Elicitability," Mathematical Finance, Wiley Blackwell, vol. 26(4), pages 901-918, October.
    3. Trimborn, Simon & Härdle, Wolfgang Karl, 2018. "CRIX an Index for cryptocurrencies," Journal of Empirical Finance, Elsevier, vol. 49(C), pages 107-122.
    4. Kim, Joonpyo & Oh, Hee-Seok, 2020. "Pseudo-quantile functional data clustering," Journal of Multivariate Analysis, Elsevier, vol. 178(C).
    5. Zhang, Yingying & Wang, Huixia Judy & Zhu, Zhongyi, 2019. "Quantile-regression-based clustering for panel data," Journal of Econometrics, Elsevier, vol. 213(1), pages 54-67.
    6. Julien Jacques & Cristian Preda, 2014. "Functional data clustering: a survey," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 231-255, September.
    7. Koenker, Roger W & Bassett, Gilbert, Jr, 1978. "Regression Quantiles," Econometrica, Econometric Society, vol. 46(1), pages 33-50, January.
    8. Kim, Alisa & Trimborn, Simon & Härdle, Wolfgang Karl, 2021. "VCRIX — A volatility index for crypto-currencies," International Review of Financial Analysis, Elsevier, vol. 78(C).
    9. Tran, Ngoc M. & Burdejová, Petra & Ospienko, Maria & Härdle, Wolfgang K., 2019. "Principal component analysis in an asymmetric norm," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 1-21.
    10. Yingxing Li & David Ruppert, 2008. "On the asymptotics of penalized splines," Biometrika, Biometrika Trust, vol. 95(2), pages 415-436.
    11. Prates, Marcos Oliveira & Lachos, Victor Hugo & Barbosa Cabral, Celso Rômulo, 2013. "mixsmsn: Fitting Finite Mixture of Scale Mixture of Skew-Normal Distributions," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 54(i12).
    12. Aigner, D J & Amemiya, Takeshi & Poirier, Dale J, 1976. "On the Estimation of Production Frontiers: Maximum Likelihood Estimation of the Parameters of a Discontinuous Density Function," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 17(2), pages 377-396, June.
    13. J. Ramsay, 1982. "When the data are functions," Psychometrika, Springer;The Psychometric Society, vol. 47(4), pages 379-396, December.
    14. Sobotka, Fabian & Kneib, Thomas, 2012. "Geoadditive expectile regression," Computational Statistics & Data Analysis, Elsevier, vol. 56(4), pages 755-767.
    15. Newey, Whitney K & Powell, James L, 1987. "Asymmetric Least Squares Estimation and Testing," Econometrica, Econometric Society, vol. 55(4), pages 819-847, July.
    16. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Konstantin Häusler & Hongyu Xia, 2022. "Indices on cryptocurrencies: an evaluation," Digital Finance, Springer, vol. 4(2), pages 149-167, September.
    2. Konstantin Hausler & Wolfgang Karl Hardle, 2021. "Cryptocurrency Dynamics: Rodeo or Ascot?," Papers 2103.12461, arXiv.org, revised Jan 2022.
    3. Aneiros, Germán & Horová, Ivana & Hušková, Marie & Vieu, Philippe, 2022. "On functional data analysis and related topics," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    4. Häusler, Konstantin & Härdle, Wolfgang, 2021. "Rodeo or ascot: Which hat to wear at the crypto race?," IRTG 1792 Discussion Papers 2021-007, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ibrahim M. Almanjahie & Salim Bouzebda & Zoulikha Kaid & Ali Laksaci, 2024. "The local linear functional kNN estimator of the conditional expectile: uniform consistency in number of neighbors," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 87(8), pages 1007-1035, November.
    2. Mohammedi, Mustapha & Bouzebda, Salim & Laksaci, Ali, 2021. "The consistency and asymptotic normality of the kernel type expectile regression estimator for functional data," Journal of Multivariate Analysis, Elsevier, vol. 181(C).
    3. Kneib, Thomas & Silbersdorff, Alexander & Säfken, Benjamin, 2023. "Rage Against the Mean – A Review of Distributional Regression Approaches," Econometrics and Statistics, Elsevier, vol. 26(C), pages 99-123.
    4. Daouia, Abdelaati & Girard, Stéphane & Stupfler, Gilles, 2021. "ExpectHill estimation, extreme risk and heavy tails," Journal of Econometrics, Elsevier, vol. 221(1), pages 97-117.
    5. Man, Rebeka & Tan, Kean Ming & Wang, Zian & Zhou, Wen-Xin, 2024. "Retire: Robust expectile regression in high dimensions," Journal of Econometrics, Elsevier, vol. 239(2).
    6. repec:hum:wpaper:sfb649dp2016-058 is not listed on IDEAS
    7. Laura Garcia-Jorcano & Lidia Sanchis-Marco, 2023. "Measuring Systemic Risk Using Multivariate Quantile-Located ES Models," Journal of Financial Econometrics, Oxford University Press, vol. 21(1), pages 1-72.
    8. Chao, Shih-Kang & Härdle, Wolfgang Karl & Huang, Chen, 2016. "Multivariate factorisable sparse asymmetric least squares regression," SFB 649 Discussion Papers 2016-058, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    9. Taoufik Bouezmarni & Mohamed Doukali & Abderrahim Taamouti, 2024. "Testing Granger non-causality in expectiles," Econometric Reviews, Taylor & Francis Journals, vol. 43(1), pages 30-51, January.
    10. Otto-Sobotka, Fabian & Salvati, Nicola & Ranalli, Maria Giovanna & Kneib, Thomas, 2019. "Adaptive semiparametric M-quantile regression," Econometrics and Statistics, Elsevier, vol. 11(C), pages 116-129.
    11. Litimein, Ouahiba & Laksaci, Ali & Mechab, Boubaker & Bouzebda, Salim, 2023. "Local linear estimate of the functional expectile regression," Statistics & Probability Letters, Elsevier, vol. 192(C).
    12. Zhang, Feipeng & Xu, Yixiong & Fan, Caiyun, 2023. "Nonparametric inference of expectile-based value-at-risk for financial time series with application to risk assessment," International Review of Financial Analysis, Elsevier, vol. 90(C).
    13. Daouia, Abdelaati & Girard, Stéphane & Stupfler, Gilles, 2018. "Tail expectile process and risk assessment," TSE Working Papers 18-944, Toulouse School of Economics (TSE).
    14. Lina Liao & Cheolwoo Park & Hosik Choi, 2019. "Penalized expectile regression: an alternative to penalized quantile regression," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(2), pages 409-438, April.
    15. Zhang, Feipeng & Li, Qunhua, 2017. "A continuous threshold expectile model," Computational Statistics & Data Analysis, Elsevier, vol. 116(C), pages 49-66.
    16. Daouia, Abdelaati & Padoan, Simone A. & Stupfler, Gilles, 2023. "Extreme expectile estimation for short-tailed data, with an application to market risk assessment," TSE Working Papers 23-1414, Toulouse School of Economics (TSE), revised May 2024.
    17. Garcia-Jorcano, Laura & Sanchis-Marco, Lidia, 2021. "Systemic-systematic risk in financial system: A dynamic ranking based on expectiles," International Review of Economics & Finance, Elsevier, vol. 75(C), pages 330-365.
    18. Jun Zhao & Guan’ao Yan & Yi Zhang, 2022. "Robust estimation and shrinkage in ultrahigh dimensional expectile regression with heavy tails and variance heterogeneity," Statistical Papers, Springer, vol. 63(1), pages 1-28, February.
    19. Aneiros, Germán & Horová, Ivana & Hušková, Marie & Vieu, Philippe, 2022. "On functional data analysis and related topics," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    20. Daouia, Abdelaati & Stupfler, Gilles & Usseglio-Carleve, Antoine, 2023. "An expectile computation cookbook," TSE Working Papers 23-1458, Toulouse School of Economics (TSE).
    21. Girard, Stéphane & Stupfler, Gilles & Usseglio-Carleve, Antoine, 2022. "Functional estimation of extreme conditional expectiles," Econometrics and Statistics, Elsevier, vol. 21(C), pages 131-158.

    More about this item

    Keywords

    clustering; expectiles; asymmetric quadratic loss; image segmentation;
    All these keywords.

    JEL classification:

    • C00 - Mathematical and Quantitative Methods - - General - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:irtgdp:2021003. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/wfhubde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.