An extension of the K-means algorithm to clustering skewed data
Author
Abstract
Suggested Citation
DOI: 10.1007/s00180-018-0821-z
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Sharon Lee & Geoffrey McLachlan, 2013. "Rejoinder to the discussion of “Model-based clustering and classification with non-normal mixture distributions”," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(4), pages 473-479, November.
- Brock, Guy & Pihur, Vasyl & Datta, Susmita & Datta, Somnath, 2008. "clValid: An R Package for Cluster Validation," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 25(i04).
- Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
- Sharon Lee & Geoffrey McLachlan, 2013. "Model-based clustering and classification with non-normal mixture distributions," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(4), pages 427-454, November.
- Melnykov, Volodymyr & Shen, Gang, 2013. "Clustering through empirical likelihood ratio," Computational Statistics & Data Analysis, Elsevier, vol. 62(C), pages 1-10.
- Efron B. & Tibshirani R. & Storey J.D. & Tusher V., 2001. "Empirical Bayes Analysis of a Microarray Experiment," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1151-1160, December.
- Melnykov, Volodymyr & Melnykov, Igor, 2012. "Initializing the EM algorithm in Gaussian mixture models with an unknown number of components," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1381-1395.
- Melnykov, Igor & Melnykov, Volodymyr, 2014. "On K-means algorithm with the use of Mahalanobis distances," Statistics & Probability Letters, Elsevier, vol. 84(C), pages 88-95.
- Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
- Lawrence Hubert & Phipps Arabie, 1985. "Comparing partitions," Journal of Classification, Springer;The Classification Society, vol. 2(1), pages 193-218, December.
- Celeux, Gilles & Govaert, Gerard, 1992. "A classification EM algorithm for clustering and two stochastic versions," Computational Statistics & Data Analysis, Elsevier, vol. 14(3), pages 315-332, October.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Peihuang Huang & Pei Yao & Zhendong Hao & Huihong Peng & Longkun Guo, 2021. "Improved Constrained k -Means Algorithm for Clustering with Domain Knowledge," Mathematics, MDPI, vol. 9(19), pages 1-14, September.
- İsmail Güzel & Atabey Kaygun, 2022. "A new non-archimedean metric on persistent homology," Computational Statistics, Springer, vol. 37(4), pages 1963-1983, September.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Zhu, Xuwen & Melnykov, Volodymyr, 2018. "Manly transformation in finite mixture modeling," Computational Statistics & Data Analysis, Elsevier, vol. 121(C), pages 190-208.
- Morris, Katherine & Punzo, Antonio & McNicholas, Paul D. & Browne, Ryan P., 2019. "Asymmetric clusters and outliers: Mixtures of multivariate contaminated shifted asymmetric Laplace distributions," Computational Statistics & Data Analysis, Elsevier, vol. 132(C), pages 145-166.
- Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2017. "Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 141-156.
- Derek S. Young & Xi Chen & Dilrukshi C. Hewage & Ricardo Nilo-Poyanco, 2019. "Finite mixture-of-gamma distributions: estimation, inference, and model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(4), pages 1053-1082, December.
- Sylvia Frühwirth-Schnatter & Gertraud Malsiner-Walli, 2019. "From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 33-64, March.
- Cristina Tortora & Brian C. Franczak & Ryan P. Browne & Paul D. McNicholas, 2019. "A Mixture of Coalesced Generalized Hyperbolic Distributions," Journal of Classification, Springer;The Classification Society, vol. 36(1), pages 26-57, April.
- Melnykov, Volodymyr & Zhu, Xuwen, 2018. "On model-based clustering of skewed matrix data," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 181-194.
- Sharon Lee & Geoffrey McLachlan, 2013. "Model-based clustering and classification with non-normal mixture distributions," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(4), pages 427-454, November.
- Xuwen Zhu, 2019. "Probability of misclassification in model-based clustering," Computational Statistics, Springer, vol. 34(3), pages 1427-1442, September.
- Barbara Su, 2023. "Banking practices and borrowing firms’ financial reporting quality: evidence from bank cross-selling," Review of Accounting Studies, Springer, vol. 28(1), pages 201-236, March.
- Shaikh, Ibrahim A. & O'Brien, Jonathan Paul & Peters, Lois, 2018. "Inside directors and the underinvestment of financial slack towards R&D-intensity in high-technology firms," Journal of Business Research, Elsevier, vol. 82(C), pages 192-201.
- Mikel Bedayo & Gabriel Jiménez & José-Luis Peydró & Raquel Vegas, 2020.
"Screening and Loan Origination Time: Lending Standards, Loan Defaults and Bank Failures,"
Working Papers
1215, Barcelona School of Economics.
- Mikel Bedayo & Gabriel Jiménez & José-Luis Peydró & Raquel Vegas, 2020. "Screening and loan origination time: lending standards, loan defaults and bank failures," Working Papers 2037, Banco de España.
- Peydró, José-Luis & Jiménez, Gabriel & Bedayo, Mikel & Vegas, Raquel, 2020. "Screening and Loan Origination Time: Lending Standards, Loan Defaults and Bank Failures," CEPR Discussion Papers 15445, C.E.P.R. Discussion Papers.
- Mikel Bedayo & Gabriel Jiménez & José-Luis Peydró & Raquel Vegas, 2020. "Screening and loan origination time: lending standards, loan defaults and bank failures," Economics Working Papers 1749, Department of Economics and Business, Universitat Pompeu Fabra, revised Aug 2022.
- Bedayo, Mikel & Jiménez, Gabriel & Peydró, José Luis & Vegas, Raquel, 2023. "Screening and loan origination time: Lending standards, loan defaults and bank failures," EconStor Preprints 225986, ZBW - Leibniz Information Centre for Economics.
- Ruey-Ching Hwang, 2013. "Forecasting credit ratings with the varying-coefficient model," Quantitative Finance, Taylor & Francis Journals, vol. 13(12), pages 1947-1965, December.
- Antonio Davila & George Foster & Xiaobin He & Carlos Shimizu, 2015. "The rise and fall of startups: Creation and destruction of revenue and jobs by young companies," Australian Journal of Management, Australian School of Business, vol. 40(1), pages 6-35, February.
- Masahiro Enomoto, 2018. "Effects of Corporate Governance on the Relationship between Accounting Quality and Trade Credit: Evidence from Japan," Discussion Paper Series DP2018-12, Research Institute for Economics & Business Administration, Kobe University, revised Dec 2023.
- Chen, Peimin & Wu, Chunchi, 2014. "Default prediction with dynamic sectoral and macroeconomic frailties," Journal of Banking & Finance, Elsevier, vol. 40(C), pages 211-226.
- Knyazeva, Anzhela & Knyazeva, Diana, 2012. "Does being your bank’s neighbor matter?," Journal of Banking & Finance, Elsevier, vol. 36(4), pages 1194-1209.
- Giordani, Paolo & Jacobson, Tor & Schedvin, Erik von & Villani, Mattias, 2014.
"Taking the Twists into Account: Predicting Firm Bankruptcy Risk with Splines of Financial Ratios,"
Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 49(4), pages 1071-1099, August.
- Giordani, Paolo & Jacobson, Tor & von Schedvin , Erik & Villani, Mattias, 2011. "Taking the Twists into Account: Predicting Firm Bankruptcy Risk with Splines of Financial Ratios," Working Paper Series 256, Sveriges Riksbank (Central Bank of Sweden).
- Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
- Suzan Hol, 2006. "The influence of the business cycle on bankruptcy probability," Discussion Papers 466, Statistics Norway, Research Department.
More about this item
Keywords
Exponential transformation; CEM algorithm; Cluster analysis; Skewness;All these keywords.
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:34:y:2019:i:1:d:10.1007_s00180-018-0821-z. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.