IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v108y2012icp41-52.html
   My bibliography  Save this article

On the upper bound of the number of modes of a multivariate normal mixture

Author

Listed:
  • Ray, Surajit
  • Ren, Dan

Abstract

The main result of this article states that one can get as many as D+1 modes from just a two component normal mixture in D dimensions. Multivariate mixture models are widely used for modeling homogeneous populations and for cluster analysis. Either the components directly or modes arising from these components are often used to extract individual clusters. Although in lower dimensions these strategies work well, our results show that high dimensional mixtures are often very complex and researchers should take extra precautions when using mixture models for cluster analysis. Further our analysis shows that the number of modes depends on the component means and eigenvalues of the ratio of the two component covariance matrices, which in turn provides a clear guideline as to when one can use mixture analysis for clustering high dimensional data.

Suggested Citation

  • Ray, Surajit & Ren, Dan, 2012. "On the upper bound of the number of modes of a multivariate normal mixture," Journal of Multivariate Analysis, Elsevier, vol. 108(C), pages 41-52.
  • Handle: RePEc:eee:jmvana:v:108:y:2012:i:c:p:41-52
    DOI: 10.1016/j.jmva.2012.02.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X12000401
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2012.02.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Chen, Jiahua & Tan, Xianming, 2009. "Inference for multivariate normal mixtures," Journal of Multivariate Analysis, Elsevier, vol. 100(7), pages 1367-1383, August.
    2. Jörn Dannemann & Hajo Holzmann, 2008. "Likelihood Ratio Testing for Hidden Markov Models Under Non‐standard Conditions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 35(2), pages 309-321, June.
    3. Hajo Holzmann & Sebastian Vollmer, 2008. "A likelihood ratio test for bimodality in two-component mixtures with application to regional income distribution in the EU," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 92(1), pages 57-69, February.
    4. M.‐Y. Cheng & P. Hall, 1998. "Calibrating the excess mass and dip tests of modality," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(3), pages 579-589.
    5. Christian Hennig, 2010. "Methods for merging Gaussian mixture components," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 4(1), pages 3-34, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Redivo, Edoardo & Nguyen, Hien D. & Gupta, Mayetri, 2020. "Bayesian clustering of skewed and multimodal data using geometric skewed normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    2. José E. Chacón, 2019. "Mixture model modal clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(2), pages 379-404, June.
    3. Bader Alruwaili, 2023. "The modality of skew t-distribution," Statistical Papers, Springer, vol. 64(2), pages 497-507, April.
    4. Chen, Yi-Ting & Sun, Edward W. & Lin, Yi-Bing, 2020. "Merging anomalous data usage in wireless mobile telecommunications: Business analytics with a strategy-focused data-driven approach for sustainability," European Journal of Operational Research, Elsevier, vol. 281(3), pages 687-705.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Seo, Byungtae & Kim, Daeyoung, 2012. "Root selection in normal mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 56(8), pages 2454-2470.
    2. Kim, Daeyoung & Seo, Byungtae, 2014. "Assessment of the number of components in Gaussian mixture models in the presence of multiple local maximizers," Journal of Multivariate Analysis, Elsevier, vol. 125(C), pages 100-120.
    3. Chacón, José E. & Fernández Serrano, Javier, 2024. "Bayesian taut splines for estimating the number of modes," Computational Statistics & Data Analysis, Elsevier, vol. 196(C).
    4. Roberto Rocci & Stefano Antonio Gattone & Roberto Di Mari, 2018. "A data driven equivariant approach to constrained Gaussian mixture modeling," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(2), pages 235-260, June.
    5. Arun Kumar Kuchibhotla & Somabha Mukherjee & Ayanendranath Basu, 2019. "Statistical inference based on bridge divergences," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(3), pages 627-656, June.
    6. Redivo, Edoardo & Nguyen, Hien D. & Gupta, Mayetri, 2020. "Bayesian clustering of skewed and multimodal data using geometric skewed normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    7. James Mitchell & Aubrey Poon & Dan Zhu, 2024. "Constructing density forecasts from quantile regressions: Multimodality in macrofinancial dynamics," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(5), pages 790-812, August.
    8. Zhu, Xuwen & Melnykov, Volodymyr, 2018. "Manly transformation in finite mixture modeling," Computational Statistics & Data Analysis, Elsevier, vol. 121(C), pages 190-208.
    9. Daniel J. Henderson & Christopher F. Parmeter & R. Robert Russell, 2008. "Modes, weighted modes, and calibrated modes: evidence of clustering using modality tests," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(5), pages 607-638.
    10. Di, J. & Kolaczyk, E., 2010. "Complexity-penalized estimation of minimum volume sets for dependent data," Journal of Multivariate Analysis, Elsevier, vol. 101(9), pages 1910-1926, October.
    11. Suren Basov & Svetlana Danilkina & David Prentice, 2020. "When Does Variety Increase with Quality?," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 56(3), pages 463-487, May.
    12. Holzmann, Hajo & Schwaiger, Florian, 2016. "Testing for the number of states in hidden Markov models," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 318-330.
    13. Semhar Michael & Volodymyr Melnykov, 2016. "An effective strategy for initializing the EM algorithm in finite mixture models," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 10(4), pages 563-583, December.
    14. Chaofeng Yuan & Wensheng Zhu & Xuming He & Jianhua Guo, 2019. "A mixture factor model with applications to microarray data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(1), pages 60-76, March.
    15. Branislav Panić & Marko Nagode & Jernej Klemenc & Simon Oman, 2022. "On Methods for Merging Mixture Model Components Suitable for Unsupervised Image Segmentation Tasks," Mathematics, MDPI, vol. 10(22), pages 1-22, November.
    16. Marek Śmieja & Magdalena Wiercioch, 2017. "Constrained clustering with a complex cluster structure," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(3), pages 493-518, September.
    17. Cavallo, Alberto & Rigobon, Roberto, 2011. "The Distribution of the Size of Price Changes," Working Papers 2011-011, Banco Central de Reserva del Perú.
    18. Nicolas Depraetere & Martina Vandebroek, 2014. "Order selection in finite mixtures of linear regressions," Statistical Papers, Springer, vol. 55(3), pages 871-911, August.
    19. repec:cte:wsrepe:ws1450804 is not listed on IDEAS
    20. José E. Chacón, 2020. "The Modal Age of Statistics," International Statistical Review, International Statistical Institute, vol. 88(1), pages 122-141, April.
    21. Peter Radchenko & Gourab Mukherjee, 2017. "Convex clustering via l 1 fusion penalization," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(5), pages 1527-1546, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:108:y:2012:i:c:p:41-52. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.