Printed from https://ideas.repec.org/a/bla/jorssb/v76y2014i1p97-111.html

Non-parametric identification and estimation of the number of components in multivariate mixtures

Author

Listed:
  • Hiroyuki Kasahara
  • Katsumi Shimotsu

Abstract

We analyse the identifiability of the number of components in k-variate, M-component finite mixture models in which each component distribution has independent marginals, including models in latent class analysis. Without making parametric assumptions on the component distributions, we investigate how one can identify the number of components from the distribution function of the observed data. When k≥2, a lower bound on the number of components (M) is non-parametrically identifiable from the rank of a matrix constructed from the distribution function of the observed variables. Building on this identification condition, we develop a procedure to estimate a lower bound on the number of components consistently.
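
The rank argument in the abstract can be illustrated with a small population-level sketch. It assumes k = 2 and that both observed variables are discrete (or have been discretized into cells), so the joint distribution is a matrix P with P[a, b] = sum over m of weight_m * Pr(X1 = a | m) * Pr(X2 = b | m). That matrix has rank at most M, so its rank is a lower bound on M. The function names, weights, and probability tables below are hypothetical and chosen only for illustration; this is not the estimation procedure developed in the paper, which works with sample data and has to account for sampling error in the estimated matrix.

```python
import numpy as np


def joint_probability_matrix(weights, p_x1, p_x2):
    """Joint pmf of (X1, X2) under conditional independence within components.

    weights: length-M mixing proportions.
    p_x1, p_x2: M x (number of cells) tables of component-wise marginals.
    Returns P with P[a, b] = sum_m weights[m] * p_x1[m, a] * p_x2[m, b].
    """
    weights = np.asarray(weights, dtype=float)
    p_x1 = np.asarray(p_x1, dtype=float)
    p_x2 = np.asarray(p_x2, dtype=float)
    return p_x1.T @ (weights[:, None] * p_x2)


def rank_lower_bound(joint, tol=1e-10):
    """Numerical rank of the joint matrix, a lower bound on M."""
    singular_values = np.linalg.svd(joint, compute_uv=False)
    return int(np.sum(singular_values > tol))


# Hypothetical three-component mixture (M = 3), illustrative numbers only.
weights = [0.5, 0.3, 0.2]
p_x1 = [[0.7, 0.2, 0.1],
        [0.1, 0.8, 0.1],
        [0.2, 0.2, 0.6]]
p_x2 = [[0.6, 0.3, 0.1],
        [0.2, 0.5, 0.3],
        [0.1, 0.1, 0.8]]

joint = joint_probability_matrix(weights, p_x1, p_x2)
bound = rank_lower_bound(joint)

# If two components share the same marginal for X1, the rank drops below M,
# which is why the rank gives only a lower bound in general.
p_x1_degenerate = [[0.7, 0.2, 0.1],
                   [0.7, 0.2, 0.1],
                   [0.2, 0.2, 0.6]]
joint_degenerate = joint_probability_matrix(weights, p_x1_degenerate, p_x2)
bound_degenerate = rank_lower_bound(joint_degenerate)

print(bound, bound_degenerate)
```

In the first configuration the component marginals are linearly independent, so the rank equals M; in the second, two components are indistinguishable through X1 and the bound is strict.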

Suggested Citation

  • Hiroyuki Kasahara & Katsumi Shimotsu, 2014. "Non-parametric identification and estimation of the number of components in multivariate mixtures," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 97-111, January.
  • Handle: RePEc:bla:jorssb:v:76:y:2014:i:1:p:97-111

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1111/rssb.2013.76.issue-1
    Download Restriction: Access to full text is restricted to subscribers.

    References listed on IDEAS

    1. Andrews, Donald W. K., 1987. "Asymptotic Results for Generalized Wald Tests," Econometric Theory, Cambridge University Press, vol. 3(3), pages 348-358, June.
    2. Kleibergen, Frank & Paap, Richard, 2006. "Generalized reduced rank tests using the singular value decomposition," Journal of Econometrics, Elsevier, vol. 133(1), pages 97-126, July.
    3. Woo, Mi-Ja & Sriram, T.N., 2006. "Robust Estimation of Mixture Complexity," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1475-1486, December.
    4. M. Levine & D. R. Hunter & D. Chauveau, 2011. "Maximum smoothed likelihood for multivariate mixtures," Biometrika, Biometrika Trust, vol. 98(2), pages 403-416.
    5. T. P. Hettmansperger & Hoben Thomas, 2000. "Almost nonparametric inference for repeated measures in mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(4), pages 811-825.
    6. Peter Hall & Amnon Neeman & Reza Pakyari & Ryan Elmore, 2005. "Nonparametric inference in multivariate mixtures," Biometrika, Biometrika Trust, vol. 92(3), pages 667-678, September.
    7. Dunson, David B. & Xing, Chuanhua, 2009. "Nonparametric Bayes Modeling of Multivariate Categorical Data," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1042-1051.
    8. Robert Mislevy, 1984. "Estimating latent distributions," Psychometrika, Springer;The Psychometric Society, vol. 49(3), pages 359-381, September.
    9. Hiroyuki Kasahara & Katsumi Shimotsu, 2009. "Nonparametric Identification of Finite Mixture Models of Dynamic Discrete Choices," Econometrica, Econometric Society, vol. 77(1), pages 135-175, January.
    10. Lutkepohl, Helmut & Burda, Maike M., 1997. "Modified Wald tests under nonregular conditions," Journal of Econometrics, Elsevier, vol. 78(2), pages 315-332, June.
    11. I. R. Cruz‐Medina & T. P. Hettmansperger & H. Thomas, 2004. "Semiparametric mixture models and repeated measures: the multinomial cut point model," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 53(3), pages 463-474, August.

    Citations

    Cited by:

    1. Konstantin T. Matchev & Prasanth Shyamsundar, 2020. "InClass Nets: Independent Classifier Networks for Nonparametric Estimation of Conditional Independence Mixture Models and Unsupervised Classification," Papers 2009.00131, arXiv.org.
    2. Paul Schrimpf & Michio Suzuki & Hiroyuki Kasahara, 2015. "Identification and Estimation of Production Function with Unobserved Heterogeneity," 2015 Meeting Papers 924, Society for Economic Dynamics.
    3. Bonhomme, Stéphane & Jochmans, Koen & Robin, Jean-Marc, 2017. "Nonparametric estimation of non-exchangeable latent-variable models," Journal of Econometrics, Elsevier, vol. 201(2), pages 237-248.
    4. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2016. "Non-parametric estimation of finite mixtures from repeated measurements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(1), pages 211-229, January.
    5. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2014. "Estimating Multivariate Latent-Structure Models," Working Papers hal-01097135, HAL.
    6. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2014. "Nonparametric spectral-based estimation of latent structures," CeMMAP working papers 18/14, Institute for Fiscal Studies.
    7. Qihui Chen & Zheng Fang, 2018. "Improved Inference on the Rank of a Matrix," Papers 1812.02337, arXiv.org, revised Mar 2019.
    8. Jean-Marc Robin & Stéphane Bonhomme & Koen Jochmans, 2014. "Estimating Multivariate Latent-Structure Models," Sciences Po Economics Discussion Papers 2014-18, Sciences Po Departement of Economics.
    9. Aureo de Paula & Xun Tang, 2020. "Testable Implications of Multiple Equilibria in Discrete Games with Correlated Types," Papers 2012.00787, arXiv.org.
    10. Johannes F. Jörg & Catherine Cleophas, 2022. "Nonparametric estimation of customer segments from censored sales panel data," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 21(4), pages 393-417, August.
    11. Erhao Xie, 2018. "Inference in Games Without Nash Equilibrium: An Application to Restaurants, Competition in Opening Hours," Staff Working Papers 18-60, Bank of Canada.
    12. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2017. "Nonparametric estimation of non-exchangeable latent-variable models," Sciences Po publications info:hdl:2441/4m4fqk908d9, Sciences Po.
    13. Yu Hao & Hiroyuki Kasahara, 2022. "Testing the Number of Components in Finite Mixture Normal Regression Model with Panel Data," Papers 2210.02824, arXiv.org, revised Jun 2023.
    14. Xu, Ke-Li, 2018. "A semi-nonparametric estimator of regression discontinuity design with discrete duration outcomes," Journal of Econometrics, Elsevier, vol. 206(1), pages 258-278.
    15. David Balan & Patrick DeGraba & Francine Lafontaine & Patrick McAlvanah & Devesh Raval & David Schmidt, 2015. "Economics at the FTC: Fraud, Mergers and Exclusion," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 47(4), pages 371-398, December.
    16. Krasnokutskaya, Elena & Song, Kyungchul & Tang, Xun, 2022. "Estimating unobserved individual heterogeneity using pairwise comparisons," Journal of Econometrics, Elsevier, vol. 226(2), pages 477-497.
    17. Bagkavos, Dimitrios & Patil, Prakash N., 2023. "Goodness-of-fit testing for normal mixture densities," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
    18. Hiroaki Masuhara, 2019. "Identifying finite mixture models in the presence of moment-generating function: application in medical care using a zero-inflated binomial model," Economics Bulletin, AccessEcon, vol. 39(2), pages 1529-1537.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2016. "Non-parametric estimation of finite mixtures from repeated measurements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(1), pages 211-229, January.
    2. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2014. "Estimating Multivariate Latent-Structure Models," Working Papers hal-01097135, HAL.
    3. Jean-Marc Robin & Stéphane Bonhomme & Koen Jochmans, 2014. "Estimating Multivariate Latent-Structure Models," Sciences Po Economics Discussion Papers 2014-18, Sciences Po Departement of Economics.
    4. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2014. "Nonparametric spectral-based estimation of latent structures," CeMMAP working papers CWP18/14, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    5. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2013. "Nonparametric estimation of finite mixtures," SciencePo Working papers hal-00972868, HAL.
    6. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2014. "Nonparametric estimation of finite measures," CeMMAP working papers 11/14, Institute for Fiscal Studies.
    7. Hema Yoganarasimhan, 2016. "Estimation of Beauty Contest Auctions," Marketing Science, INFORMS, vol. 35(1), pages 27-54, January.
    8. Konstantin T. Matchev & Prasanth Shyamsundar, 2020. "InClass Nets: Independent Classifier Networks for Nonparametric Estimation of Conditional Independence Mixture Models and Unsupervised Classification," Papers 2009.00131, arXiv.org.
    9. Al-Sadoon, Majid M., 2017. "A unifying theory of tests of rank," Journal of Econometrics, Elsevier, vol. 199(1), pages 49-62.
    10. Majid M. Al-Sadoon, 2014. "A general theory of rank testing," Economics Working Papers 1411, Department of Economics and Business, Universitat Pompeu Fabra, revised Feb 2015.
    11. Hiroyuki Kasahara & Katsumi Shimotsu, 2007. "Nonparametric Identification And Estimation Of Multivariate Mixtures," Working Paper 1153, Economics Department, Queen's University.
    12. Higgins, Ayden & Jochmans, Koen, 2023. "Identification of mixtures of dynamic discrete choices," Journal of Econometrics, Elsevier, vol. 237(1).
    13. Duplinskiy, A., 2014. "Is regularization necessary? A Wald-type test under non-regular conditions," Research Memorandum 025, Maastricht University, Graduate School of Business and Economics (GSBE).
    14. Kasahara, Hiroyuki & 笠原, 博幸 & Shimotsu, Katsumi & 下津, 克己, 2010. "Nonparametric Identification of Multivariate Mixtures," Discussion Papers 2010-09, Graduate School of Economics, Hitotsubashi University.
    15. Xu, Ke-Li, 2018. "A semi-nonparametric estimator of regression discontinuity design with discrete duration outcomes," Journal of Econometrics, Elsevier, vol. 206(1), pages 258-278.
    16. Qihui Chen & Zheng Fang, 2018. "Improved Inference on the Rank of a Matrix," Papers 1812.02337, arXiv.org, revised Mar 2019.
    17. Ruli Xiao, 2016. "Nonparametric Identification of Dynamic Games with Multiple Equilibria and Unobserved Heterogeneity," CAEPR Working Papers 2016-002, Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington.
    18. Christian Tien, 2022. "Instrumented Common Confounding," Papers 2206.12919, arXiv.org, revised Sep 2022.
    19. Chauveau, Didier & Hoang, Vy Thuy Lynh, 2016. "Nonparametric mixture models with conditionally independent multivariate component densities," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 1-16.
    20. Zaka Ratsimalahelo, 2003. "Strongly Consistent Determination of the Rank of Matrix," EERI Research Paper Series EERI_RP_2003_04, Economics and Econometrics Research Institute (EERI), Brussels.
