IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v55y2011i9p2739-2747.html

Bias from misspecification of the component variances in a normal mixture

Author

Listed:
  • Lo, Yungtai

Abstract

Bias in parameter estimates can be substantial when heteroscedastic normal mixtures are misspecified as homoscedastic normal mixtures, and vice versa. We show through simulations that the maximum likelihood estimators under the false assumption of equal variances are inconsistent, and that the bias in parameter estimates is appreciable, even substantial, when the mixture components are not well-separated. Finite-sample bias in parameter estimates is close to the asymptotic bias even for sample sizes of 200 or less. When homoscedastic normal mixtures are misspecified as heteroscedastic normal mixtures, the maximum likelihood estimators are consistent. However, the maximum likelihood estimators under a correctly specified homoscedastic mixture model converge to the true parameter values faster than those under a misspecified heteroscedastic mixture model. Under the false assumption of unequal variances, a lower bound is imposed on the component variances to ensure that the likelihood is bounded; the bias of the maximum likelihood estimators depends less on this lower bound when the sample size is 500 or more and the component distributions are well-separated. An example is given to demonstrate the effects of a misspecification of the component variances on estimates of the prevalence of hypertension using normal mixtures.
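
The kind of comparison the abstract describes can be sketched in a few lines of Python. This is a minimal illustration, not the paper's simulation design: the true parameter values, the sample size, the starting values, the number of EM iterations, and the variance floor `min_sd` are all assumptions chosen for the example, and the function names are hypothetical. It draws data from a two-component normal mixture with unequal variances, then fits the mixture by EM once with a common variance (misspecified) and once with separate variances (correctly specified).

```python
import numpy as np


def simulate_mixture(n, p, mu1, mu2, sd1, sd2, rng):
    """Draw n points from p*N(mu1, sd1^2) + (1-p)*N(mu2, sd2^2)."""
    from_first = rng.random(n) < p
    return np.where(from_first,
                    rng.normal(mu1, sd1, n),
                    rng.normal(mu2, sd2, n))


def normal_pdf(x, mu, sd):
    return np.exp(-0.5 * ((x - mu) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))


def fit_em(x, equal_var, n_iter=300, min_sd=0.05):
    """EM for a two-component normal mixture.

    equal_var=True forces a common standard deviation; otherwise each
    component has its own. min_sd is an illustrative lower bound that
    keeps the likelihood bounded in the unequal-variance case.
    """
    # Crude starting values (an assumption): split the sample at its median.
    med = np.median(x)
    p = 0.5
    mu = np.array([x[x <= med].mean(), x[x > med].mean()])
    sd = np.full(2, x.std() / 2.0)
    for _ in range(n_iter):
        # E-step: posterior probability that each point is from component 1.
        w1 = p * normal_pdf(x, mu[0], sd[0])
        w2 = (1.0 - p) * normal_pdf(x, mu[1], sd[1])
        r = w1 / (w1 + w2)
        # M-step: weighted updates of the proportion, means, and variances.
        p = r.mean()
        mu = np.array([np.sum(r * x) / np.sum(r),
                       np.sum((1.0 - r) * x) / np.sum(1.0 - r)])
        ss1 = np.sum(r * (x - mu[0]) ** 2)
        ss2 = np.sum((1.0 - r) * (x - mu[1]) ** 2)
        if equal_var:
            sd = np.full(2, np.sqrt((ss1 + ss2) / x.size))
        else:
            sd = np.sqrt(np.array([ss1 / np.sum(r), ss2 / np.sum(1.0 - r)]))
        sd = np.maximum(sd, min_sd)
    return {"p": p, "mu": mu, "sd": sd}


# Illustrative true values (assumptions, not taken from the paper).
rng = np.random.default_rng(0)
x = simulate_mixture(n=5000, p=0.7, mu1=0.0, mu2=3.0, sd1=1.0, sd2=2.0, rng=rng)

fit_equal = fit_em(x, equal_var=True)     # misspecified: common variance
fit_unequal = fit_em(x, equal_var=False)  # correctly specified

print("true p = 0.7")
print("equal-variance fit:  ", fit_equal)
print("unequal-variance fit:", fit_unequal)
```

Comparing the printed mixing proportion and means from the two fits against the true values gives a rough sense of the misspecification bias for one sample; repeating the draw over many seeds and sample sizes would be needed to estimate the bias itself.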

Suggested Citation

  • Lo, Yungtai, 2011. "Bias from misspecification of the component variances in a normal mixture," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2739-2747, September.
  • Handle: RePEc:eee:csdana:v:55:y:2011:i:9:p:2739-2747

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947311001368
    Download Restriction: Full text for ScienceDirect subscribers only.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lo, Yungtai, 2005. "Likelihood ratio tests of the number of components in a normal mixture with unequal variances," Statistics & Probability Letters, Elsevier, vol. 71(3), pages 225-235, March.
    2. Giuliano Galimberti & Lorenzo Nuzzi & Gabriele Soffritti, 2021. "Covariance matrix estimation of the maximum likelihood estimator in multivariate clusterwise linear regression," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 235-268, March.
    3. Polymenis, A. & Titterington, D. M., 1998. "On the determination of the number of components in a mixture," Statistics & Probability Letters, Elsevier, vol. 38(4), pages 295-298, July.
    4. Daniel Fernández & Richard Arnold & Shirley Pledger & Ivy Liu & Roy Costilla, 2019. "Finite mixture biclustering of discrete type multivariate data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 117-143, March.
    5. Vaidehi Dixit & Ryan Martin, 2022. "Estimating a Mixing Distribution on the Sphere Using Predictive Recursion," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(2), pages 596-626, November.
    6. Maria Grazia Pittau & Roberto Zelli, 2006. "Empirical evidence of income dynamics across EU regions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 21(5), pages 605-628, July.
    7. Roberto Colombi & Sabrina Giordano, 2019. "Likelihood-based tests for a class of misspecified finite mixture models for ordinal categorical data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(4), pages 1175-1202, December.
    8. Kim, Jae-Young, 2014. "An alternative quasi likelihood approach, Bayesian analysis and data-based inference for model specification," Journal of Econometrics, Elsevier, vol. 178(P1), pages 132-145.
    9. Roy Levy & Gregory R. Hancock, 2011. "An Extended Model Comparison Framework for Covariance and Mean Structure Models, Accommodating Multiple Groups and Latent Mixtures," Sociological Methods & Research, vol. 40(2), pages 256-278, May.
    10. Kenneth C. Land & Patricia L. McCall & Daniel S. Nagin, 1996. "A Comparison of Poisson, Negative Binomial, and Semiparametric Mixed Poisson Regression Models," Sociological Methods & Research, vol. 24(4), pages 387-442, May.
    11. Hung-Chia Chen & James J. Chen, 2016. "Hybrid Mixture Model for Subpopulation Identification," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 8(1), pages 28-42, June.
    12. Bettina Grün & Gertraud Malsiner-Walli & Sylvia Frühwirth-Schnatter, 2022. "How many data clusters are in the Galaxy data set?," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(2), pages 325-349, June.
    13. P.A.V.B. Swamy & I-Lok Chang & Jatinder S. Mehta & William H. Greene & Stephen G. Hall & George S. Tavlas, 2016. "Removing Specification Errors from the Usual Formulation of Binary Choice Models," Econometrics, MDPI, vol. 4(2), pages 1-21, June.
    14. Carlo Altavilla & Raffaella Giacomini & Giuseppe Ragusa, 2017. "Anchoring the yield curve using survey expectations," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(6), pages 1055-1068, September.
    15. Fernando Rios-Avila & Gustavo Canavire-Bacarreza, 2018. "Standard-error correction in two-stage optimization models: A quasi–maximum likelihood estimation approach," Stata Journal, StataCorp LP, vol. 18(1), pages 206-222, March.
    16. Sandy Fréret & Denis Maguain, 2017. "The effects of agglomeration on tax competition: evidence from a two-regime spatial panel model on French data," International Tax and Public Finance, Springer;International Institute of Public Finance, vol. 24(6), pages 1100-1140, December.
    17. Ai, Chunrong & Chen, Xiaohong, 2007. "Estimation of possibly misspecified semiparametric conditional moment restriction models with different conditioning variables," Journal of Econometrics, Elsevier, vol. 141(1), pages 5-43, November.
    18. Fetene B. Tekle & Dereje W. Gudicha & Jeroen K. Vermunt, 2016. "Power analysis for the bootstrap likelihood ratio test for the number of classes in latent class models," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 10(2), pages 209-224, June.
    19. Gregory, Allan W. & McCurdy, Thomas H., 1986. "The unbiasedness hypothesis in the forward foreign exchange market: A specification analysis with application to France, Italy, Japan, the United Kingdom and West Germany," European Economic Review, Elsevier, vol. 30(2), pages 365-381, April.
    20. B. Praag & T. Dijkstra & J. Velzen, 1985. "Least-squares theory based on general distributional assumptions with an application to the incomplete observations problem," Psychometrika, Springer;The Psychometric Society, vol. 50(1), pages 25-36, March.
