IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v97y2006i8p1799-1814.html
   My bibliography  Save this article

Prediction of Euclidean distances with discrete and continuous outcomes

Author

Listed:
  • Mortier, F.
  • Robin, S.
  • Lassalvy, S.
  • Baril, C.P.
  • Bar-Hen, A.

Abstract

The objective of this paper is first to predict generalized Euclidean distances in the context of discrete and quantitative variables and then to derive their statistical properties. We first consider the simultaneous modelling of discrete and continuous random variables with covariates and obtain the likelihood. We derive an important property useful for its practical maximization. We then study the prediction of any Euclidean distances and its statistical proprieties, especially for the Mahalanobis distance. The quality of distance estimation is analyzed through simulations. This results are applied to our motivating example: the official distinction procedure of rapeseed varieties.

Suggested Citation

  • Mortier, F. & Robin, S. & Lassalvy, S. & Baril, C.P. & Bar-Hen, A., 2006. "Prediction of Euclidean distances with discrete and continuous outcomes," Journal of Multivariate Analysis, Elsevier, vol. 97(8), pages 1799-1814, September.
  • Handle: RePEc:eee:jmvana:v:97:y:2006:i:8:p:1799-1814
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047-259X(05)00102-8
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Barhen, A. & Daudin, J. J., 1995. "Generalization of the Mahalanobis Distance in the Mixed Case," Journal of Multivariate Analysis, Elsevier, vol. 53(2), pages 332-342, May.
    2. Wai-Yin Poon & Sik-Yum Lee, 1987. "Maximum likelihood estimation of multivariate polyserial and polychoric correlation coefficients," Psychometrika, Springer;The Psychometric Society, vol. 52(3), pages 409-430, September.
    3. G. Nuel & S. Robin & C. P. Baril, 2001. "Predicting distances using a linear model: The case of varietal distinctness," Journal of Applied Statistics, Taylor & Francis Journals, vol. 28(5), pages 607-621.
    4. Edward J. Bedrick & Jodi Lapidus & Joseph F. Powell, 2000. "Estimating the Mahalanobis Distance from Mixed Continuous and Discrete Data," Biometrics, The International Biometric Society, vol. 56(2), pages 394-401, June.
    5. de Leon, A. R. & Carrière, K. C., 2005. "A generalized Mahalanobis distance for mixed data," Journal of Multivariate Analysis, Elsevier, vol. 92(1), pages 174-185, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Fernando Almeida & Nelson Amoedo, 2021. "Exploring the association between R&D expenditure and the job quality in the European Union," Papers 2101.03214, arXiv.org.
    2. Chaubert, F. & Mortier, F. & Saint André, L., 2008. "Multivariate dynamic model for ordinal outcomes," Journal of Multivariate Analysis, Elsevier, vol. 99(8), pages 1717-1732, September.
    3. Pierrette Chagneau & Frédéric Mortier & Nicolas Picard & Jean-Noël Bacro, 2011. "A Hierarchical Bayesian Model for Spatial Prediction of Multivariate Non-Gaussian Random Fields," Biometrics, The International Biometric Society, vol. 67(1), pages 97-105, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alban Mbina Mbina & Guy Martial Nkiet & Fulgence Eyi Obiang, 2019. "Variable selection in discriminant analysis for mixed continuous-binary variables and several groups," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(3), pages 773-795, September.
    2. Cheng, Tsung-Chi & Biswas, Atanu, 2008. "Maximum trimmed likelihood estimator for multivariate mixed continuous and categorical data," Computational Statistics & Data Analysis, Elsevier, vol. 52(4), pages 2042-2065, January.
    3. A. R. de Leon & A. Soo & T. Williamson, 2011. "Classification with discrete and continuous variables via general mixed-data models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 38(5), pages 1021-1032, February.
    4. Merbouha, A. & Mkhadri, A., 2004. "Regularization of the location model in discrimination with mixed discrete and continuous variables," Computational Statistics & Data Analysis, Elsevier, vol. 45(3), pages 563-576, April.
    5. de Leon, A. R. & Carrière, K. C., 2005. "A generalized Mahalanobis distance for mixed data," Journal of Multivariate Analysis, Elsevier, vol. 92(1), pages 174-185, January.
    6. de Leon, A.R., 2005. "Pairwise likelihood approach to grouped continuous model and its extension," Statistics & Probability Letters, Elsevier, vol. 75(1), pages 49-57, November.
    7. Tang, John P., 2015. "Pollution havens and the trade in toxic chemicals: Evidence from U.S. trade flows," Ecological Economics, Elsevier, vol. 112(C), pages 150-160.
    8. Leila Amiri & Mojtaba Khazaei & Mojtaba Ganjali, 2018. "A mixture latent variable model for modeling mixed data in heterogeneous populations and its applications," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(1), pages 95-115, January.
    9. Poon, Wai-Yin & Hung, Hin-Yan, 1996. "Analysis of square tables with ordered categories," Computational Statistics & Data Analysis, Elsevier, vol. 22(3), pages 303-322, July.
    10. Wai Chan & Peter Bentler, 1998. "Covariance structure analysis of ordinal ipsative data," Psychometrika, Springer;The Psychometric Society, vol. 63(4), pages 369-399, December.
    11. Steffen Fieuws & Geert Verbeke, 2006. "Pairwise Fitting of Mixed Models for the Joint Modeling of Multivariate Longitudinal Profiles," Biometrics, The International Biometric Society, vol. 62(2), pages 424-431, June.
    12. Florian Schuberth & Jörg Henseler & Theo K. Dijkstra, 2018. "Partial least squares path modeling using ordinal categorical indicators," Quality & Quantity: International Journal of Methodology, Springer, vol. 52(1), pages 9-35, January.
    13. Katsikatsou, Myrsini & Moustaki, Irini & Md Jamil, Haziq, 2022. "Pairwise likelihood estimation for confirmatory factor analysis models with categorical variables and data that are missing at random," LSE Research Online Documents on Economics 108933, London School of Economics and Political Science, LSE Library.
    14. Li, Zhengtao & Folmer, Henk & Xue, Jianhong, 2014. "To what extent does air pollution affect happiness? The case of the Jinchuan mining area, China," Ecological Economics, Elsevier, vol. 99(C), pages 88-99.
    15. Sik-Yum Lee & Wai-Yin Poon & P. Bentler, 1989. "Simultaneous analysis of multivariate polytomous variates in several groups," Psychometrika, Springer;The Psychometric Society, vol. 54(1), pages 63-73, March.
    16. Pierrette Chagneau & Frédéric Mortier & Nicolas Picard & Jean-Noël Bacro, 2011. "A Hierarchical Bayesian Model for Spatial Prediction of Multivariate Non-Gaussian Random Fields," Biometrics, The International Biometric Society, vol. 67(1), pages 97-105, March.
    17. Leila Amiri & Mojtaba Khazaei & Mojtaba Ganjali, 2017. "General location model with factor analyzer covariance matrix structure and its applications," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(3), pages 593-609, September.
    18. Piotr Tarka, 2018. "An overview of structural equation modeling: its beginnings, historical development, usefulness and controversies in the social sciences," Quality & Quantity: International Journal of Methodology, Springer, vol. 52(1), pages 313-354, January.
    19. Hao Bai & Yuan Zhong & Xin Gao & Wei Xu, 2020. "Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method," Stats, MDPI, vol. 3(3), pages 1-18, July.
    20. Michael Schroder & Robert Dornau, 2002. "Do forecasters use monetary models? an empirical analysis of exchange rate expectations," Applied Financial Economics, Taylor & Francis Journals, vol. 12(8), pages 535-543.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:97:y:2006:i:8:p:1799-1814. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.