IDEAS home Printed from https://ideas.repec.org/a/eee/stapro/v77y2007i3p265-271.html
   My bibliography  Save this article

Splitting variable selection for multivariate regression trees

Author

Listed:
  • Hsiao, Wei-Cheng
  • Shih, Yu-Shan

Abstract

We show that the usual exhaustive search principle adapted for multivariate regression trees has selection bias toward variables with more split points. A selection scheme is proposed to control bias by utilizing hierarchical loglinear model for three-way contingency table of residuals.

Suggested Citation

  • Hsiao, Wei-Cheng & Shih, Yu-Shan, 2007. "Splitting variable selection for multivariate regression trees," Statistics & Probability Letters, Elsevier, vol. 77(3), pages 265-271, February.
  • Handle: RePEc:eee:stapro:v:77:y:2007:i:3:p:265-271
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-7152(06)00232-X
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. David R. Larsen & Paul L. Speckman, 2004. "Multivariate Regression Trees for Analysis of Abundance Data," Biometrics, The International Biometric Society, vol. 60(2), pages 543-549, June.
    2. Keon Lee, Seong, 2005. "On generalized multivariate decision tree by using GEE," Computational Statistics & Data Analysis, Elsevier, vol. 49(4), pages 1105-1119, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yu-Shan Shih & Kuang-Hsun Liu, 2019. "Regression trees for detecting preference patterns from rank data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(3), pages 683-702, September.
    2. Schmid, Lena & Gerharz, Alexander & Groll, Andreas & Pauly, Markus, 2023. "Tree-based ensembles for multi-output regression: Comparing multivariate approaches with separate univariate ones," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    3. Wei-Yin Loh, 2014. "Fifty Years of Classification and Regression Trees," International Statistical Review, International Statistical Institute, vol. 82(3), pages 329-348, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wei-Yin Loh, 2014. "Fifty Years of Classification and Regression Trees," International Statistical Review, International Statistical Institute, vol. 82(3), pages 329-348, December.
    2. Schmid, Lena & Gerharz, Alexander & Groll, Andreas & Pauly, Markus, 2023. "Tree-based ensembles for multi-output regression: Comparing multivariate approaches with separate univariate ones," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    3. Keon Lee, Seong, 2005. "On generalized multivariate decision tree by using GEE," Computational Statistics & Data Analysis, Elsevier, vol. 49(4), pages 1105-1119, June.
    4. Hongliang Liu & Jinpeng Tan & Kyongson Jon & Wensheng Zhu, 2022. "A New Case-Mix Classification Method for Medical Insurance Payment," Mathematics, MDPI, vol. 10(10), pages 1-15, May.
    5. Verniest, Fabien & Greulich, Sabine, 2019. "Methods for assessing the effects of environmental parameters on biological communities in long-term ecological studies - A literature review," Ecological Modelling, Elsevier, vol. 414(C).
    6. Piccarreta, Raffaella, 2010. "Binary trees for dissimilarity data," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1516-1524, June.
    7. Hajjem, Ahlem & Bellavance, François & Larocque, Denis, 2011. "Mixed effects regression trees for clustered data," Statistics & Probability Letters, Elsevier, vol. 81(4), pages 451-459, April.
    8. Kai Yu & William Wheeler & Qizhai Li & Andrew W. Bergen & Neil Caporaso & Nilanjan Chatterjee & Jinbo Chen, 2010. "A Partially Linear Tree-based Regression Model for Multivariate Outcomes," Biometrics, The International Biometric Society, vol. 66(1), pages 89-96, March.
    9. Antonio D’Ambrosio & Willem J. Heiser, 2016. "A Recursive Partitioning Method for the Prediction of Preference Rankings Based Upon Kemeny Distances," Psychometrika, Springer;The Psychometric Society, vol. 81(3), pages 774-794, September.
    10. Evan L. Reynolds & Brian C. Callaghan & Michael Gaies & Mousumi Banerjee, 2023. "Regression Trees and Ensemble for Multivariate Outcomes," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(1), pages 77-109, May.
    11. Dine, Abdessamad & Larocque, Denis & Bellavance, François, 2009. "Multivariate trees for mixed outcomes," Computational Statistics & Data Analysis, Elsevier, vol. 53(11), pages 3795-3804, September.
    12. Peter Calhoun & Richard A. Levine & Juanjuan Fan, 2021. "Repeated measures random forests (RMRF): Identifying factors associated with nocturnal hypoglycemia," Biometrics, The International Biometric Society, vol. 77(1), pages 343-351, March.
    13. Mathlouthi, Walid & Fredette, Marc & Larocque, Denis, 2015. "Regression trees and forests for non-homogeneous Poisson processes," Statistics & Probability Letters, Elsevier, vol. 96(C), pages 204-211.
    14. David I. Warton, 2011. "Regularized Sandwich Estimators for Analysis of High-Dimensional Data Using Generalized Estimating Equations," Biometrics, The International Biometric Society, vol. 67(1), pages 116-123, March.

    More about this item

    Keywords

    Bias Loglinear model Residual;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:stapro:v:77:y:2007:i:3:p:265-271. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.