IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v167y2022ics0167947321002115.html
   My bibliography  Save this article

Uncertainty quantification for honest regression trees

Author

Listed:
  • Wu, Suofei
  • Hannig, Jan
  • Lee, Thomas C.M.

Abstract

A new method is developed for quantifying the uncertainties of the estimates and predictions produced by honest random forests. This new method is based on the generalized fiducial methodology, and provides a fiducial density function that measures how likely each single honest tree is the true model. With such a density function, estimates and predictions, as well as their confidence/prediction intervals, can be obtained. The promising empirical properties of the proposed method are demonstrated by numerical comparisons with several state-of-the-art methods, and by applications to a few real data sets. Lastly, the proposed method is theoretically backed up by an asymptotic guarantee.

Suggested Citation

  • Wu, Suofei & Hannig, Jan & Lee, Thomas C.M., 2022. "Uncertainty quantification for honest regression trees," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).
  • Handle: RePEc:eee:csdana:v:167:y:2022:i:c:s0167947321002115
    DOI: 10.1016/j.csda.2021.107377
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947321002115
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2021.107377?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bradley Efron, 2014. "Estimation and Accuracy After Model Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 991-1007, September.
    2. Randy C. S. Lai & Jan Hannig & Thomas C. M. Lee, 2015. "Generalized Fiducial Inference for Ultrahigh-Dimensional Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 760-772, June.
    3. Jan Hannig & Hari Iyer & Randy C. S. Lai & Thomas C. M. Lee, 2016. "Generalized Fiducial Inference: A Review and New Results," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 1346-1361, July.
    4. Stefan Wager & Susan Athey, 2018. "Estimation and Inference of Heterogeneous Treatment Effects using Random Forests," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
    5. Ryan Martin & Chuanhai Liu, 2015. "Marginal Inferential Models: Prior-Free Probabilistic Inference on Interest Parameters," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1621-1631, December.
    6. Harrison, David Jr. & Rubinfeld, Daniel L., 1978. "Hedonic housing prices and the demand for clean air," Journal of Environmental Economics and Management, Elsevier, vol. 5(1), pages 81-102, March.
    7. Ryan Martin & Chuanhai Liu, 2015. "Conditional inferential models: combining information for prior-free probabilistic inference," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 77(1), pages 195-217, January.
    8. Jan Hannig & Thomas C. M. Lee, 2009. "Generalized fiducial inference for wavelet regression," Biometrika, Biometrika Trust, vol. 96(4), pages 847-860.
    9. Xie, Minge & Singh, Kesar & Strawderman, William E., 2011. "Confidence Distributions and a Unifying Framework for Meta-Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 320-333.
    10. Min-ge Xie & Kesar Singh, 2013. "Confidence Distribution, the Frequentist Distribution Estimator of a Parameter: A Review," International Statistical Review, International Statistical Institute, vol. 81(1), pages 3-39, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Seungyong Hwang & Randy C. S. Lai & Thomas C. M. Lee, 2022. "Generalized Fiducial Inference for Threshold Estimation in Dose–Response and Regression Settings," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 27(1), pages 109-124, March.
    2. Ionut Bebu & George Luta & Thomas Mathew & Brian K. Agan, 2016. "Generalized Confidence Intervals and Fiducial Intervals for Some Epidemiological Measures," IJERPH, MDPI, vol. 13(6), pages 1-13, June.
    3. La Vecchia, Davide & Moor, Alban & Scaillet, Olivier, 2023. "A higher-order correct fast moving-average bootstrap for dependent data," Journal of Econometrics, Elsevier, vol. 235(1), pages 65-81.
    4. Veronese, Piero & Melilli, Eugenio, 2018. "Some asymptotic results for fiducial and confidence distributions," Statistics & Probability Letters, Elsevier, vol. 134(C), pages 98-105.
    5. Piero Veronese & Eugenio Melilli, 2021. "Confidence Distribution for the Ability Parameter of the Rasch Model," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 131-166, March.
    6. Guang Yang & Dungang Liu & Junyuan Wang & Min‐ge Xie, 2016. "Meta‐analysis framework for exact inferences with application to the analysis of rare events," Biometrics, The International Biometric Society, vol. 72(4), pages 1378-1386, December.
    7. Hsin-I Lee & Hungyen Chen & Hirohisa Kishino & Chen-Tuo Liao, 2016. "A Reference Population-Based Conformance Proportion," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 21(4), pages 684-697, December.
    8. Susan Athey & Julie Tibshirani & Stefan Wager, 2016. "Generalized Random Forests," Papers 1610.01271, arXiv.org, revised Apr 2018.
    9. Xuhua Liu & Xingzhong Xu, 2016. "Confidence distribution inferences in one-way random effects model," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 59-74, March.
    10. David R. Bickel, 2014. "Small-scale Inference: Empirical Bayes and Confidence Methods for as Few as a Single Comparison," International Statistical Review, International Statistical Institute, vol. 82(3), pages 457-476, December.
    11. Lihua Lei & Emmanuel J. Candès, 2021. "Conformal inference of counterfactuals and individual treatment effects," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(5), pages 911-938, November.
    12. Hector, Emily C. & Luo, Lan & Song, Peter X.-K., 2023. "Parallel-and-stream accelerator for computationally fast supervised learning," Computational Statistics & Data Analysis, Elsevier, vol. 177(C).
    13. Tang, Lu & Zhou, Ling & Song, Peter X.-K., 2020. "Distributed simultaneous inference in generalized linear models via confidence distribution," Journal of Multivariate Analysis, Elsevier, vol. 176(C).
    14. Lai Xinglin, 2021. "Modelling hetegeneous treatment effects by quantitle local polynomial decision tree and forest," Papers 2111.15320, arXiv.org, revised Mar 2022.
    15. Piero Veronese & Eugenio Melilli, 2015. "Fiducial and Confidence Distributions for Real Exponential Families," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 471-484, June.
    16. Xiaokang Luo & Tirthankar Dasgupta & Minge Xie & Regina Y. Liu, 2021. "Leveraging the Fisher randomization test using confidence distributions: Inference, combination and fusion learning," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(4), pages 777-797, September.
    17. Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
    18. Yang Liu & Jan Hannig, 2017. "Generalized Fiducial Inference for Logistic Graded Response Models," Psychometrika, Springer;The Psychometric Society, vol. 82(4), pages 1097-1125, December.
    19. Nancy Reid & David R. Cox, 2015. "On Some Principles of Statistical Inference," International Statistical Review, International Statistical Institute, vol. 83(2), pages 293-308, August.
    20. Randy C. S. Lai & Jan Hannig & Thomas C. M. Lee, 2015. "Generalized Fiducial Inference for Ultrahigh-Dimensional Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 760-772, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:167:y:2022:i:c:s0167947321002115. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.