IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i20p3190-d1496926.html
   My bibliography  Save this article

Smooth Sigmoid Surrogate (SSS): An Alternative to Greedy Search in Decision Trees

Author

Listed:
  • Xiaogang Su

    (Department of Mathematical Science, University of Texas at El Paso, El Paso, TX 79968, USA)

  • George Ekow Quaye

    (Division of Health Services and Outcomes Research, Children’s Mercy Kansas City, Kansas City, MO 64108, USA)

  • Yishu Wei

    (Reddit Inc., San Francisco, CA 94102, USA)

  • Joseph Kang

    (US Census Bureau, Washington, DC 20233, USA)

  • Lei Liu

    (Division of Biostatistics, Washington University in St. Louis, St. Louis, MO 63110, USA)

  • Qiong Yang

    (Department of Biostatistics, School of Public Health, Boston University, Boston, MA 02118, USA)

  • Juanjuan Fan

    (Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA)

  • Richard A. Levine

    (Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA)

Abstract

Greedy search (GS) or exhaustive search plays a crucial role in decision trees and their various extensions. We introduce an alternative splitting method called smooth sigmoid surrogate (SSS) in which the indicator threshold function used in GS is approximated by a smooth sigmoid function. This approach allows for parametric smoothing or regularization of the erratic and discrete GS process, making it more effective in identifying the true cutoff point, particularly in the presence of weak signals, as well as less prone to the inherent end-cut preference problem. Additionally, SSS provides a convenient means of evaluating the best split by referencing a parametric nonlinear model. Moreover, in many variants of recursive partitioning, SSS can be reformulated as a one-dimensional smooth optimization problem, rendering it computationally more efficient than GS. Extensive simulation studies and real data examples are provided to evaluate and demonstrate its effectiveness.

Suggested Citation

  • Xiaogang Su & George Ekow Quaye & Yishu Wei & Joseph Kang & Lei Liu & Qiong Yang & Juanjuan Fan & Richard A. Levine, 2024. "Smooth Sigmoid Surrogate (SSS): An Alternative to Greedy Search in Decision Trees," Mathematics, MDPI, vol. 12(20), pages 1-28, October.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:20:p:3190-:d:1496926
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/20/3190/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/20/3190/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Torsten Hothorn & Achim Zeileis, 2008. "Generalized Maximally Selected Statistics," Biometrics, The International Biometric Society, vol. 64(4), pages 1263-1269, December.
    2. Jushan Bai, 1997. "Estimation Of A Change Point In Multiple Regression Models," The Review of Economics and Statistics, MIT Press, vol. 79(4), pages 551-563, November.
    3. Hothorn, Torsten & Lausen, Berthold, 2003. "On the exact distribution of maximally selected rank statistics," Computational Statistics & Data Analysis, Elsevier, vol. 43(2), pages 121-137, June.
    4. Shih, Yu-Shan & Tsai, Hsin-Wen, 2004. "Variable selection bias in regression trees with constant fits," Computational Statistics & Data Analysis, Elsevier, vol. 45(3), pages 595-607, April.
    5. Xiaogang Su & Karen Meneses & Patrick McNees & Wesley O. Johnson, 2011. "Interaction trees: exploring the differential effects of an intervention programme for breast cancer survivors," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 60(3), pages 457-474, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gerhard Tutz & Moritz Berger, 2016. "Item-focussed Trees for the Identification of Items in Differential Item Functioning," Psychometrika, Springer;The Psychometric Society, vol. 81(3), pages 727-750, September.
    2. Achim Zeileis & Torsten Hothorn, 2013. "A toolbox of permutation tests for structural change," Statistical Papers, Springer, vol. 54(4), pages 931-954, November.
    3. Carolin Strobl & Julia Kopf & Achim Zeileis, 2015. "Rasch Trees: A New Method for Detecting Differential Item Functioning in the Rasch Model," Psychometrika, Springer;The Psychometric Society, vol. 80(2), pages 289-316, June.
    4. Strobl, Carolin & Boulesteix, Anne-Laure & Augustin, Thomas, 2007. "Unbiased split selection for classification trees based on the Gini Index," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 483-501, September.
    5. Benati, Luca, 2007. "Drift and breaks in labor productivity," Journal of Economic Dynamics and Control, Elsevier, vol. 31(8), pages 2847-2877, August.
    6. Dickinson, David & Liu, Jia, 2007. "The real effects of monetary policy in China: An empirical analysis," China Economic Review, Elsevier, vol. 18(1), pages 87-111.
    7. Umar, Muhammad & Su, Chi-Wei & Rizvi, Syed Kumail Abbas & Lobonţ, Oana-Ramona, 2021. "Driven by fundamentals or exploded by emotions: Detecting bubbles in oil prices," Energy, Elsevier, vol. 231(C).
    8. Bill Russell & Dooruj Rambaccussing, 2019. "Breaks and the statistical process of inflation: the case of estimating the ‘modern’ long-run Phillips curve," Empirical Economics, Springer, vol. 56(5), pages 1455-1475, May.
    9. Felix Pretis & Michael Mann & Robert Kaufmann, 2015. "Testing competing models of the temperature hiatus: assessing the effects of conditioning variables and temporal uncertainties through sample-wide break detection," Climatic Change, Springer, vol. 131(4), pages 705-718, August.
    10. Koo, Bonsoo & Seo, Myung Hwan, 2015. "Structural-break models under mis-specification: Implications for forecasting," Journal of Econometrics, Elsevier, vol. 188(1), pages 166-181.
    11. Qiu, Zhiping & Peng, Limin & Manatunga, Amita & Guo, Ying, 2019. "A smooth nonparametric approach to determining cut-points of a continuous scale," Computational Statistics & Data Analysis, Elsevier, vol. 134(C), pages 186-210.
    12. Yuan Xu & Huading Shi & Yang Fei & Chao Wang & Li Mo & Mi Shu, 2021. "Identification of Soil Heavy Metal Sources in a Large-Scale Area Affected by Industry," Sustainability, MDPI, vol. 13(2), pages 1-18, January.
    13. Martin T. Bohl & Alexander Pütz & Pierre L. Siklos & Christoph Sulewski, 2018. "Information Transmission under Increasing Political Tension – Evidence for the Berlin Produce Exchange 1887-1896," CQE Working Papers 7618, Center for Quantitative Economics (CQE), University of Muenster.
    14. Graham Elliott & Allan Timmermann, 2016. "Economic Forecasting," Economics Books, Princeton University Press, edition 1, number 10740.
    15. Ronald Bewley & Minxian Yang, 2006. "A hybrid forecasting approach for piece-wise stationary time series," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 25(7), pages 513-527.
    16. Dakpogan, Arnaud & Smit, Eon, 2018. "The effect of electricity losses on GDP in Benin," MPRA Paper 89545, University Library of Munich, Germany.
    17. Martin Larch & João Nogueira Martins, 2007. "Fiscal indicators - Proceedings of the the Directorate-General for Economic and Financial Affairs Workshop held on 22 September 2006 in Brussels," European Economy - Economic Papers 2008 - 2015 297, Directorate General Economic and Financial Affairs (DG ECFIN), European Commission.
    18. Du, Xiaodong & Yu, Cindy L. & Hayes, Dermot J., 2011. "Speculation and volatility spillover in the crude oil and agricultural commodity markets: A Bayesian analysis," Energy Economics, Elsevier, vol. 33(3), pages 497-503, May.
    19. Afonso, Antonio & Claeys, Peter, 2008. "The dynamic behaviour of budget components and output," Economic Modelling, Elsevier, vol. 25(1), pages 93-117, January.
    20. Marotta, Giuseppe, 2009. "Structural breaks in the lending interest rate pass-through and the euro," Economic Modelling, Elsevier, vol. 26(1), pages 191-205, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:20:p:3190-:d:1496926. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.