IDEAS home Printed from https://ideas.repec.org/a/spr/jglopt/v60y2014i1p79-102.html
   My bibliography  Save this article

Restructuring forward step of MARS algorithm using a new knot selection procedure based on a mapping approach

Author

Listed:
  • Elcin Koc
  • Cem Iyigun

Abstract

In high dimensional data modeling, Multivariate Adaptive Regression Splines (MARS) is a popular nonparametric regression technique used to define the nonlinear relationship between a response variable and the predictors with the help of splines. MARS uses piecewise linear functions for local fit and apply an adaptive procedure to select the number and location of breaking points (called knots). The function estimation is basically generated via a two-stepwise procedure: forward selection and backward elimination. In the first step, a large number of local fits is obtained by selecting large number of knots via a lack-of-fit criteria; and in the latter one, the least contributing local fits or knots are removed. In conventional adaptive spline procedure, knots are selected from a set of all distinct data points that makes the forward selection procedure computationally expensive and leads to high local variance. To avoid this drawback, it is possible to restrict the knot points to a subset of data points. In this context, a new method is proposed for knot selection which bases on a mapping approach like self organizing maps. By this method, less but more representative data points are become eligible to be used as knots for function estimation in forward step of MARS. The proposed method is applied to many simulated and real datasets, and the results show that it proposes a time efficient forward step for the knot selection and model estimation without degrading the model accuracy and prediction performance. Copyright Springer Science+Business Media New York 2014

Suggested Citation

  • Elcin Koc & Cem Iyigun, 2014. "Restructuring forward step of MARS algorithm using a new knot selection procedure based on a mapping approach," Journal of Global Optimization, Springer, vol. 60(1), pages 79-102, September.
  • Handle: RePEc:spr:jglopt:v:60:y:2014:i:1:p:79-102
    DOI: 10.1007/s10898-013-0107-5
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10898-013-0107-5
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10898-013-0107-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Julia Tsai & Victoria Chen & M. Beck & Jining Chen, 2004. "Stochastic Dynamic Programming Formulation for a Wastewater Treatment Decision-Making Framework," Annals of Operations Research, Springer, vol. 132(1), pages 207-221, November.
    2. Victoria C. P. Chen & David Ruppert & Christine A. Shoemaker, 1999. "Applying Experimental Design and Regression Splines to High-Dimensional Continuous-State Stochastic Dynamic Programming," Operations Research, INFORMS, vol. 47(1), pages 38-53, February.
    3. Victoria C. P. Chen & Dirk Günther & Ellis L. Johnson, 2003. "Solving for an optimal airline yield management policy via statistical learning," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 52(1), pages 19-30, January.
    4. Wataru Sakamoto, 2007. "MARS: selecting basis functions and knots with an empirical Bayes method," Computational Statistics, Springer, vol. 22(4), pages 583-597, December.
    5. D. G. T. Denison & B. K. Mallick & A. F. M. Smith, 1998. "Automatic Bayesian curve fitting," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 333-350.
    6. Pilla, Venkata L. & Rosenberger, Jay M. & Chen, Victoria & Engsuwan, Narakorn & Siddappa, Sheela, 2012. "A multivariate adaptive regression splines cutting plane approach for solving a two-stage stochastic programming fleet assignment model," European Journal of Operational Research, Elsevier, vol. 216(1), pages 162-171.
    7. Lee, Tian-Shyug & Chiu, Chih-Chou & Chou, Yu-Chao & Lu, Chi-Jie, 2006. "Mining the customer credit using classification and regression tree and multivariate adaptive regression splines," Computational Statistics & Data Analysis, Elsevier, vol. 50(4), pages 1113-1130, February.
    8. Wong, Chi-ming & Kohn, Robert, 1996. "A Bayesian approach to additive semiparametric regression," Journal of Econometrics, Elsevier, vol. 74(2), pages 209-235, October.
    9. Aldrin, Magne, 2006. "Improved predictions penalizing both slope and curvature in additive models," Computational Statistics & Data Analysis, Elsevier, vol. 50(2), pages 267-284, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. España, Victor J. & Aparicio, Juan & Barber, Xavier & Esteve, Miriam, 2024. "Estimating production functions through additive models based on regression splines," European Journal of Operational Research, Elsevier, vol. 312(2), pages 684-699.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dachuan Shih & Seoung Kim & Victoria Chen & Jay Rosenberger & Venkata Pilla, 2014. "Efficient computer experiment-based optimization through variable selection," Annals of Operations Research, Springer, vol. 216(1), pages 287-305, May.
    2. Pilla, Venkata L. & Rosenberger, Jay M. & Chen, Victoria & Engsuwan, Narakorn & Siddappa, Sheela, 2012. "A multivariate adaptive regression splines cutting plane approach for solving a two-stage stochastic programming fleet assignment model," European Journal of Operational Research, Elsevier, vol. 216(1), pages 162-171.
    3. Elcin Koc & Cem Iyigun & İnci Batmaz & Gerhard-Wilhelm Weber, 2014. "Efficient adaptive regression spline algorithms based on mapping approach with a case study on finance," Journal of Global Optimization, Springer, vol. 60(1), pages 103-120, September.
    4. Ayşe Özmen, 2023. "Sparse regression modeling for short- and long‐term natural gas demand prediction," Annals of Operations Research, Springer, vol. 322(2), pages 921-946, March.
    5. Huiyuan Fan & Prashant K. Tarun & Victoria C. P. Chen & Dachuan T. Shih & Jay M. Rosenberger & Seoung Bum Kim & Robert A. Horton, 2018. "Data-driven optimization for Dallas Fort Worth International Airport deicing activities," Annals of Operations Research, Springer, vol. 263(1), pages 361-384, April.
    6. Zehua Yang & Victoria C. P. Chen & Michael E. Chang & Melanie L. Sattler & Aihong Wen, 2009. "A Decision-Making Framework for Ozone Pollution Control," Operations Research, INFORMS, vol. 57(2), pages 484-498, April.
    7. Bozağaç, Doruk & Batmaz, İnci & Oğuztüzün, Halit, 2016. "Dynamic simulation metamodeling using MARS: A case of radar simulation," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 124(C), pages 69-86.
    8. Ariyajunya, Bancha & Chen, Ying & Chen, Victoria C.P. & Kim, Seoung Bum & Rosenberger, Jay, 2021. "Addressing state space multicollinearity in solving an ozone pollution dynamic control problem," European Journal of Operational Research, Elsevier, vol. 289(2), pages 683-695.
    9. Panagiotelis, Anastasios & Smith, Michael, 2008. "Bayesian identification, selection and estimation of semiparametric functions in high-dimensional additive models," Journal of Econometrics, Elsevier, vol. 143(2), pages 291-316, April.
    10. Jayne Lois San Juan & Carlo James Caligan & Maria Mikayla Garcia & Jericho Mitra & Andres Philip Mayol & Charlle Sy & Aristotle Ubando & Alvin Culaba, 2020. "Multi-Objective Optimization of an Integrated Algal and Sludge-Based Bioenergy Park and Wastewater Treatment System," Sustainability, MDPI, vol. 12(18), pages 1-22, September.
    11. Kuhlenkasper, Torben & Kauermann, Göran, 2010. "Female wage profiles: An additive mixed model approach to employment breaks due to childcare," HWWI Research Papers 2-18, Hamburg Institute of International Economics (HWWI).
    12. Ying Chen & Krystel K. Castillo-Villar & Bing Dong, 2021. "Stochastic control of a micro-grid using battery energy storage in solar-powered buildings," Annals of Operations Research, Springer, vol. 303(1), pages 197-216, August.
    13. Gianluca Frasso & Jonathan Jaeger & Philippe Lambert, 2016. "Parameter estimation and inference in dynamic systems described by linear partial differential equations," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 100(3), pages 259-287, July.
    14. M. P. Wand, 2000. "A Comparison of Regression Spline Smoothing Procedures," Computational Statistics, Springer, vol. 15(4), pages 443-462, December.
    15. Shively, Thomas S. & Kockelman, Kara & Damien, Paul, 2010. "A Bayesian semi-parametric model to estimate relationships between crash counts and roadway characteristics," Transportation Research Part B: Methodological, Elsevier, vol. 44(5), pages 699-715, June.
    16. Boracchi, Patrizia & Biganzoli, Elia & Marubini, Ettore, 2003. "Joint modelling of cause-specific hazard functions with cubic splines: an application to a large series of breast cancer patients," Computational Statistics & Data Analysis, Elsevier, vol. 42(1-2), pages 243-262, February.
    17. Chen, Shiyi & Jeong, Kiho & Härdle, Wolfgang Karl, 2008. "Recurrent support vector regression for a nonlinear ARMA model with applications to forecasting financial returns," SFB 649 Discussion Papers 2008-051, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    18. Ibtissem Baklouti, 2014. "A Psychological Approach To Microfinance Credit Scoring Via A Classification And Regression Tree," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 21(4), pages 193-208, October.
    19. Basna, Rani & Nassar, Hiba & Podgórski, Krzysztof, 2022. "Data driven orthogonal basis selection for functional data analysis," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    20. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jglopt:v:60:y:2014:i:1:p:79-102. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.