IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v33y2021i3p1103-1119.html
   My bibliography  Save this article

Mixed-Integer Convex Nonlinear Optimization with Gradient-Boosted Trees Embedded

Author

Listed:
  • Miten Mistry

    (Department of Computing, Imperial College London, South Kensington SW7 2AZ, United Kingdom)

  • Dimitrios Letsios

    (Department of Computing, Imperial College London, South Kensington SW7 2AZ, United Kingdom)

  • Gerhard Krennrich

    (BASF SE, Ludwigshafen am Rhein, Germany)

  • Robert M. Lee

    (BASF SE, Ludwigshafen am Rhein, Germany)

  • Ruth Misener

    (Department of Computing, Imperial College London, South Kensington SW7 2AZ, United Kingdom)

Abstract

Decision trees usefully represent sparse, high-dimensional, and noisy data. Having learned a function from these data, we may want to thereafter integrate the function into a larger decision-making problem, for example, for picking the best chemical process catalyst. We study a large-scale, industrially relevant mixed-integer nonlinear nonconvex optimization problem involving both gradient-boosted trees and penalty functions mitigating risk. This mixed-integer optimization problem with convex penalty terms broadly applies to optimizing pretrained regression tree models. Decision makers may wish to optimize discrete models to repurpose legacy predictive models or they may wish to optimize a discrete model that accurately represents a data set. We develop several heuristic methods to find feasible solutions and an exact branch-and-bound algorithm leveraging structural properties of the gradient-boosted trees and penalty functions. We computationally test our methods on a concrete mixture design instance and a chemical catalysis industrial instance.

Suggested Citation

  • Miten Mistry & Dimitrios Letsios & Gerhard Krennrich & Robert M. Lee & Ruth Misener, 2021. "Mixed-Integer Convex Nonlinear Optimization with Gradient-Boosted Trees Embedded," INFORMS Journal on Computing, INFORMS, vol. 33(3), pages 1103-1119, July.
  • Handle: RePEc:inm:orijoc:v:33:y:2021:i:3:p:1103-1119
    DOI: 10.1287/ijoc.2020.0993
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijoc.2020.0993
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2020.0993?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Rohit Kannan & Paul I. Barton, 2017. "The cluster problem in constrained global optimization," Journal of Global Optimization, Springer, vol. 69(3), pages 629-676, November.
    2. Kuhn, Max, 2008. "Building Predictive Models in R Using the caret Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 28(i05).
    3. Achim Wechsung & Spencer Schaber & Paul Barton, 2014. "The cluster problem revisited," Journal of Global Optimization, Springer, vol. 58(3), pages 429-438, March.
    4. P. Bonami & M. A. Lejeune, 2009. "An Exact Solution Approach for Portfolio Optimization Problems Under Stochastic and Integer Constraints," Operations Research, INFORMS, vol. 57(3), pages 650-670, June.
    5. Juan Pablo Vielma & Shabbir Ahmed & George Nemhauser, 2010. "Mixed-Integer Models for Nonseparable Piecewise-Linear Optimization: Unifying Framework and Extensions," Operations Research, INFORMS, vol. 58(2), pages 303-315, April.
    6. Friedman, Jerome H., 2002. "Stochastic gradient boosting," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 367-378, February.
    7. Pierre Bonami & Miguel A. Lejeune, 2009. "An Exact Solution Approach for Integer Constrained Portfolio Optimization Problems Under Stochastic Constraints," Post-Print hal-00421756, HAL.
    8. R. Misener & C. A. Floudas, 2010. "Piecewise-Linear Approximations of Multidimensional Functions," Journal of Optimization Theory and Applications, Springer, vol. 145(1), pages 120-147, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Keliang Wang & Leonardo Lozano & Carlos Cardonha & David Bergman, 2023. "Optimizing over an Ensemble of Trained Neural Networks," INFORMS Journal on Computing, INFORMS, vol. 35(3), pages 652-674, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Codas, Andrés & Camponogara, Eduardo, 2012. "Mixed-integer linear optimization for optimal lift-gas allocation with well-separator routing," European Journal of Operational Research, Elsevier, vol. 217(1), pages 222-231.
    2. Miguel A. Lejeune & François Margot, 2016. "Solving Chance-Constrained Optimization Problems with Stochastic Quadratic Inequalities," Operations Research, INFORMS, vol. 64(4), pages 939-957, August.
    3. Kamesh Korangi & Christophe Mues & Cristi'an Bravo, 2024. "Large-scale Time-Varying Portfolio Optimisation using Graph Attention Networks," Papers 2407.15532, arXiv.org.
    4. Dimitris Bertsimas & Ryan Cory-Wright, 2022. "A Scalable Algorithm for Sparse Portfolio Selection," INFORMS Journal on Computing, INFORMS, vol. 34(3), pages 1489-1511, May.
    5. Ran Ji & Miguel A. Lejeune, 2021. "Data-Driven Optimization of Reward-Risk Ratio Measures," INFORMS Journal on Computing, INFORMS, vol. 33(3), pages 1120-1137, July.
    6. Murray, Chase C. & Talukdar, Debabrata & Gosavi, Abhijit, 2010. "Joint Optimization of Product Price, Display Orientation and Shelf-Space Allocation in Retail Category Management," Journal of Retailing, Elsevier, vol. 86(2), pages 125-136.
    7. Yagli, Gokhan Mert & Yang, Dazhi & Srinivasan, Dipti, 2019. "Automatic hourly solar forecasting using machine learning models," Renewable and Sustainable Energy Reviews, Elsevier, vol. 105(C), pages 487-498.
    8. Distaso, Walter & Roccazzella, Francesco & Vrins, Frédéric, 2023. "Business cycle and realized losses in the consumer credit industry," LIDAM Discussion Papers LFIN 2023007, Université catholique de Louvain, Louvain Finance (LFIN).
    9. Zhi-Hai Zhang & Kang Li, 2015. "A novel probabilistic formulation for locating and sizing emergency medical service stations," Annals of Operations Research, Springer, vol. 229(1), pages 813-835, June.
    10. Martin Branda & Max Bucher & Michal Červinka & Alexandra Schwartz, 2018. "Convergence of a Scholtes-type regularization method for cardinality-constrained optimization problems with an application in sparse robust portfolio optimization," Computational Optimization and Applications, Springer, vol. 70(2), pages 503-530, June.
    11. Cristiano Arbex Valle, 2024. "Portfolio optimisation: bridging the gap between theory and practice," Papers 2407.00887, arXiv.org, revised Sep 2024.
    12. Zhang, Zhi-Hai & Unnikrishnan, Avinash, 2016. "A coordinated location-inventory problem in closed-loop supply chain," Transportation Research Part B: Methodological, Elsevier, vol. 89(C), pages 127-148.
    13. Eduardo Bered Fernandes Vieira & Tiago Pascoal Filomena, 2020. "Liquidity Constraints for Portfolio Selection Based on Financial Volume," Computational Economics, Springer;Society for Computational Economics, vol. 56(4), pages 1055-1077, December.
    14. Massol, Olivier & Banal-Estañol, Albert, 2014. "Export diversification through resource-based industrialization: The case of natural gas," European Journal of Operational Research, Elsevier, vol. 237(3), pages 1067-1082.
    15. Moarefdoost, M. Mohsen & Lamadrid, Alberto J. & Zuluaga, Luis F., 2016. "A robust model for the ramp-constrained economic dispatch problem with uncertain renewable energy," Energy Economics, Elsevier, vol. 56(C), pages 310-325.
    16. Liu, Kanglin & Zhang, Zhi-Hai, 2018. "Capacitated disassembly scheduling under stochastic yield and demand," European Journal of Operational Research, Elsevier, vol. 269(1), pages 244-257.
    17. Chien-Ming Chen & Joe Zhu, 2011. "Efficient Resource Allocation via Efficiency Bootstraps: An Application to R&D Project Budgeting," Operations Research, INFORMS, vol. 59(3), pages 729-741, June.
    18. Xiaojin Zheng & Xiaoling Sun & Duan Li & Jie Sun, 2014. "Successive convex approximations to cardinality-constrained convex programs: a piecewise-linear DC approach," Computational Optimization and Applications, Springer, vol. 59(1), pages 379-397, October.
    19. Philipp Baumann & Norbert Trautmann, 2013. "Portfolio-optimization models for small investors," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 77(3), pages 345-356, June.
    20. Tiago P. Filomena & Miguel A. Lejeune, 2014. "Warm-Start Heuristic for Stochastic Portfolio Optimization with Fixed and Proportional Transaction Costs," Journal of Optimization Theory and Applications, Springer, vol. 161(1), pages 308-329, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:33:y:2021:i:3:p:1103-1119. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.