IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v315y2024i2p703-714.html
   My bibliography  Save this article

Bilevel optimization for feature selection in the data-driven newsvendor problem

Author

Listed:
  • Serrano, Breno
  • Minner, Stefan
  • Schiffer, Maximilian
  • Vidal, Thibaut

Abstract

We study the feature-based newsvendor problem, in which a decision-maker has access to historical data consisting of demand observations and exogenous features. In this setting, we investigate feature selection, aiming to derive sparse, explainable models with improved out-of-sample performance. Up to now, state-of-the-art methods utilize regularization, which penalizes the number of selected features or the norm of the solution vector. As an alternative, we introduce a novel bilevel programming formulation. The upper-level problem selects a subset of features that minimizes an estimate of the out-of-sample cost of ordering decisions based on a held-out validation set. The lower-level problem learns the optimal coefficients of the decision function on a training set, using only the features selected by the upper-level. We present a mixed integer linear program reformulation for the bilevel program, which can be solved to optimality with standard optimization solvers. Our computational experiments show that the method accurately recovers ground-truth features already for instances with a sample size of a few hundred observations. In contrast, regularization-based techniques often fail at feature recovery or require thousands of observations to obtain similar accuracy. Regarding out-of-sample generalization, we achieve improved or comparable cost performance.

Suggested Citation

  • Serrano, Breno & Minner, Stefan & Schiffer, Maximilian & Vidal, Thibaut, 2024. "Bilevel optimization for feature selection in the data-driven newsvendor problem," European Journal of Operational Research, Elsevier, vol. 315(2), pages 703-714.
  • Handle: RePEc:eee:ejores:v:315:y:2024:i:2:p:703-714
    DOI: 10.1016/j.ejor.2024.01.025
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221724000432
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2024.01.025?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Nicholas C. Petruzzi & Maqbool Dada, 1999. "Pricing and the Newsvendor Problem: A Review with Extensions," Operations Research, INFORMS, vol. 47(2), pages 183-194, April.
    2. Miyashiro, Ryuhei & Takano, Yuichi, 2015. "Mixed integer second-order cone programming formulations for variable selection in linear regression," European Journal of Operational Research, Elsevier, vol. 247(3), pages 721-731.
    3. Gah-Yi Ban & Cynthia Rudin, 2019. "The Big Data Newsvendor: Practical Insights from Machine Learning," Operations Research, INFORMS, vol. 67(1), pages 90-108, January.
    4. Gah-Yi Ban, 2020. "Confidence Intervals for Data-Driven Inventory Policies with Demand Censoring," Operations Research, INFORMS, vol. 68(2), pages 309-326, March.
    5. Kogan, Konstantin & Lou, Sheldon, 2003. "Multi-stage newsboy problem: A dynamic model," European Journal of Operational Research, Elsevier, vol. 149(2), pages 448-458, September.
    6. Khouja, Moutaz, 1999. "The single-period (news-vendor) problem: literature review and suggestions for future research," Omega, Elsevier, vol. 27(5), pages 537-553, October.
    7. Aharon Ben-Tal & Dick den Hertog & Anja De Waegenaere & Bertrand Melenberg & Gijs Rennen, 2013. "Robust Solutions of Optimization Problems Affected by Uncertain Probabilities," Management Science, INFORMS, vol. 59(2), pages 341-357, April.
    8. Lau, Hon-Shiang & Hing-Ling Lau, Amy, 1996. "The newsstand problem: A capacitated multiple-product single-period inventory problem," European Journal of Operational Research, Elsevier, vol. 94(1), pages 29-42, October.
    9. Yuichi Takano & Ryuhei Miyashiro, 2020. "Best subset selection via cross-validation criterion," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 475-488, July.
    10. Cao, Dong & Chen, Mingyuan, 2006. "Capacitated plant selection in a decentralized manufacturing environment: A bilevel optimization approach," European Journal of Operational Research, Elsevier, vol. 169(1), pages 97-110, February.
    11. Retsef Levi & Georgia Perakis & Joline Uichanco, 2015. "The Data-Driven Newsvendor Problem: New Bounds and Insights," Operations Research, INFORMS, vol. 63(6), pages 1294-1306, December.
    12. Dimitris Bertsimas & Nathan Kallus, 2020. "From Predictive to Prescriptive Analytics," Management Science, INFORMS, vol. 66(3), pages 1025-1044, March.
    13. Omar Besbes & Alp Muharremoglu, 2013. "On Implications of Demand Censoring in the Newsvendor Problem," Management Science, INFORMS, vol. 59(6), pages 1407-1424, June.
    14. Huber, Jakob & Müller, Sebastian & Fleischmann, Moritz & Stuckenschmidt, Heiner, 2019. "A data-driven newsvendor problem: From data to decision," European Journal of Operational Research, Elsevier, vol. 278(3), pages 904-915.
    15. Tian, Yu-Xin & Zhang, Chuan, 2023. "An end-to-end deep learning model for solving data-driven newsvendor problem with accessibility to textual review data," International Journal of Production Economics, Elsevier, vol. 265(C).
    16. Dimitris Bertsimas & Aurélie Thiele, 2006. "A Robust Optimization Approach to Inventory Theory," Operations Research, INFORMS, vol. 54(1), pages 150-168, February.
    17. Wang Chi Cheung & David Simchi-Levi, 2019. "Sampling-Based Approximation Schemes for Capacitated Stochastic Inventory Control Models," Mathematics of Operations Research, INFORMS, vol. 44(2), pages 668-692, May.
    18. Retsef Levi & Robin O. Roundy & David B. Shmoys, 2007. "Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models," Mathematics of Operations Research, INFORMS, vol. 32(4), pages 821-839, November.
    19. Sachs, Anna-Lena & Minner, Stefan, 2014. "The data-driven newsvendor with censored demand observations," International Journal of Production Economics, Elsevier, vol. 149(C), pages 28-36.
    20. Beutel, Anna-Lena & Minner, Stefan, 2012. "Safety stock planning under causal demand forecasting," International Journal of Production Economics, Elsevier, vol. 140(2), pages 637-645.
    21. Fontaine, Pirmin & Minner, Stefan, 2014. "Benders Decomposition for Discrete–Continuous Linear Bilevel Problems with application to traffic network design," Transportation Research Part B: Methodological, Elsevier, vol. 70(C), pages 163-172.
    22. Georgia Perakis & Guillaume Roels, 2008. "Regret in the Newsvendor Model with Partial Information," Operations Research, INFORMS, vol. 56(1), pages 188-203, February.
    23. Jinfeng Yue & Bintong Chen & Min-Chiang Wang, 2006. "Expected Value of Distribution Information for the Newsvendor Problem," Operations Research, INFORMS, vol. 54(6), pages 1128-1136, December.
    24. Afshin Oroojlooyjadid & Lawrence V. Snyder & Martin Takáč, 2020. "Applying deep learning to the newsvendor problem," IISE Transactions, Taylor & Francis Journals, vol. 52(4), pages 444-463, April.
    25. Andrés Gómez & Oleg A. Prokopyev, 2021. "A Mixed-Integer Fractional Optimization Approach to Best Subset Selection," INFORMS Journal on Computing, INFORMS, vol. 33(2), pages 551-565, May.
    26. Qin, Yan & Wang, Ruoxuan & Vakharia, Asoo J. & Chen, Yuwen & Seref, Michelle M.H., 2011. "The newsvendor problem: Review and directions for future research," European Journal of Operational Research, Elsevier, vol. 213(2), pages 361-374, September.
    27. Young Woong Park & Diego Klabjan, 2020. "Subset selection for multiple linear regression via optimization," Journal of Global Optimization, Springer, vol. 77(3), pages 543-574, July.
    28. Wang, Charles X. & Webster, Scott, 2009. "The loss-averse newsvendor problem," Omega, Elsevier, vol. 37(1), pages 93-105, February.
    29. Xin Chen & Melvyn Sim & David Simchi-Levi & Peng Sun, 2007. "Risk Aversion in Inventory Management," Operations Research, INFORMS, vol. 55(5), pages 828-842, October.
    30. Chuen-Teck See & Melvyn Sim, 2010. "Robust Approximation to Multiperiod Inventory Management," Operations Research, INFORMS, vol. 58(3), pages 583-594, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yujie Ma & Xueer Chen & Shuang Ma, 2024. "Optimal Sustainable Manufacturing for Product Family Architecture in Intelligent Manufacturing: A Hierarchical Joint Optimization Approach," Sustainability, MDPI, vol. 16(7), pages 1-28, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Satya S. Malladi & Alan L. Erera & Chelsea C. White, 2023. "Inventory control with modulated demand and a partially observed modulation process," Annals of Operations Research, Springer, vol. 321(1), pages 343-369, February.
    2. Erkip, Nesim Kohen, 2023. "Can accessing much data reshape the theory? Inventory theory under the challenge of data-driven systems," European Journal of Operational Research, Elsevier, vol. 308(3), pages 949-959.
    3. Huber, Jakob & Müller, Sebastian & Fleischmann, Moritz & Stuckenschmidt, Heiner, 2019. "A data-driven newsvendor problem: From data to decision," European Journal of Operational Research, Elsevier, vol. 278(3), pages 904-915.
    4. Yang, Cheng-Hu & Wang, Hai-Tang & Ma, Xin & Talluri, Srinivas, 2023. "A data-driven newsvendor problem: A high-dimensional and mixed-frequency method," International Journal of Production Economics, Elsevier, vol. 266(C).
    5. Thais de Castro Moraes & Jiancheng Qin & Xue-Ming Yuan & Ek Peng Chew, 2023. "Evolving Hybrid Deep Neural Network Models for End-to-End Inventory Ordering Decisions," Logistics, MDPI, vol. 7(4), pages 1-18, November.
    6. Liu, Congzheng & Letchford, Adam N. & Svetunkov, Ivan, 2022. "Newsvendor problems: An integrated method for estimation and optimisation," European Journal of Operational Research, Elsevier, vol. 300(2), pages 590-601.
    7. Georgia Perakis & Melvyn Sim & Qinshen Tang & Peng Xiong, 2023. "Robust Pricing and Production with Information Partitioning and Adaptation," Management Science, INFORMS, vol. 69(3), pages 1398-1419, March.
    8. Xin, Linwei & Goldberg, David A., 2021. "Time (in)consistency of multistage distributionally robust inventory models with moment constraints," European Journal of Operational Research, Elsevier, vol. 289(3), pages 1127-1141.
    9. Pirayesh Neghab, Davood & Khayyati, Siamak & Karaesmen, Fikri, 2022. "An integrated data-driven method using deep learning for a newsvendor problem with unobservable features," European Journal of Operational Research, Elsevier, vol. 302(2), pages 482-496.
    10. Qiu, Ruozhen & Shang, Jennifer & Huang, Xiaoyuan, 2014. "Robust inventory decision under distribution uncertainty: A CVaR-based optimization approach," International Journal of Production Economics, Elsevier, vol. 153(C), pages 13-23.
    11. Bai, Qingguo & Xu, Jianteng & Gong, Yeming & Chauhan, Satyaveer S., 2022. "Robust decisions for regulated sustainable manufacturing with partial demand information: Mandatory emission capacity versus emission tax," European Journal of Operational Research, Elsevier, vol. 298(3), pages 874-893.
    12. Mengshi Lu & Zuo‐Jun Max Shen, 2021. "A Review of Robust Operations Management under Model Uncertainty," Production and Operations Management, Production and Operations Management Society, vol. 30(6), pages 1927-1943, June.
    13. Olivares-Nadal, Alba V., 2024. "Constructing decision rules for multiproduct newsvendors: An integrated estimation-and-optimization framework," European Journal of Operational Research, Elsevier, vol. 315(3), pages 1021-1037.
    14. Qiu, Ruozhen & Sun, Minghe & Lim, Yun Fong, 2017. "Optimizing (s, S) policies for multi-period inventory models with demand distribution uncertainty: Robust dynamic programing approaches," European Journal of Operational Research, Elsevier, vol. 261(3), pages 880-892.
    15. Meng Qi & Ying Cao & Zuo-Jun (Max) Shen, 2022. "Distributionally Robust Conditional Quantile Prediction with Fixed Design," Management Science, INFORMS, vol. 68(3), pages 1639-1658, March.
    16. Rui Wang & Xiao Yan & Chuanjin Zhu, 2023. "Solving a Distribution-Free Multi-Period Newsvendor Problem With Advance Purchase Discount via an Online Ordering Solution," SAGE Open, , vol. 13(2), pages 21582440231, June.
    17. Gah-Yi Ban, 2020. "Confidence Intervals for Data-Driven Inventory Policies with Demand Censoring," Operations Research, INFORMS, vol. 68(2), pages 309-326, March.
    18. van Eekelen, Wouter, 2023. "Distributionally robust views on queues and related stochastic models," Other publications TiSEM 9b99fc05-9d68-48eb-ae8c-9, Tilburg University, School of Economics and Management.
    19. van der Laan, Niels & Teunter, Ruud H. & Romeijnders, Ward & Kilic, Onur A., 2022. "The data-driven newsvendor problem: Achieving on-target service-levels using distributionally robust chance-constrained optimization," International Journal of Production Economics, Elsevier, vol. 249(C).
    20. Pascal M. Notz & Richard Pibernik, 2022. "Prescriptive Analytics for Flexible Capacity Management," Management Science, INFORMS, vol. 68(3), pages 1756-1775, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:315:y:2024:i:2:p:703-714. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.