IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v33y2021i1p262-279.html
   My bibliography  Save this article

Sparse Convex Regression

Author

Listed:
  • Dimitris Bertsimas

    (Sloan School of Management and Operations Research Center, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139;)

  • Nishanth Mundru

    (Kenan-Flagler Business School, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599)

Abstract

We consider the problem of best k -subset convex regression using n observations in d variables. For the case without sparsity, we develop a scalable algorithm for obtaining high quality solutions in practical times that compare favorably with other state of the art methods. We show that by using a cutting plane method, the least squares convex regression problem can be solved for sizes ( n , d ) = ( 10 4 , 10 ) in minutes and ( n , d ) = ( 10 5 , 1 0 2 ) in hours. Our algorithm can be adapted to solve variants such as finding the best convex or concave functions with coordinate-wise monotonicity, norm-bounded subgradients, and minimize the ℓ 1 loss—all with similar scalability to the least squares convex regression problem. Under sparsity, we propose algorithms which iteratively solve for the best subset of features based on first order and cutting plane methods. We show that our methods scale for sizes ( n , d , k = 10 4 , 1 0 2 , 10 ) in minutes and ( n , d , k = 10 5 , 1 0 2 , 10 ) in hours. We demonstrate that these methods control for the false discovery rate effectively.

Suggested Citation

  • Dimitris Bertsimas & Nishanth Mundru, 2021. "Sparse Convex Regression," INFORMS Journal on Computing, INFORMS, vol. 33(1), pages 262-279, January.
  • Handle: RePEc:inm:orijoc:v:33:y:2021:i:1:p:262-279
    DOI: 10.1287/ijoc.2020.0954
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/ijoc.2020.0954
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2020.0954?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Eunji Lim & Peter W. Glynn, 2012. "Consistency of Multidimensional Convex Regression," Operations Research, INFORMS, vol. 60(1), pages 196-208, February.
    2. Gad Allon & Michael Beenstock & Steven Hackman & Ury Passy & Alexander Shapiro, 2007. "Nonparametric estimation of concave production technologies by entropic methods," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(4), pages 795-816.
    3. Varian, Hal R, 1982. "The Nonparametric Approach to Demand Analysis," Econometrica, Econometric Society, vol. 50(4), pages 945-973, July.
    4. Varian, Hal R, 1984. "The Nonparametric Approach to Production Analysis," Econometrica, Econometric Society, vol. 52(3), pages 579-597, May.
    5. NESTEROV, Yu., 2005. "Smooth minimization of non-smooth functions," LIDAM Reprints CORE 1819, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    6. Mekaroonreung, Maethee & Johnson, Andrew L., 2012. "Estimating the shadow prices of SO2 and NOx for U.S. coal power plants: A convex nonparametric least squares approach," Energy Economics, Elsevier, vol. 34(3), pages 723-732.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alireza Olama & Eduardo Camponogara & Paulo R. C. Mendes, 2023. "Distributed primal outer approximation algorithm for sparse convex programming with separable structures," Journal of Global Optimization, Springer, vol. 86(3), pages 637-670, July.
    2. Dai, Sheng, 2023. "Variable selection in convex quantile regression: L1-norm or L0-norm regularization?," European Journal of Operational Research, Elsevier, vol. 305(1), pages 338-355.
    3. Zhiqiang Liao, 2024. "Variable selection in convex nonparametric least squares via structured Lasso: An application to the Swedish electricity distribution networks," Papers 2409.01911, arXiv.org, revised Nov 2024.
    4. Liao, Zhiqiang & Dai, Sheng & Kuosmanen, Timo, 2024. "Convex support vector regression," European Journal of Operational Research, Elsevier, vol. 313(3), pages 858-870.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lee, Chia-Yen & Johnson, Andrew L. & Moreno-Centeno, Erick & Kuosmanen, Timo, 2013. "A more efficient algorithm for Convex Nonparametric Least Squares," European Journal of Operational Research, Elsevier, vol. 227(2), pages 391-400.
    2. Ian Crawford, 2004. "Necessary and sufficient conditions for latent separability," CeMMAP working papers CWP02/04, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    3. Kuosmanen, Timo & Johnson, Andrew, 2017. "Modeling joint production of multiple outputs in StoNED: Directional distance function approach," European Journal of Operational Research, Elsevier, vol. 262(2), pages 792-801.
    4. Laura Blow & Martin Browning & Ian Crawford, 2004. "Nonparametric methods for the characteristic model," CeMMAP working papers 18/04, Institute for Fiscal Studies.
    5. Sundström, David, 2016. "On Specification and Inference in the Econometrics of Public Procurement," Umeå Economic Studies 931, Umeå University, Department of Economics.
    6. Thomas Demuynck & John Rehbeck, 2023. "Computing revealed preference goodness-of-fit measures with integer programming," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 76(4), pages 1175-1195, November.
    7. Carvajal, Andres & Ray, Indrajit & Snyder, Susan, 2004. "Equilibrium behavior in markets and games: testable restrictions and identification," Journal of Mathematical Economics, Elsevier, vol. 40(1-2), pages 1-40, February.
    8. H. Spencer Banzhaf & Yaqin Liu & Martin Smith & Frank Asche, 2019. "Non-Parametric Tests of the Tragedy of the Commons," NBER Working Papers 26398, National Bureau of Economic Research, Inc.
    9. Nanjing Jian & Shane G. Henderson, 2020. "Estimating the Probability that a Function Observed with Noise Is Convex," INFORMS Journal on Computing, INFORMS, vol. 32(2), pages 376-389, April.
    10. Crooker, John & Kling, Catherine L., 2000. "Nonparametric Bounds on Welfare Measures: A New Tool for Nonmarket Valuation," Journal of Environmental Economics and Management, Elsevier, vol. 39(2), pages 145-161, March.
    11. Marshall B. Reinsdorf, 2010. "Terms Of Trade Effects: Theory And Measurement," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 56(s1), pages 177-205, June.
    12. Laurens Cherchye & Thomas Demuynck & Bram Rock & Kristof Witte, 2014. "Non‐parametric Analysis of Multi‐output Production with Joint Inputs," Economic Journal, Royal Economic Society, vol. 124(577), pages 735-775, June.
    13. Jan Heufer, 2014. "Generating Random Optimising Choices," Computational Economics, Springer;Society for Computational Economics, vol. 44(3), pages 295-305, October.
    14. Laurens Cherchye & Bram De Rock & Frederic Vermeulen & Selma Walther, 2021. "Where did it go wrong? Marriage and divorce in Malawi," Quantitative Economics, Econometric Society, vol. 12(2), pages 505-545, May.
    15. Laura Blow & Martin Browning & Ian Crawford, 2008. "Revealed Preference Analysis of Characteristics Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 75(2), pages 371-389.
    16. Snyder, Susan K., 1999. "Testable restrictions of Pareto optimal public good provision," Journal of Public Economics, Elsevier, vol. 71(1), pages 97-119, January.
    17. Robert C. Feenstra & Benjamin R. Mandel & Marshall B. Reinsdorf & Matthew J. Slaughter, 2013. "Effects of Terms of Trade Gains and Tariff Changes on the Measurement of US Productivity Growth," American Economic Journal: Economic Policy, American Economic Association, vol. 5(1), pages 59-93, February.
    18. Aguiar, Victor H. & Kashaev, Nail & Allen, Roy, 2023. "Prices, profits, proxies, and production," Journal of Econometrics, Elsevier, vol. 235(2), pages 666-693.
    19. Gad Allon & Michael Beenstock & Steven Hackman & Ury Passy & Alexander Shapiro, 2007. "Nonparametric estimation of concave production technologies by entropic methods," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(4), pages 795-816.
    20. Deb, Rahul & Fenske, James, 2009. "A Nonparametric Test of Strategic Behavior in the Cournot Model," MPRA Paper 16560, University Library of Munich, Germany.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:33:y:2021:i:1:p:262-279. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.