IDEAS home Printed from https://ideas.repec.org/a/spr/joptap/v156y2013i2d10.1007_s10957-012-0118-2.html
   My bibliography  Save this article

Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results

Author

Listed:
  • Mauro Gaggero

    (National Research Council of Italy)

  • Giorgio Gnecco

    (University of Genova)

  • Marcello Sanguineti

    (University of Genova)

Abstract

Value-function approximation is investigated for the solution via Dynamic Programming (DP) of continuous-state sequential N-stage decision problems, in which the reward to be maximized has an additive structure over a finite number of stages. Conditions that guarantee smoothness properties of the value function at each stage are derived. These properties are exploited to approximate such functions by means of certain nonlinear approximation schemes, which include splines of suitable order and Gaussian radial-basis networks with variable centers and widths. The accuracies of suboptimal solutions obtained by combining DP with these approximation tools are estimated. The results provide insights into the successful performances appeared in the literature about the use of value-function approximators in DP. The theoretical analysis is applied to a problem of optimal consumption, with simulation results illustrating the use of the proposed solution methodology. Numerical comparisons with classical linear approximators are presented.

Suggested Citation

  • Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2013. "Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results," Journal of Optimization Theory and Applications, Springer, vol. 156(2), pages 380-416, February.
  • Handle: RePEc:spr:joptap:v:156:y:2013:i:2:d:10.1007_s10957-012-0118-2
    DOI: 10.1007/s10957-012-0118-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10957-012-0118-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10957-012-0118-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kenneth L. Judd, 1998. "Numerical Methods in Economics," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262100711, April.
    2. Victoria C. P. Chen & David Ruppert & Christine A. Shoemaker, 1999. "Applying Experimental Design and Regression Splines to High-Dimensional Continuous-State Stochastic Dynamic Programming," Operations Research, INFORMS, vol. 47(1), pages 38-53, February.
    3. Wim M. Nawijn, 1990. "Look-Ahead Policies for Admission to a Single Server Loss System," Operations Research, INFORMS, vol. 38(5), pages 854-862, October.
    4. Semmler, Willi & Sieveking, Malte, 2000. "Critical debt and debt dynamics," Journal of Economic Dynamics and Control, Elsevier, vol. 24(5-7), pages 1121-1144, June.
    5. Jerome Adda & Russell W. Cooper, 2003. "Dynamic Economics: Quantitative Methods and Applications," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012014, April.
    6. Martin L. Puterman & Moon Chirl Shin, 1978. "Modified Policy Iteration Algorithms for Discounted Markov Decision Problems," Management Science, INFORMS, vol. 24(11), pages 1127-1137, July.
    7. Angelo Alessandri & Giorgio Gnecco & Marcello Sanguineti, 2010. "Minimizing Sequences for a Family of Functional Optimal Estimation Problems," Journal of Optimization Theory and Applications, Springer, vol. 147(2), pages 243-262, November.
    8. R. Zoppoli & M. Sanguineti & T. Parisini, 2002. "Approximating Networks and Extended Ritz Method for the Solution of Functional Optimization Problems," Journal of Optimization Theory and Applications, Springer, vol. 112(2), pages 403-440, February.
    9. G. Gnecco & M. Sanguineti, 2010. "Suboptimal Solutions to Dynamic Optimization Problems via Approximations of the Policy Functions," Journal of Optimization Theory and Applications, Springer, vol. 146(3), pages 764-794, September.
    10. Karp, Larry & Lee, In Ho, 2000. "Learning-by-Doing and the Choice of Technology: the Role of Patience," Department of Agricultural & Resource Economics, UC Berkeley, Working Paper Series qt4vh9x271, Department of Agricultural & Resource Economics, UC Berkeley.
    11. Sharon A. Johnson & Jery R. Stedinger & Christine A. Shoemaker & Ying Li & José Alberto Tejada-Guibert, 1993. "Numerical Solution of Continuous-State Dynamic Programs Using Linear and Spline Interpolation," Operations Research, INFORMS, vol. 41(3), pages 484-500, June.
    12. Karp, Larry & Lee, In Ho, 2001. "Learning-by-Doing and the Choice of Technology: The Role of Patience," Journal of Economic Theory, Elsevier, vol. 100(1), pages 73-92, September.
    13. Boldrin, Michele & Montrucchio, Luigi, 1986. "On the indeterminacy of capital accumulation paths," Journal of Economic Theory, Elsevier, vol. 40(1), pages 26-39, October.
    14. Michael Kopel & Gustav Feichtinger & Herbert Dawid, 1997. "Complex solutions of nonconcave dynamic optimization models (*)," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 9(3), pages 427-439.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty," Computational Optimization and Applications, Springer, vol. 58(1), pages 31-85, May.
    2. Andrea Bacigalupo & Giorgio Gnecco & Marco Lepidi & Luigi Gambarotta, 2020. "Machine-Learning Techniques for the Optimal Design of Acoustic Metamaterials," Journal of Optimization Theory and Applications, Springer, vol. 187(3), pages 630-653, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty," Computational Optimization and Applications, Springer, vol. 58(1), pages 31-85, May.
    2. John Stachurski, 2009. "Economic Dynamics: Theory and Computation," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012774, April.
    3. M. Baglietto & C. Cervellera & M. Sanguineti & R. Zoppoli, 2010. "Management of water resource systems in the presence of uncertainties by nonlinear approximation techniques and deterministic sampling," Computational Optimization and Applications, Springer, vol. 47(2), pages 349-376, October.
    4. Cervellera, C. & Macciò, D., 2011. "A comparison of global and semi-local approximation in T-stage stochastic optimization," European Journal of Operational Research, Elsevier, vol. 208(2), pages 109-118, January.
    5. Heer Burkhard & Maußner Alfred, 2011. "Value Function Iteration as a Solution Method for the Ramsey Model," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(4), pages 494-515, August.
    6. Maliar, Lilia & Maliar, Serguei & Tsener, Inna, 2022. "Capital-skill complementarity and inequality: Twenty years after," Economics Letters, Elsevier, vol. 220(C).
    7. Lilia Maliar & Serguei Maliar & John B. Taylor & Inna Tsener, 2020. "A tractable framework for analyzing a class of nonstationary Markov models," Quantitative Economics, Econometric Society, vol. 11(4), pages 1289-1323, November.
    8. Somayeh Moazeni & Warren B. Powell & Boris Defourny & Belgacem Bouzaiene-Ayari, 2017. "Parallel Nonstationary Direct Policy Search for Risk-Averse Stochastic Optimization," INFORMS Journal on Computing, INFORMS, vol. 29(2), pages 332-349, May.
    9. Chen, Ruoran & Deng, Tianhu & Huang, Simin & Qin, Ruwen, 2015. "Optimal crude oil procurement under fluctuating price in an oil refinery," European Journal of Operational Research, Elsevier, vol. 245(2), pages 438-445.
    10. Ben Klemens, 2021. "Attributing Value to Patents and Trademarks in Complex Production Chains," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 12(2), pages 842-875, June.
    11. Mercedes Esteban-Bravo & Jose M. Vidal-Sanz & Gökhan Yildirim, 2014. "Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition," Marketing Science, INFORMS, vol. 33(5), pages 621-640, September.
    12. Nikolay Gospodinov & Damba Lkhagvasuren, 2014. "A Moment‐Matching Method For Approximating Vector Autoregressive Processes By Finite‐State Markov Chains," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(5), pages 843-859, August.
    13. Zehua Yang & Victoria C. P. Chen & Michael E. Chang & Melanie L. Sattler & Aihong Wen, 2009. "A Decision-Making Framework for Ozone Pollution Control," Operations Research, INFORMS, vol. 57(2), pages 484-498, April.
    14. Diego Klabjan & Daniel Adelman, 2007. "An Infinite-Dimensional Linear Programming Algorithm for Deterministic Semi-Markov Decision Processes on Borel Spaces," Mathematics of Operations Research, INFORMS, vol. 32(3), pages 528-550, August.
    15. Chen, Victoria C. P., 1999. "Application of orthogonal arrays and MARS to inventory forecasting stochastic dynamic programs," Computational Statistics & Data Analysis, Elsevier, vol. 30(3), pages 317-341, May.
    16. Justin McCrary, 2010. "Dynamic Perspectives on Crime," Chapters, in: Bruce L. Benson & Paul R. Zimmerman (ed.), Handbook on the Economics of Crime, chapter 4, Edward Elgar Publishing.
    17. Serguei Maliar & John Taylor & Lilia Maliar, 2016. "The Impact of Alternative Transitions to Normalized Monetary Policy," 2016 Meeting Papers 794, Society for Economic Dynamics.
    18. James G. Mulligan & Nilotpal Das, 2005. "Persistent Adoption of Time-Saving Process Innovations," Working Papers 05-03, University of Delaware, Department of Economics.
    19. King, Robert P. & Lohano, Heman D., 2006. "Accuracy of Numerical Solution to Dynamic Programming Models," Staff Papers 14230, University of Minnesota, Department of Applied Economics.
    20. Padula, Mario, 2010. "An approximate consumption function," Journal of Economic Dynamics and Control, Elsevier, vol. 34(3), pages 404-416, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joptap:v:156:y:2013:i:2:d:10.1007_s10957-012-0118-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.