Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results

Author

Listed:

Mauro Gaggero
(National Research Council of Italy)
Giorgio Gnecco
(University of Genova)
Marcello Sanguineti
(University of Genova)

Abstract

Value-function approximation is investigated for the solution via Dynamic Programming (DP) of continuous-state sequential N-stage decision problems, in which the reward to be maximized has an additive structure over a finite number of stages. Conditions that guarantee smoothness properties of the value function at each stage are derived. These properties are exploited to approximate such functions by means of certain nonlinear approximation schemes, which include splines of suitable order and Gaussian radial-basis networks with variable centers and widths. The accuracies of suboptimal solutions obtained by combining DP with these approximation tools are estimated. The results help explain the successful performance reported in the literature for value-function approximators in DP. The theoretical analysis is applied to a problem of optimal consumption, with simulation results illustrating the use of the proposed solution methodology. Numerical comparisons with classical linear approximators are presented.
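
As a rough illustration of the general idea summarized in the abstract (backward DP over N stages with an approximated value function), the following minimal sketch solves a toy deterministic consumption problem. It is not the authors' method or model: the reward log(1 + c), the dynamics x' = x - c, the grid sizes, and the function name `approximate_dp` are all assumptions made for this example, and piecewise-linear interpolation on a fixed grid (a linear approximator) stands in for the splines and Gaussian radial-basis networks studied in the paper.

```python
import numpy as np

def approximate_dp(n_stages=5, x_max=10.0, n_grid=201, n_actions=201):
    """Backward DP for a toy problem (hypothetical, not the paper's model).

    State x is the remaining wealth, decision c in [0, x] is consumption,
    stage reward is log(1 + c), dynamics are x' = x - c, and the terminal
    reward is log(1 + x) (whatever is left is consumed at the end).
    """
    grid = np.linspace(0.0, x_max, n_grid)        # sampled states
    fractions = np.linspace(0.0, 1.0, n_actions)  # consume a fraction of x
    values = [None] * (n_stages + 1)
    values[n_stages] = np.log1p(grid)             # terminal value function

    for t in range(n_stages - 1, -1, -1):
        c = grid[:, None] * fractions[None, :]    # candidate consumptions
        x_next = grid[:, None] - c                # resulting next states
        # Approximate J_{t+1} at off-grid states by linear interpolation.
        cont = np.interp(x_next.ravel(), grid, values[t + 1])
        cont = cont.reshape(x_next.shape)
        # Bellman step: maximize stage reward plus approximate continuation.
        values[t] = np.max(np.log1p(c) + cont, axis=1)
    return grid, values

grid, values = approximate_dp()
# For this toy problem the exact value is known in closed form: with
# N decision stages plus the terminal one, equal splitting is optimal.
exact = 6 * np.log1p(grid / 6)
max_error = np.max(np.abs(values[0] - exact))
```

Here `max_error` measures how far the approximate stage-0 value function is from the closed-form one on the grid. In this toy setting the error accumulates over the backward stages, which is the kind of effect an error analysis of approximate DP has to account for.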

Suggested Citation

Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2013. "Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results," Journal of Optimization Theory and Applications, Springer, vol. 156(2), pages 380-416, February.

Handle: RePEc:spr:joptap:v:156:y:2013:i:2:d:10.1007_s10957-012-0118-2
DOI: 10.1007/s10957-012-0118-2

Citations

Cited by:

Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty," Computational Optimization and Applications, Springer, vol. 58(1), pages 31-85, May.
Andrea Bacigalupo & Giorgio Gnecco & Marco Lepidi & Luigi Gambarotta, 2020. "Machine-Learning Techniques for the Optimal Design of Acoustic Metamaterials," Journal of Optimization Theory and Applications, Springer, vol. 187(3), pages 630-653, December.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty," Computational Optimization and Applications, Springer, vol. 58(1), pages 31-85, May.
John Stachurski, 2009. "Economic Dynamics: Theory and Computation," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012774, December.
M. Baglietto & C. Cervellera & M. Sanguineti & R. Zoppoli, 2010. "Management of water resource systems in the presence of uncertainties by nonlinear approximation techniques and deterministic sampling," Computational Optimization and Applications, Springer, vol. 47(2), pages 349-376, October.
Cervellera, C. & Macciò, D., 2011. "A comparison of global and semi-local approximation in T-stage stochastic optimization," European Journal of Operational Research, Elsevier, vol. 208(2), pages 109-118, January.
Heer Burkhard & Maußner Alfred, 2011. "Value Function Iteration as a Solution Method for the Ramsey Model," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(4), pages 494-515, August.
- Burkhard Heer & Alfred Maussner, 2008. "Value Function Iteration as a Solution Method for the Ramsey Model," CESifo Working Paper Series 2278, CESifo.
Maliar, Lilia & Maliar, Serguei & Tsener, Inna, 2022. "Capital-skill complementarity and inequality: Twenty years after," Economics Letters, Elsevier, vol. 220(C).
- Maliar, Serguei & Tsener, Inna, 2020. "Capital-Skill Complementarity and Inequality: Twenty Years After," CEPR Discussion Papers 15228, C.E.P.R. Discussion Papers.
Lilia Maliar & Serguei Maliar & John B. Taylor & Inna Tsener, 2020. "A tractable framework for analyzing a class of nonstationary Markov models," Quantitative Economics, Econometric Society, vol. 11(4), pages 1289-1323, November.
- Lilia Maliar & Serguei Maliar & John B. Taylor & Inna Tsener, 2015. "A Tractable Framework for Analyzing a Class of Nonstationary Markov Models," Economics Working Papers 15105, Hoover Institution, Stanford University.
- Lilia Maliar & Serguei Maliar & John Taylor & Inna Tsener, 2015. "A Tractable Framework for Analyzing a Class of Nonstationary Markov Models," NBER Working Papers 21155, National Bureau of Economic Research, Inc.
Somayeh Moazeni & Warren B. Powell & Boris Defourny & Belgacem Bouzaiene-Ayari, 2017. "Parallel Nonstationary Direct Policy Search for Risk-Averse Stochastic Optimization," INFORMS Journal on Computing, INFORMS, vol. 29(2), pages 332-349, May.
Chen, Ruoran & Deng, Tianhu & Huang, Simin & Qin, Ruwen, 2015. "Optimal crude oil procurement under fluctuating price in an oil refinery," European Journal of Operational Research, Elsevier, vol. 245(2), pages 438-445.
Mercedes Esteban-Bravo & Jose M. Vidal-Sanz & Gökhan Yildirim, 2014. "Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition," Marketing Science, INFORMS, vol. 33(5), pages 621-640, September.
- Vidal-Sanz, Jose M. & Yildirim, Gökhan, 2012. "Valuing customer portfolios with endogenous mass-and-direct-marketing interventions using a stochastic dynamic programming decomposition," DEE - Working Papers. Business Economics. WB wb121304, Universidad Carlos III de Madrid. Departamento de Economía de la Empresa.
Nikolay Gospodinov & Damba Lkhagvasuren, 2014. "A Moment‐Matching Method For Approximating Vector Autoregressive Processes By Finite‐State Markov Chains," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(5), pages 843-859, August.
- Nikolay Gospodinov & Damba Lkhagvasuren, 2011. "A Moment-Matching Method for Approximating Vector Autoregressive Processes by Finite-State Markov Chains," Working Papers 11005, Concordia University, Department of Economics, revised 16 Dec 2011.
- Nikolay Gospodinov & Damba Lkhagvasuren, 2013. "A moment-matching method for approximating vector autoregressive processes by finite-state Markov chains," FRB Atlanta Working Paper 2013-05, Federal Reserve Bank of Atlanta.
Zehua Yang & Victoria C. P. Chen & Michael E. Chang & Melanie L. Sattler & Aihong Wen, 2009. "A Decision-Making Framework for Ozone Pollution Control," Operations Research, INFORMS, vol. 57(2), pages 484-498, April.
Diego Klabjan & Daniel Adelman, 2007. "An Infinite-Dimensional Linear Programming Algorithm for Deterministic Semi-Markov Decision Processes on Borel Spaces," Mathematics of Operations Research, INFORMS, vol. 32(3), pages 528-550, August.
Chen, Victoria C. P., 1999. "Application of orthogonal arrays and MARS to inventory forecasting stochastic dynamic programs," Computational Statistics & Data Analysis, Elsevier, vol. 30(3), pages 317-341, May.
Justin McCrary, 2010. "Dynamic Perspectives on Crime," Chapters, in: Bruce L. Benson & Paul R. Zimmerman (ed.), Handbook on the Economics of Crime, chapter 4, Edward Elgar Publishing.
Serguei Maliar & John Taylor & Lilia Maliar, 2016. "The Impact of Alternative Transitions to Normalized Monetary Policy," 2016 Meeting Papers 794, Society for Economic Dynamics.
King, Robert P. & Lohano, Heman D., 2006. "Accuracy of Numerical Solution to Dynamic Programming Models," Staff Papers 14230, University of Minnesota, Department of Applied Economics.
Padula, Mario, 2010. "An approximate consumption function," Journal of Economic Dynamics and Control, Elsevier, vol. 34(3), pages 404-416, March.
- Mario Padula & Università di Salerno, 2006. "An approximate consumption function," Computing in Economics and Finance 2006 133, Society for Computational Economics.
- Mario Padula, 2008. "An Approximate Consumption Function," CSEF Working Papers 199, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
- Mario Padula, 2008. "An approximate consumption function," Working Papers 2008_24, Department of Economics, University of Venice "Ca' Foscari".
Jinhui H. Bai & Roger Lagunoff, 2013. "Revealed Political Power," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 54(4), pages 1085-1115, November.
- Roger Lagunoff & Jinhui H. Bai, 2010. "Revealed Political Power," 2010 Meeting Papers 542, Society for Economic Dynamics.
- Jinhui H. Bai & Roger Lagunoff, 2010. "Revealed Political Power," Levine's Working Paper Archive 661465000000000106, David K. Levine.
- Jinhui Bai and Roger Lagunoff, 2010. "Revealed Political Power," Working Papers gueconwpa~10-10-01, Georgetown University, Department of Economics.
Ufuk Akcigit, 2009. "Firm Size, Innovation Dynamics and Growth," 2009 Meeting Papers 1267, Society for Economic Dynamics.

More about this item

Keywords

Sequential decision problems; Dynamic programming; Approximation schemes; Curse of dimensionality; Suboptimal solutions; Optimal consumption;