An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes
Author
Abstract
Suggested Citation
DOI: 10.1007/s10957-012-9989-5
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Mas-Colell, Andreu & Whinston, Michael D. & Green, Jerry R., 1995. "Microeconomic Theory," OUP Catalogue, Oxford University Press, number 9780195102680.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Yuqing Zheng & Guoshan Zhang, 2020. "Suboptimal Control for Nonlinear Systems with Disturbance via Integral Sliding Mode Control and Policy Iteration," Journal of Optimization Theory and Applications, Springer, vol. 185(2), pages 652-677, May.
- Thomas Spooner & Rahul Savani, 2020. "A Natural Actor-Critic Algorithm with Downside Risk Constraints," Papers 2007.04203, arXiv.org.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Wright, Austin L. & Sonin, Konstantin & Driscoll, Jesse & Wilson, Jarnickae, 2020.
"Poverty and economic dislocation reduce compliance with COVID-19 shelter-in-place protocols,"
Journal of Economic Behavior & Organization, Elsevier, vol. 180(C), pages 544-554.
- Austin L. Wright & Konstantin Sonin & Jesse Driscoll & Jarnickae Wilson, 2020. "Poverty and Economic Dislocation Reduce Compliance with COVID-19 Shelter-in-Place Protocols," Working Papers 2020-40, Becker Friedman Institute for Research In Economics.
- Sonin, Konstantin & Wright, Austin L. & Driscoll, Jesse & Wilson, Jarnickae, 2020. "Poverty and Economic Dislocation Reduce Compliance with COVID-19 Shelter-in-Place Protocols," CEPR Discussion Papers 14618, C.E.P.R. Discussion Papers.
- Jolian McHardy & Michael Reynolds & Stephen Trotter, 2012.
"The Stackelberg Model as a Partial Solution to the Problem of Pricing in a Network,"
Working Paper series
19_12, Rimini Centre for Economic Analysis.
- Jolian McHardy & Michael Reynolds & Stephen Trotter, 2012. "The Stackelberg Model as a Partial Solution to the Problem of Pricing in a Network," Working Papers 2012008, The University of Sheffield, Department of Economics.
- Janvier D. Nkurunziza, 2005. "Reputation and Credit without Collateral in Africa`s Formal Banking," Economics Series Working Papers WPS/2005-02, University of Oxford, Department of Economics.
- Stephanie Rosenkranz & Patrick W. Schmitz, 2007.
"Can Coasean Bargaining Justify Pigouvian Taxation?,"
Economica, London School of Economics and Political Science, vol. 74(296), pages 573-585, November.
- Rosenkranz, Stephanie & Schmitz, Patrick W., 2004. "Can Coasean Bargaining Justify Pigouvian Taxation?," CEPR Discussion Papers 4263, C.E.P.R. Discussion Papers.
- Rosenkranz, Stephanie & Schmitz, Patrick W., 2006. "Can Coasean bargaining justify Pigouvian taxation?," Bonn Econ Discussion Papers 7/2006, University of Bonn, Bonn Graduate School of Economics (BGSE).
- Vadim Borokhov, 2014. "On the properties of nodal price response matrix in electricity markets," Papers 1404.3678, arXiv.org, revised Jan 2015.
- Yuzhou Jiang & Ramteen Sioshansi, 2023. "What Duality Theory Tells Us About Giving Market Operators the Authority to Dispatch Energy Storage," The Energy Journal, , vol. 44(3), pages 89-110, May.
- Daniel Sutter & Daniel J. Smith, 2017. "Coordination in disaster: Nonprice learning and the allocation of resources after natural disasters," The Review of Austrian Economics, Springer;Society for the Development of Austrian Economics, vol. 30(4), pages 469-492, December.
- Hanming Fang & Peter Norman, 2014.
"Toward an efficiency rationale for the public provision of private goods,"
Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 56(2), pages 375-408, June.
- Hanming Fang & Peter Norman, 2008. "Toward an Efficiency Rationale for the Public Provision of Private Goods," NBER Working Papers 13827, National Bureau of Economic Research, Inc.
- Peter Norman & Hanming Fang, 2010. "Toward an Efficiency Rationale for the Public Provision of Private Goods," 2010 Meeting Papers 1185, Society for Economic Dynamics.
- Hanming Fang & Peter Norman, 2008. "Toward an Efficiency Rationale for the Public Provision of Private Goods," 2008 Meeting Papers 1097, Society for Economic Dynamics.
- Gan, Li & Ju, Gaosheng & Zhu, Xi, 2015. "Nonparametric estimation of structural labor supply and exact welfare change under nonconvex piecewise-linear budget sets," Journal of Econometrics, Elsevier, vol. 188(2), pages 526-544.
- Peterson, Jeffrey M. & Boisvert, Richard N. & de Gorter, Harry, 1999. "Multifunctionality and Optimal Environmental Policies for Agriculture in an Open Economy," Working Papers 127701, Cornell University, Department of Applied Economics and Management.
- Tian, Guoqiang, 2009. "Implementation of Pareto efficient allocations," Journal of Mathematical Economics, Elsevier, vol. 45(1-2), pages 113-123, January.
- Ahmad Naimzada & Marina Pireddu, 2019. "The first fundamental theorem of welfare in a general equilibrium evolutionary setting," Working Papers 415, University of Milano-Bicocca, Department of Economics, revised 06 Jun 2019.
- Gajanan Panchal & Vipul Jain & Naoufel Cheikhrouhou & Matthias Gurtner, 2017. "Equilibrium analysis in multi-echelon supply chain with multi-dimensional utilities of inertial players," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 16(4), pages 417-436, August.
- Aldasoro, Iñaki & Delli Gatti, Domenico & Faia, Ester, 2017.
"Bank networks: Contagion, systemic risk and prudential policy,"
Journal of Economic Behavior & Organization, Elsevier, vol. 142(C), pages 164-188.
- Iñaki Aldasoro & Domenico Delli Gatti & Ester Faia, 2015. "Bank Networks: Contagion, Systemic Risk and Prudential Policy," DISCE - Working Papers del Dipartimento di Economia e Finanza def028, Università Cattolica del Sacro Cuore, Dipartimenti e Istituti di Scienze Economiche (DISCE).
- Inaki Aldasoro & Domenico Delli Gatti & Ester Faia, 2015. "Bank Networks: Contagion, Systemic Risk and Prudential Policy," CESifo Working Paper Series 5182, CESifo.
- Iñaki Aldasoro & Domenico Delli Gatti & Ester Faia, 2016. "Bank networks: contagion, systemic risk and prudential policy," BIS Working Papers 597, Bank for International Settlements.
- Aldasoro, Iñaki & Delli Gatti, Domenico & Faia, Ester, 2015. "Bank networks: Contagion, systemic risk and prudential policy," SAFE Working Paper Series 87, Leibniz Institute for Financial Research SAFE, revised 2015.
- Faia, Ester & Delli Gatti, Domenico & Aldasoro, Inaki, 2015. "Bank Networks: Contagion, Systemic Risk and Prudential Policy," CEPR Discussion Papers 10540, C.E.P.R. Discussion Papers.
- Gatti, Nicolas & Cecil, Michael & Baylis, Kathy & Estes, Lyndon & Blekking, Jordan & Heckelei, Thomas & Vergopolan, Noemi & Evans, Tom, 2023. "Is closing the agricultural yield gap a “risky” endeavor?," Agricultural Systems, Elsevier, vol. 208(C).
- Aldo Montesano, 2018. "Social welfare for an economy of angelic agents," International Review of Economics, Springer;Happiness Economics and Interpersonal Relations (HEIRS), vol. 65(2), pages 185-200, June.
- Alexei A. Gaivoronski & Per Jonny Nesse & Olai Bendik Erdal, 2017. "Internet service provision and content services: paid peering and competition between internet providers," Netnomics, Springer, vol. 18(1), pages 43-79, May.
- Romero-Jordán, Desiderio & del Río, Pablo & Peñasco, Cristina, 2016.
"An analysis of the welfare and distributive implications of factors influencing household electricity consumption,"
Energy Policy, Elsevier, vol. 88(C), pages 361-370.
- Desiderio Romero-Jordán & Pablo Del Río & Cristina Peñasco, 2015. "An analysis of the welfare and distributive implications of factors influencing household electricity consumption," Working Papers 1503, Instituto de Políticas y Bienes Públicos (IPP), CSIC.
- Araoz, Veronica & Jörnsten, Kurt, 2011. "Semi-Lagrangean approach for price discovery in markets with non-convexities," European Journal of Operational Research, Elsevier, vol. 214(2), pages 411-417, October.
- Chorvat, Terrence, 2006. "Taxing utility," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 35(1), pages 1-16, February.
More about this item
Keywords
Actor–critic algorithm; Constrained Markov decision processes; Long-run average cost criterion; Function approximation;All these keywords.
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joptap:v:153:y:2012:i:3:d:10.1007_s10957-012-9989-5. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.