On the convergence of reinforcement learning
Citations
Citations are extracted by the CitEc Project.
Cited by:
- Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
- Josef Hofbauer & Ed Hopkins, 2000. "Learning in Perturbed Asymmetric Games," Edinburgh School of Economics Discussion Paper Series 53, Edinburgh School of Economics, University of Edinburgh.
- Köke, Sonja & Lange, Andreas & Nicklisch, Andreas, 2015. "Adversity is a school of wisdom: Experimental evidence on cooperative protection against stochastic losses," WiSo-HH Working Paper Series 22, University of Hamburg, Faculty of Business, Economics and Social Sciences, WISO Research Laboratory.
- Josephson, Jens, 2008. "A numerical analysis of the evolutionary stability of learning rules," Journal of Economic Dynamics and Control, Elsevier, vol. 32(5), pages 1569-1599, May.
- Josephson, Jens, 2001. "A Numerical Analysis of the Evolutionary Stability of Learning Rules," SSE/EFI Working Paper Series in Economics and Finance 474, Stockholm School of Economics.
- Mario Bravo & Mathieu Faure, 2013. "Reinforcement Learning with Restrictions on the Action Set," AMSE Working Papers 1335, Aix-Marseille School of Economics, France, revised 01 Jul 2013.
- Mario Bravo & Mathieu Faure, 2015. "Reinforcement Learning with Restrictions on the Action Set," Post-Print hal-01457301, HAL.
- Ding, Jieyao & Nicklisch, Andreas, 2013. "On the impulse in impulse learning," Economics Letters, Elsevier, vol. 121(2), pages 294-297.
- Nicklisch, Andreas & Köke, Sonja & Lange, Andreas, 2016. "Is Adversity a School of Wisdom? Experimental Evidence on Cooperative Protection Against Stochastic Losses," VfS Annual Conference 2016 (Augsburg): Demographic Change 145716, Verein für Socialpolitik / German Economic Association.
- Chmura, Thorsten & Goerg, Sebastian J. & Selten, Reinhard, 2012. "Learning in experimental 2×2 games," Games and Economic Behavior, Elsevier, vol. 76(1), pages 44-73.
- Chmura, Thorsten & Goerg, Sebastian J. & Selten, Reinhard, 2008. "Learning in experimental 2×2 games," Bonn Econ Discussion Papers 18/2008, University of Bonn, Bonn Graduate School of Economics (BGSE).
- Thorsten Chmura & Sebastian Goerg & Reinhard Selten, 2011. "Learning in experimental 2 x 2 games," Discussion Paper Series of the Max Planck Institute for Research on Collective Goods 2011_26, Max Planck Institute for Research on Collective Goods.
- Naoki Funai, 2013. "An Adaptive Learning Model in Coordination Games," Discussion Papers 13-14, Department of Economics, University of Birmingham.
- Jieyao Ding & Andreas Nicklisch, 2013. "On the Impulse in Impulse Learning," Discussion Paper Series of the Max Planck Institute for Research on Collective Goods 2013_02, Max Planck Institute for Research on Collective Goods.
- Chernov, G. & Susin, I., 2019. "Models of learning in games: An overview," Journal of the New Economic Association, New Economic Association, vol. 44(4), pages 77-125.
- Ianni, Antonella, 2014. "Learning strict Nash equilibria through reinforcement," Journal of Mathematical Economics, Elsevier, vol. 50(C), pages 148-155.
- Ianni, Antonella, 2011. "Learning Strict Nash Equilibria through Reinforcement," MPRA Paper 33936, University Library of Munich, Germany.
- Friedman, Daniel & Huck, Steffen & Oprea, Ryan & Weidenholzer, Simon, 2015. "From imitation to collusion: Long-run learning in a low-information environment," Journal of Economic Theory, Elsevier, vol. 155(C), pages 185-205.
- Friedman, Daniel & Huck, Steffen & Oprea, Ryan & Weidenholzer, Simon, 2012. "From imitation to collusion: Long-run learning in a low-information environment," Discussion Papers, Research Unit: Economics of Change SP II 2012-301r, WZB Berlin Social Science Center.
- Friedman, D & Huck, S & Oprea, R & Weidenholzer, S, 2012. "From Imitation to Collusion: Long-run Learning in a Low-Information Environment," Economics Discussion Papers 8954, University of Essex, Department of Economics.
- Friedman, Daniel & Huck, Steffen & Oprea, Ryan & Weidenholzer, Simon, 2012. "From imitation to collusion: Long-run learning in a low-information environment," Discussion Papers, Research Unit: Economics of Change SP II 2012-301, WZB Berlin Social Science Center.
- Daniel Friedman & Steffen Huck & Ryan Oprea & Simon Weidenholzer, 2012. "From Imitation to Collusion: Long-run Learning in a Low-Information Environment," Levine's Working Paper Archive 786969000000000457, David K. Levine.
- Jaspersen, Johannes G. & Montibeller, Gilberto, 2020. "On the learning patterns and adaptive behavior of terrorist organizations," European Journal of Operational Research, Elsevier, vol. 282(1), pages 221-234.
- Fortini, Sandra & Petrone, Sonia & Sporysheva, Polina, 2018. "On a notion of partially conditionally identically distributed sequences," Stochastic Processes and their Applications, Elsevier, vol. 128(3), pages 819-846.
- Albert Banal-Estañol & Augusto Rupérez Micola, 2009. "Composition of Electricity Generation Portfolios, Pivotal Dynamics, and Market Prices," Management Science, INFORMS, vol. 55(11), pages 1813-1831, November.
- Augusto Rupérez-Micola & Albert Banal-Estañol, 2007. "Composition of electricity generation portfolios, pivotal dynamics and market prices," Economics Working Papers 1083, Department of Economics and Business, Universitat Pompeu Fabra.
- Naoki Funai, 2019. "Convergence results on stochastic adaptive learning," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 68(4), pages 907-934, November.
- Oyarzun, Carlos & Sarin, Rajiv, 2013. "Learning and risk aversion," Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
- Carlos Oyarzun & Rajiv Sarin, 2005. "Learning and Risk Aversion," Levine's Bibliography 784828000000000482, UCLA Department of Economics.
- Carlos Oyarzun & Rajiv Sarin, 2012. "Learning and Risk Aversion," Levine's Working Paper Archive 786969000000000572, David K. Levine.
- Mele, Antonio & Molnár, Krisztina & Santoro, Sergio, 2020. "On the perils of stabilizing prices when agents are learning," Journal of Monetary Economics, Elsevier, vol. 115(C), pages 339-353.
- Mele, Antonio & Molnar, Krisztina & Santoro, Sergio, 2014. "On the perils of stabilizing prices when agents are learning," Discussion Paper Series in Economics 1/2015, Norwegian School of Economics, Department of Economics.
- Mele, Antonio & Molnar, Krisztina & Santoro, Sergio, 2018. "On the perils of stabilizing prices when agents are learning," Discussion Paper Series in Economics 22/2018, Norwegian School of Economics, Department of Economics.
- Antonio Mele & Krisztina Molnar & Sergio Santoro, 2015. "On the perils of stabilizing prices when agents are learning," School of Economics Discussion Papers 0215, School of Economics, University of Surrey.
- Antonio Mele & Krisztina Molnár & Sergio Santoro, 2015. "On the Perils of Stabilizing Prices when Agents are Learning," CESifo Working Paper Series 5173, CESifo.
- Michael Foley & Rory Smead & Patrick Forber & Christoph Riedl, 2021. "Avoiding the bullies: The resilience of cooperation among unequals," PLOS Computational Biology, Public Library of Science, vol. 17(4), pages 1-18, April.
- Michael Foley & Rory Smead & Patrick Forber & Christoph Riedl, 2021. "Avoiding the bullies: The resilience of cooperation among unequals," Papers 2104.08636, arXiv.org.
- Nazaria Solferino & Viviana Solferino & Serena F. Taurino, 2018. "The economics analysis of a Q-learning model of cooperation with punishment and risk taking preferences," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 13(3), pages 601-613, October.
- Erik Mohlin & Robert Ostling & Joseph Tao-yi Wang, 2014. "Learning by Imitation in Games: Theory, Field, and Laboratory," Economics Series Working Papers 734, University of Oxford, Department of Economics.
- Leslie, David S. & Collins, E.J., 2006. "Generalised weakened fictitious play," Games and Economic Behavior, Elsevier, vol. 56(2), pages 285-298, August.
- Christoph March, 2019. "The Behavioral Economics of Artificial Intelligence: Lessons from Experiments with Computer Players," CESifo Working Paper Series 7926, CESifo.
- March, Christoph, 2019. "The behavioral economics of artificial intelligence: Lessons from experiments with computer players," BERG Working Paper Series 154, Bamberg University, Bamberg Economic Research Group.
- Giacomo Aletti & Caterina May & Piercesare Secchi, 2012. "A Functional Equation Whose Unknown is $\mathcal{P}([0,1])$ Valued," Journal of Theoretical Probability, Springer, vol. 25(4), pages 1207-1232, December.
- Kuang Xu & Se-Young Yun, 2020. "Reinforcement with Fading Memories," Mathematics of Operations Research, INFORMS, vol. 45(4), pages 1258-1288, November.
- Beggs, Alan, 2022. "Reference points and learning," Journal of Mathematical Economics, Elsevier, vol. 100(C).
- Alan Beggs, 2015. "Reference Points and Learning," Economics Series Working Papers 767, University of Oxford, Department of Economics.
- Maxwell Pak & Bing Xu, 2016. "Generalized reinforcement learning in perfect-information games," International Journal of Game Theory, Springer;Game Theory Society, vol. 45(4), pages 985-1011, November.
- Jacques Durieu & Philippe Solal, 2012. "Models of Adaptive Learning in Game Theory," Chapters, in: Richard Arena & Agnès Festré & Nathalie Lazaric (ed.), Handbook of Knowledge and Economics, chapter 11, Edward Elgar Publishing.
- Jacques Durieu & Philippe Solal, 2012. "Models of adaptive learning in game theory," Post-Print halshs-00667674, HAL.
- Izquierdo, Luis R. & Izquierdo, Segismundo S. & Gotts, Nicholas M. & Polhill, J. Gary, 2007. "Transient and asymptotic dynamics of reinforcement learning in games," Games and Economic Behavior, Elsevier, vol. 61(2), pages 259-276, November.
- Schuster, Stephan, 2010. "Network Formation with Adaptive Agents," MPRA Paper 27388, University Library of Munich, Germany.
- Hopkins, Ed & Posch, Martin, 2005. "Attainability of boundary points under reinforcement learning," Games and Economic Behavior, Elsevier, vol. 53(1), pages 110-125, October.
- Ed Hopkins & Martin Posch, 2003. "Attainability of Boundary Points under Reinforcement Learning," Edinburgh School of Economics Discussion Paper Series 79, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins & Martin Posch, 2003. "Attainability of Boundary Points under Reinforcement Learning," Levine's Working Paper Archive 506439000000000350, David K. Levine.
- Cominetti, Roberto & Melo, Emerson & Sorin, Sylvain, 2010. "A payoff-based learning procedure and its application to traffic games," Games and Economic Behavior, Elsevier, vol. 70(1), pages 71-83, September.
- Han, Jungsuk & Sangiorgi, Francesco, 2018. "Searching for information," Journal of Economic Theory, Elsevier, vol. 175(C), pages 342-373.
- Han, Jungsuk & Sangiorgi, Francesco, 2015. "Searching for Information," Working Paper Series 300, Sveriges Riksbank (Central Bank of Sweden).
- Alanyali, Murat, 2010. "A note on adjusted replicator dynamics in iterated games," Journal of Mathematical Economics, Elsevier, vol. 46(1), pages 86-98, January.
- Conor Mayo-Wilson & Kevin Zollman & David Danks, 2013. "Wisdom of crowds versus groupthink: learning in groups and in isolation," International Journal of Game Theory, Springer;Game Theory Society, vol. 42(3), pages 695-723, August.
- Oyarzun, Carlos & Ruf, Johannes, 2014. "Convergence in models with bounded expected relative hazard rates," Journal of Economic Theory, Elsevier, vol. 154(C), pages 229-244.
- Manxi Wu & Saurabh Amin & Asuman Ozdaglar, 2021. "Multi-agent Bayesian Learning with Best Response Dynamics: Convergence and Stability," Papers 2109.00719, arXiv.org.
- Naoki Funai, 2013. "An Adaptive Learning Model in Coordination Games," Games, MDPI, vol. 4(4), pages 1-22, November.
- Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
- March, Christoph, 2021. "Strategic interactions between humans and artificial intelligence: Lessons from experiments with computer players," Journal of Economic Psychology, Elsevier, vol. 87(C).
- Ilaria Brunetti & Yezekael Hayel & Eitan Altman, 2018. "State-Policy Dynamics in Evolutionary Games," Dynamic Games and Applications, Springer, vol. 8(1), pages 93-116, March.
- Manxi Wu & Saurabh Amin, 2019. "Securing Infrastructure Facilities: When Does Proactive Defense Help?," Dynamic Games and Applications, Springer, vol. 9(4), pages 984-1025, December.
- Funai, Naoki, 2022. "Reinforcement learning with foregone payoff information in normal form games," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 638-660.
- Mario Bravo, 2016. "An Adjusted Payoff-Based Procedure for Normal Form Games," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1469-1483, November.
- Georgios Chasparis & Jeff Shamma & Anders Rantzer, 2015. "Nonconvergence to saddle boundary points under perturbed reinforcement learning," International Journal of Game Theory, Springer;Game Theory Society, vol. 44(3), pages 667-699, August.
- Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
- Roger Waldeck & Eric Darmon, 2006. "Can boundedly rational sellers learn to play Nash?," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 1(2), pages 147-169, November.
- Pemantle, Robin & Skyrms, Brian, 2004. "Network formation by reinforcement learning: the long and medium run," Mathematical Social Sciences, Elsevier, vol. 48(3), pages 315-327, November.
- Georgios Chasparis & Jeff Shamma, 2012. "Distributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation," Dynamic Games and Applications, Springer, vol. 2(1), pages 18-50, March.