My bibliography
Save this item
Optimal learning and experimentation in bandit problems
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Mathur, Sudhanshu & Morozov, Sergei, 2009. "Massively Parallel Computation Using Graphics Processors with Application to Optimal Experimentation in Dynamic Control," MPRA Paper 16721, University Library of Munich, Germany.
- Noah Gans & George Knox & Rachel Croson, 2007. "Simple Models of Discrete Choice and Their Performance in Bandit Experiments," Manufacturing & Service Operations Management, INFORMS, vol. 9(4), pages 383-408, December.
- Nina Deliu, 2024. "Reinforcement learning for sequential decision making in population research," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(6), pages 5057-5080, December.
- Pai, Mallesh & Hansen, Karsten, 2020. "Algorithmic Collusion: Supra-competitive Prices via Independent Algorithms," CEPR Discussion Papers 14372, C.E.P.R. Discussion Papers.
- Samuel N. Cohen & Tanut Treetanthiploet, 2019. "Gittins' theorem under uncertainty," Papers 1907.05689, arXiv.org, revised Jun 2021.
- Stephen Chick & Martin Forster & Paolo Pertile, 2017.
"A Bayesian decision theoretic model of sequential experimentation with delayed response,"
Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(5), pages 1439-1462, November.
- Stephen Chick & Martin Forster & Paolo Pertile, 2015. "A Bayesian Decision-Theoretic Model of Sequential Experimentation with Delayed Response," Discussion Papers 15/09, Department of Economics, University of York.
- Raluca M. Ursu & Qingliang Wang & Pradeep K. Chintagunta, 2020. "Search Duration," Marketing Science, INFORMS, vol. 39(5), pages 849-871, September.
- Philipp Afèche & Barış Ata, 2013. "Bayesian Dynamic Pricing in Queueing Systems with Unknown Delay Cost Characteristics," Manufacturing & Service Operations Management, INFORMS, vol. 15(2), pages 292-304, May.
- Sergei Morozov & Sudhanshu Mathur, 2012. "Massively Parallel Computation Using Graphics Processors with Application to Optimal Experimentation in Dynamic Control," Computational Economics, Springer;Society for Computational Economics, vol. 40(2), pages 151-182, August.
- Michael Jong Kim, 2020. "Variance Regularization in Sequential Bayesian Optimization," Mathematics of Operations Research, INFORMS, vol. 45(3), pages 966-992, August.
- Brenner, Thomas & Vriend, Nicolaas J., 2006.
"On the behavior of proposers in ultimatum games,"
Journal of Economic Behavior & Organization, Elsevier, vol. 61(4), pages 617-631, December.
- Thomas Brenner & Nicolaas J. Vriend, 2003. "On the Behavior of Proposers in Ultimatum Games," Working Papers 502, Queen Mary University of London, School of Economics and Finance.
- Thomas Brenner & Nicolaas J. Vriend, 2003. "On the Behavior of Proposers in Ultimatum Games," CEEL Working Papers 0304, Cognitive and Experimental Economics Laboratory, Department of Economics, University of Trento, Italia.
- T. Brenner & N.J. Vriend, 2003. "On the Behavior of Proposers in Ultimatum Games," Papers on Economics and Evolution 2003-08, Philipps University Marburg, Department of Geography.
- Konon, Alexander, 2016. "Career choice under uncertainty," VfS Annual Conference 2016 (Augsburg): Demographic Change 145583, Verein für Socialpolitik / German Economic Association.
- Janet M. Currie & W. Bentley MacLeod, 2018.
"Understanding Doctor Decision Making: The Case of Depression,"
NBER Working Papers
24955, National Bureau of Economic Research, Inc.
- Janet M. Currie & W. Bentley MacLeod, 2020. "Understanding Doctor Decision Making: The Case of Depression," Working Papers 2020-77, Princeton University. Economics Department..
- Ilya O. Ryzhov & Warren B. Powell & Peter I. Frazier, 2012. "The Knowledge Gradient Algorithm for a General Class of Online Learning Problems," Operations Research, INFORMS, vol. 60(1), pages 180-195, February.
- Kevin Glazebrook & Joern Meissner & Jochen Schurr, 2012. "How big should my store be? On the interplay between shelf-space, demand learning and assortment decisions," Working Papers MRG/0021, Department of Management Science, Lancaster University, revised Dec 2012.
- Eric M. Schwartz & Eric T. Bradlow & Peter S. Fader, 2017. "Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments," Marketing Science, INFORMS, vol. 36(4), pages 500-522, July.
- Hart E. Posen & Daniel A. Levinthal, 2012. "Chasing a Moving Target: Exploitation and Exploration in Dynamic Environments," Management Science, INFORMS, vol. 58(3), pages 587-601, March.
- Stephen E. Chick & Noah Gans, 2009. "Economic Analysis of Simulation Selection Problems," Management Science, INFORMS, vol. 55(3), pages 421-437, March.
- Brenner, Thomas & Vriend, Nicolaas J., 2006.
"On the behavior of proposers in ultimatum games,"
Journal of Economic Behavior & Organization, Elsevier, vol. 61(4), pages 617-631, December.
- Thomas Brenner & Nicolaas J. Vriend, 2003. "On the Behavior of Proposers in Ultimatum Games," Working Papers 502, Queen Mary University of London, School of Economics and Finance.
- Thomas Brenner & Nicolaas J. Vriend, 2003. "On the Behavior of Proposers in Ultimatum Games," Working Papers 502, Queen Mary University of London, School of Economics and Finance.
- T. Brenner & N.J. Vriend, 2003. "On the Behavior of Proposers in Ultimatum Games," Papers on Economics and Evolution 2003-08, Philipps University Marburg, Department of Geography.
- Thomas Brenner & Nicolaas J. Vriend, 2003. "On the Behavior of Proposers in Ultimatum Games," CEEL Working Papers 0304, Cognitive and Experimental Economics Laboratory, Department of Economics, University of Trento, Italia.
- Teymourian, Ehsan & Yang, Jian, 2025. "Simple fixes that accommodate switching costs in multi-armed bandits," European Journal of Operational Research, Elsevier, vol. 320(3), pages 616-627.
- Mingyu Joo & Michael L. Thompson & Greg M. Allenby6, 2019. "Optimal Product Design by Sequential Experiments in High Dimensions," Management Science, INFORMS, vol. 65(7), pages 3235-3254, July.
- Victor F. Araman & René A. Caldentey, 2022. "Diffusion Approximations for a Class of Sequential Experimentation Problems," Management Science, INFORMS, vol. 68(8), pages 5958-5979, August.
- Stephen E. Chick & Peter Frazier, 2012. "Sequential Sampling with Economics of Selection Procedures," Management Science, INFORMS, vol. 58(3), pages 550-569, March.
- Karsten T. Hansen & Kanishka Misra & Mallesh M. Pai, 2021. "Frontiers: Algorithmic Collusion: Supra-competitive Prices via," Marketing Science, INFORMS, vol. 40(1), pages 1-12, January.
- Morozov, Sergei & Mathur, Sudhanshu, 2009. "Massively parallel computation using graphics processors with application to optimal experimentation in dynamic control," MPRA Paper 30298, University Library of Munich, Germany, revised 04 Apr 2011.
- Felipe Caro & Jérémie Gallien, 2007. "Dynamic Assortment with Demand Learning for Seasonal Consumer Goods," Management Science, INFORMS, vol. 53(2), pages 276-292, February.
- Kanishka Misra & Eric M. Schwartz & Jacob Abernethy, 2019. "Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments," Marketing Science, INFORMS, vol. 38(2), pages 226-252, March.