IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v8y2020i9p1623-d416078.html
   My bibliography  Save this article

An Adversarial Search Method Based on an Iterative Optimal Strategy

Author

Listed:
  • Chanjuan Liu

    (School of Computer Science and Technology, Dalian University of Technology; Dalian 116024, China)

  • Junming Yan

    (CNGC North Automatic Control Technology Institute, Taiyuan 030000, China)

  • Yuanye Ma

    (School of Computer Science and Technology, Dalian University of Technology; Dalian 116024, China)

  • Tianhao Zhao

    (School of Computer Science and Technology, Dalian University of Technology; Dalian 116024, China)

  • Qiang Zhang

    (School of Computer Science and Technology, Dalian University of Technology; Dalian 116024, China)

  • Xiaopeng Wei

    (School of Computer Science and Technology, Dalian University of Technology; Dalian 116024, China)

Abstract

A deeper game-tree search can yield a higher decision quality in a heuristic minimax algorithm. However, exceptions can occur as a result of pathological nodes, which are considered to exist in all game trees and can cause a deeper game-tree search, resulting in worse play. To reduce the impact of pathological nodes on the search quality, we propose an iterative optimal minimax (IOM) algorithm by optimizing the backup rule of the classic minimax algorithm. The main idea is that calculating the state values of the intermediate nodes involves not only the static evaluation function involved but also a search into the future, where the latter is given a higher weight. We experimentally demonstrated that the proposed IOM algorithm improved game-playing performance compared to the existing algorithms.

Suggested Citation

  • Chanjuan Liu & Junming Yan & Yuanye Ma & Tianhao Zhao & Qiang Zhang & Xiaopeng Wei, 2020. "An Adversarial Search Method Based on an Iterative Optimal Strategy," Mathematics, MDPI, vol. 8(9), pages 1-16, September.
  • Handle: RePEc:gam:jmathe:v:8:y:2020:i:9:p:1623-:d:416078
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/8/9/1623/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/8/9/1623/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. John C. Harsanyi & Reinhard Selten, 1988. "A General Theory of Equilibrium Selection in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262582384, April.
    2. David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nathalie Greenan & Marc-Arthur Diaye & Patricia Crifo, 2004. "Pourquoi les entreprises évaluent-elles individuellement leurs salariés ?," Économie et Prévision, Programme National Persée, vol. 164(3), pages 27-55.
    2. Bosch-Domènech, Antoni & Vriend, Nicolaas J., 2013. "On the role of non-equilibrium focal points as coordination devices," Journal of Economic Behavior & Organization, Elsevier, vol. 94(C), pages 52-67.
    3. van Damme, Eric & Hurkens, Sjaak, 1999. "Endogenous Stackelberg Leadership," Games and Economic Behavior, Elsevier, vol. 28(1), pages 105-129, July.
    4. Dennis L. Gärtner, 2022. "Corporate Leniency in a Dynamic World: The Preemptive Push of an Uncertain Future," Journal of Industrial Economics, Wiley Blackwell, vol. 70(1), pages 119-146, March.
    5. Antonio Cabrales & Rosemarie Nagel & Roc Armenter, 2007. "Equilibrium selection through incomplete information in coordination games: an experimental study," Experimental Economics, Springer;Economic Science Association, vol. 10(3), pages 221-234, September.
    6. Paul Pezanis-Christou & Abdolkarim Sadrieh, 2003. "Elicited bid functions in (a)symmetric first-price auctions," Working Papers 85, Barcelona School of Economics.
    7. Gabriele Camera & Cary Deck & David Porter, 2020. "Do economic inequalities affect long-run cooperation and prosperity?," Experimental Economics, Springer;Economic Science Association, vol. 23(1), pages 53-83, March.
    8. Tian Zhu & Merry H. Ma, 2022. "Deriving the Optimal Strategy for the Two Dice Pig Game via Reinforcement Learning," Stats, MDPI, vol. 5(3), pages 1-14, August.
    9. Simai He & Jay Sethuraman & Xuan Wang & Jiawei Zhang, 2017. "A NonCooperative Approach to Cost Allocation in Joint Replenishment," Operations Research, INFORMS, vol. 65(6), pages 1562-1573, December.
    10. Maarten C.W. Janssen, 1997. "Focal Points," Tinbergen Institute Discussion Papers 97-091/1, Tinbergen Institute.
    11. Xiaoyue Li & John M. Mulvey, 2023. "Optimal Portfolio Execution in a Regime-switching Market with Non-linear Impact Costs: Combining Dynamic Program and Neural Network," Papers 2306.08809, arXiv.org.
    12. Michael Kosfeld, 2002. "Stochastic strategy adjustment in coordination games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 20(2), pages 321-339.
    13. Sanjeev Goyal & Fernando Vega-Redondo, 2000. "Learning, Network Formation and Coordination," Econometric Society World Congress 2000 Contributed Papers 0113, Econometric Society.
    14. Pedro Afonso Fernandes, 2024. "Forecasting with Neuro-Dynamic Programming," Papers 2404.03737, arXiv.org.
    15. Andrea Isoni & Robert Sugden & Jiwei Zheng, 2018. "The Pizza Night Game: Efficiency, Conflict and Inequality in Tacit Bargaining Games with Focal Points," Working Paper series, University of East Anglia, Centre for Behavioural and Experimental Social Science (CBESS) 18-01, School of Economics, University of East Anglia, Norwich, UK..
    16. Ennis, Huberto M. & Keister, Todd, 2005. "Government policy and the probability of coordination failures," European Economic Review, Elsevier, vol. 49(4), pages 939-973, May.
    17. Smith, Lisa C. & Chavas, Jean-Paul, 1999. "Supply response of West African agricultural households," FCND discussion papers 69, International Food Policy Research Institute (IFPRI).
    18. Fletschner, Diana K., 2000. "Enhancing Rural Women'S Access To Capital: Why It Is Important And How It Can Be Done. The Case Of Colombia," Staff Papers 12640, University of Wisconsin-Madison, Department of Agricultural and Applied Economics.
    19. Nathan Companez & Aldeida Aleti, 2016. "Can Monte-Carlo Tree Search learn to sacrifice?," Journal of Heuristics, Springer, vol. 22(6), pages 783-813, December.
    20. Pal, Rupayan, 2010. "Technology adoption in a differentiated duopoly: Cournot versus Bertrand," Research in Economics, Elsevier, vol. 64(2), pages 128-136, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:8:y:2020:i:9:p:1623-:d:416078. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.