IDEAS home Printed from https://ideas.repec.org/a/inm/oropre/v72y2024i1p373-388.html
   My bibliography  Save this article

An Approximate Dynamic Programming Approach to Repeated Games with Vector Losses

Author

Listed:
  • Vijay Kamble

    (Department of Information and Decision Sciences, University of Illinois, Chicago, Illinois 60607)

  • Patrick Loiseau

    (University Grenoble Alpes, INRIA, Centre National de la Recherche Scientifique, Institut Polytechnique de Grenoble, Laboratoire d’Informatique de Grenoble, 38058 Grenoble, France; Max-Planck Institute for Software Systems, D-66123 Saarbrücken, Germany)

  • Jean Walrand

    (Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California 94720)

Abstract

We describe an approximate dynamic programming (ADP) approach to compute approximations of the optimal strategies and of the minimal losses that can be guaranteed in discounted repeated games with vector-valued losses. Among other applications, such vector-valued games prominently arise in the analysis of worst-case regret in repeated decision making in unknown environments, also known as the adversarial online learning framework. At the core of our approach is a characterization of the lower Pareto frontier of the set of expected losses that a player can guarantee in these games as the unique fixed point of a set-valued dynamic programming operator. When applied to the problem of worst-case regret minimization with discounted losses, our approach yields algorithms that achieve markedly improved performance bounds compared with off-the-shelf online learning algorithms like Hedge. These results thus suggest the significant potential of ADP-based approaches in adversarial online learning.

Suggested Citation

  • Vijay Kamble & Patrick Loiseau & Jean Walrand, 2024. "An Approximate Dynamic Programming Approach to Repeated Games with Vector Losses," Operations Research, INFORMS, vol. 72(1), pages 373-388, January.
  • Handle: RePEc:inm:oropre:v:72:y:2024:i:1:p:373-388
    DOI: 10.1287/opre.2022.2334
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.2022.2334
    Download Restriction: no

    File URL: https://libkey.io/10.1287/opre.2022.2334?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:72:y:2024:i:1:p:373-388. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.