IDEAS home Printed from https://ideas.repec.org/a/inm/ormoor/v50y2025i1p1-39.html
   My bibliography  Save this article

The Online Saddle Point Problem and Online Convex Optimization with Knapsacks

Author

Listed:
  • Adrian Rivera Cardoso

    (LinkedIn Corporation, Sunnyvale, California 94085)

  • He Wang

    (School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332)

  • Huan Xu

    (Alibaba Inc., Hangzhou 311121, China)

Abstract

We study the online saddle point problem, an online learning problem where at each iteration, a pair of actions needs to be chosen without knowledge of the current and future (convex-concave) payoff functions. The objective is to minimize the gap between the cumulative payoffs and the saddle point value of the aggregate payoff function, which we measure using a metric called saddle point regret (SP-Regret). The problem generalizes the online convex optimization framework, but here, we must ensure that both players incur cumulative payoffs close to that of the Nash equilibrium of the sum of the games. We propose an algorithm that achieves SP-Regret proportional to ln ( T ) T in the general case, and log ( T ) SP-Regret for the strongly convex-concave case. We also consider the special case where the payoff functions are bilinear and the decision sets are the probability simplex. In this setting, we are able to design algorithms that reduce the bounds on SP-Regret from a linear dependence in the dimension of the problem to a logarithmic one. We also study the problem under bandit feedback and provide an algorithm that achieves sublinear SP-Regret. We then consider an online convex optimization with knapsacks problem motivated by a wide variety of applications, such as dynamic pricing, auctions, and crowdsourcing. We relate this problem to the online saddle point problem and establish O ( T ) regret using a primal-dual algorithm.

Suggested Citation

  • Adrian Rivera Cardoso & He Wang & Huan Xu, 2025. "The Online Saddle Point Problem and Online Convex Optimization with Knapsacks," Mathematics of Operations Research, INFORMS, vol. 50(1), pages 1-39, February.
  • Handle: RePEc:inm:ormoor:v:50:y:2025:i:1:p:1-39
    DOI: 10.1287/moor.2018.0330
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/moor.2018.0330
    Download Restriction: no

    File URL: https://libkey.io/10.1287/moor.2018.0330?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormoor:v:50:y:2025:i:1:p:1-39. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.