
Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes

Author

Listed:
  • Yichun Hu

    (Cornell University, New York, New York 10044)

  • Nathan Kallus

    (Cornell University, New York, New York 10044)

  • Xiaojie Mao

    (Tsinghua University, 100084 Beijing, China)

Abstract

We study a nonparametric contextual bandit problem in which the expected reward functions belong to a Hölder class with smoothness parameter β. We show how this interpolates between two extremes that were previously studied in isolation: nondifferentiable bandits (β at most 1), with which rate-optimal regret is achieved by running separate noncontextual bandits in different context regions, and parametric-response bandits (infinite β), with which rate-optimal regret can be achieved with minimal or no exploration because of infinite extrapolatability. We develop a novel algorithm that carefully adjusts to all smoothness settings, and we prove its regret is rate-optimal by establishing matching upper and lower bounds, recovering the existing results at the two extremes. In this sense, our work bridges the gap between the existing literature on parametric and nondifferentiable contextual bandit problems and between bandit algorithms that exclusively use global or local information, shedding light on the crucial interplay of complexity and regret in contextual bandits.
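As a rough illustration of the nondifferentiable extreme described in the abstract (separate noncontextual bandits run in different context regions), the sketch below partitions a one-dimensional context space into equal-width bins and runs a standard UCB1 bandit independently in each bin. This is not the algorithm developed in the paper. The class name `BinnedUCB`, the function `simulate`, the two linear reward functions, the noise level, and the bin count are all hypothetical choices made only for this example.

```python
import math
import random


class BinnedUCB:
    """UCB1 run independently in each bin of a 1-D context space [0, 1].

    Illustrative only: a purely local method that shares no information
    across bins. It is not the paper's algorithm.
    """

    def __init__(self, n_bins, n_arms):
        self.n_bins = n_bins
        self.n_arms = n_arms
        # Per-bin, per-arm pull counts and running reward sums.
        self.counts = [[0] * n_arms for _ in range(n_bins)]
        self.sums = [[0.0] * n_arms for _ in range(n_bins)]

    def _bin(self, x):
        # Map a context in [0, 1] to a bin index; x = 1.0 goes to the last bin.
        return min(int(x * self.n_bins), self.n_bins - 1)

    def select(self, x):
        b = self._bin(x)
        counts = self.counts[b]
        # Pull every arm once in this bin before using confidence bounds.
        for a in range(self.n_arms):
            if counts[a] == 0:
                return a
        total = sum(counts)

        def ucb(a):
            mean = self.sums[b][a] / counts[a]
            return mean + math.sqrt(2.0 * math.log(total) / counts[a])

        return max(range(self.n_arms), key=ucb)

    def update(self, x, arm, reward):
        b = self._bin(x)
        self.counts[b][arm] += 1
        self.sums[b][arm] += reward


def simulate(horizon=2000, n_bins=8, seed=0):
    """Run BinnedUCB on a made-up two-armed problem and return (policy, regret)."""
    rng = random.Random(seed)
    # Hypothetical expected rewards: arm 0 is better for x < 0.5, arm 1 otherwise.
    means = [
        lambda x: 0.5 + 0.4 * (0.5 - x),
        lambda x: 0.5 + 0.4 * (x - 0.5),
    ]
    policy = BinnedUCB(n_bins, n_arms=2)
    regret = 0.0
    for _ in range(horizon):
        x = rng.random()                                 # context drawn uniformly
        arm = policy.select(x)
        reward = means[arm](x) + rng.gauss(0.0, 0.1)     # noisy observed reward
        policy.update(x, arm, reward)
        # Expected regret: gap between the best arm and the chosen arm at x.
        regret += max(m(x) for m in means) - means[arm](x)
    return policy, regret
```

Calling `simulate()` returns the fitted policy and its cumulative expected regret on this toy problem. Because each bin learns only from its own observations, the sketch uses local information exclusively, which is the behavior the abstract contrasts with parametric methods that extrapolate globally.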

Suggested Citation

  • Yichun Hu & Nathan Kallus & Xiaojie Mao, 2022. "Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes," Operations Research, INFORMS, vol. 70(6), pages 3261-3281, November.
  • Handle: RePEc:inm:oropre:v:70:y:2022:i:6:p:3261-3281
    DOI: 10.1287/opre.2021.2237

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.2021.2237
    Download Restriction: no

