IDEAS home Printed from https://ideas.repec.org/a/spr/aistmt/v65y2013i5p993-1006.html
   My bibliography  Save this article

One-armed bandit process with a covariate

Author

Listed:
  • You Liang
  • Xikui Wang
  • Yanqing Yi

Abstract

We generalize the bandit process with a covariate introduced by Woodroofe in several significant directions: a linear regression model characterizing the unknown arm, an unknown variance for regression residuals and general discounting sequence for a non-stationary model. With the Bayesian regression approach, we assume a normal-gamma conjugate prior distribution of the unknown parameters. It is shown that the optimal strategy is determined by a sequence of index values which are monotonic and determined by the observed value of the covariate and updated posterior distributions. We further show that the myopic strategy is not optimal in general. Such structural properties help to understand the tradeoff between information gathering and immediate expected payoff and may provide certain insight for covariate adjusted response adaptive design of clinical trials. Copyright The Institute of Statistical Mathematics, Tokyo 2013

Suggested Citation

  • You Liang & Xikui Wang & Yanqing Yi, 2013. "One-armed bandit process with a covariate," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(5), pages 993-1006, October.
  • Handle: RePEc:spr:aistmt:v:65:y:2013:i:5:p:993-1006
    DOI: 10.1007/s10463-013-0401-5
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10463-013-0401-5
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10463-013-0401-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Xikui Wang & Mikelis G. Bickis, 2003. "One-armed bandit models with continuous and delayed responses," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 58(2), pages 209-219, November.
    2. Xikui Wang & Yanqing Yi, 2009. "An optimal investment and consumption model with stochastic returns," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 25(1), pages 45-55, January.
    3. Xikui Wang & Yan Wang, 2010. "Optimal investment and consumption with stochastic dividends," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 26(6), pages 792-808, November.
    4. Wang, Xikui, 2000. "A bandit process with delayed responses," Statistics & Probability Letters, Elsevier, vol. 48(3), pages 303-307, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Williamson, S. Faye & Jacko, Peter & Jaki, Thomas, 2022. "Generalisations of a Bayesian decision-theoretic randomisation procedure and the impact of delayed responses," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    2. He, Yong & Zhou, Xia & Chen, Peimin & Wang, Xiaoyang, 2022. "An analytical solution for the robust investment-reinsurance strategy with general utilities," The North American Journal of Economics and Finance, Elsevier, vol. 63(C).
    3. Huiling Wu, 2016. "Optimal Investment-Consumption Strategy under Inflation in a Markovian Regime-Switching Market," Discrete Dynamics in Nature and Society, Hindawi, vol. 2016, pages 1-17, July.
    4. Wang, Xikui, 2002. "Asymptotic properties of bandit processes with geometric responses," Statistics & Probability Letters, Elsevier, vol. 60(2), pages 211-217, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aistmt:v:65:y:2013:i:5:p:993-1006. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.