IDEAS home Printed from https://ideas.repec.org/a/inm/orijds/v3y2024i2p145-161.html
   My bibliography  Save this article

Thompson Sampling-Based Partially Observable Online Change Detection for Exponential Families

Author

Listed:
  • Jie Guo

    (Department of Industrial Engineering, Tsinghua University, Beijing 100084, China)

  • Hao Yan

    (School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, Tempe, Arizona 85287)

  • Chen Zhang

    (Department of Industrial Engineering, Tsinghua University, Beijing 100084, China)

Abstract

This paper proposes a holistic sequential change detection framework for partially observable high-dimensional data streams with exponential-family distributions. The framework first proposes a general composite decomposition for exponential-family distributed data by projecting its natural parameter onto normal bases and abnormal bases, which enables efficient inference for sparse changes. Then, the inference results are used for detection scheme construction, and different types of test statistics can be compacted in our framework. Last, by further designing the test statistic as the reward function in the combinatorial multi-armed bandit problem, a Thompson sampling-based sensor allocation strategy is constructed to select the most anomalous variables. Theoretical properties of the detection framework are discussed. Finally, examples of Gaussian, Poisson, and binomial distributed data streams are given in numerical studies and case studies to evaluate the performance of our proposed method.

Suggested Citation

  • Jie Guo & Hao Yan & Chen Zhang, 2024. "Thompson Sampling-Based Partially Observable Online Change Detection for Exponential Families," INFORMS Joural on Data Science, INFORMS, vol. 3(2), pages 145-161, October.
  • Handle: RePEc:inm:orijds:v:3:y:2024:i:2:p:145-161
    DOI: 10.1287/ijds.2022.00011
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijds.2022.00011
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijds.2022.00011?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yixin Wang & David M. Blei, 2019. "Frequentist Consistency of Variational Bayes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(527), pages 1147-1161, July.
    2. Linmiao Zhang & Kaibo Wang & Nan Chen, 2016. "Monitoring wafers’ geometric quality using an additive Gaussian process model," IISE Transactions, Taylor & Francis Journals, vol. 48(1), pages 1-15, January.
    3. James W. Hardin & Joseph W. Hilbe, 2012. "Generalized Linear Models and Extensions, 3rd Edition," Stata Press books, StataCorp LP, edition 3, number glmext.
    4. Zou, Changliang & Qiu, Peihua, 2009. "Multivariate Statistical Process Control Using LASSO," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1586-1596.
    5. Chen Zhang & Hao Yan & Seungho Lee & Jianjun Shi, 2018. "Weakly correlated profile monitoring based on sparse multi-channel functional principal component analysis," IISE Transactions, Taylor & Francis Journals, vol. 50(10), pages 878-891, October.
    6. Y. Mei, 2010. "Efficient scalable schemes for monitoring a large number of data streams," Biometrika, Biometrika Trust, vol. 97(2), pages 419-433.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bornmann, Lutz & Leydesdorff, Loet & Wang, Jian, 2014. "How to improve the prediction based on citation impact percentiles for years shortly after the publication date?," Journal of Informetrics, Elsevier, vol. 8(1), pages 175-180.
    2. Salmon, Claire & Tanguy, Jeremy, 2016. "Rural Electrification and Household Labor Supply: Evidence from Nigeria," World Development, Elsevier, vol. 82(C), pages 48-68.
    3. Stanislav Kolenikov, 2001. "Review of Stata 7," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 16(5), pages 637-646.
    4. Lianjie Shu & Jinyu Fan, 2018. "A distribution‐free control chart for monitoring high‐dimensional processes based on interpoint distances," Naval Research Logistics (NRL), John Wiley & Sons, vol. 65(4), pages 317-330, June.
    5. Carina Steckenleiter & Michael Lechner & Tim Pawlowski & Ute Schüttoff, 2023. "Do local expenditures on sports facilities affect sports participation?," Economic Inquiry, Western Economic Association International, vol. 61(4), pages 1103-1128, October.
    6. Molly C. Klanderman & Kathryn B. Newhart & Tzahi Y. Cath & Amanda S. Hering, 2020. "Fault isolation for a complex decentralized waste water treatment facility," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(4), pages 931-951, August.
    7. Gael M. Martin & David T. Frazier & Christian P. Robert, 2020. "Computing Bayes: Bayesian Computation from 1763 to the 21st Century," Monash Econometrics and Business Statistics Working Papers 14/20, Monash University, Department of Econometrics and Business Statistics.
    8. Steckenleiter, Carina & Lechner, Michael & Pawlowski, Tim & Schüttoff, Ute, 2019. "Do local public expenditures on sports facilities affect sports participation in Germany?," Economics Working Paper Series 1905, University of St. Gallen, School of Economics and Political Science.
    9. Gary Koop & Dimitris Korobilis, 2023. "Bayesian Dynamic Variable Selection In High Dimensions," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 64(3), pages 1047-1074, August.
    10. Ahmad, M. Rauf & Ahmed, S. Ejaz, 2021. "On the distribution of the T2 statistic, used in statistical process monitoring, for high-dimensional data," Statistics & Probability Letters, Elsevier, vol. 168(C).
    11. Aysit TANSEL & H. Mehmet TASCI, 2001. "Determinants of Unemployment Duration for Men and Women in Turkey," Middle East and North Africa 330400055, EcoMod.
    12. Jay Bartroff & Jinlin Song, 2016. "A Rejection Principle for Sequential Tests of Multiple Hypotheses Controlling Familywise Error Rates," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 3-19, March.
    13. Jové Llopis, Elisenda & Segarra Blasco, Agustí, 1958-, 2015. "Innovation success: What is the role of innovation strategies?," Working Papers 2072/260961, Universitat Rovira i Virgili, Department of Economics.
    14. Yudong Chen & Tengyao Wang & Richard J. Samworth, 2022. "High‐dimensional, multiscale online changepoint detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(1), pages 234-266, February.
    15. Lutz Bornmann, 2015. "Interrater reliability and convergent validity of F1000Prime peer review," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 66(12), pages 2415-2426, December.
    16. Chen, Yudong & Wang, Tengyao & Samworth, Richard J., 2022. "High-dimensional, multiscale online changepoint detection," LSE Research Online Documents on Economics 113665, London School of Economics and Political Science, LSE Library.
    17. Fallahdizcheh, Amirhossein & Wang, Chao, 2022. "Transfer learning of degradation modeling and prognosis based on multivariate functional analysis with heterogeneous sampling rates," Reliability Engineering and System Safety, Elsevier, vol. 223(C).
    18. Cui, Junfeng & Wang, Guanghui & Zou, Changliang & Wang, Zhaojun, 2023. "Change-point testing for parallel data sets with FDR control," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
    19. Gael M. Martin & David T. Frazier & Christian P. Robert, 2021. "Approximating Bayes in the 21st Century," Monash Econometrics and Business Statistics Working Papers 24/21, Monash University, Department of Econometrics and Business Statistics.
    20. Alexandra Spicer & Olena Stavrunova & Susan Thorp, 2016. "How Portfolios Evolve after Retirement: Evidence from Australia," The Economic Record, The Economic Society of Australia, vol. 92(297), pages 241-267, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijds:v:3:y:2024:i:2:p:145-161. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.