Printed from https://ideas.repec.org/a/inm/orijoc/v30y2018i4p750-767.html

A Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief Model

Authors
  • Yan Li

    (Department of Operations Research and Financial Engineering, Princeton University, Princeton, New Jersey 08544)

  • Kristofer G. Reyes

    (Department of Materials Design and Innovation, University at Buffalo, Buffalo, New York 14260)

  • Jorge Vazquez-Anderson

    (Department of Chemical Engineering, University of Texas at Austin, Austin, Texas 78712)

  • Yingfei Wang

    (Michael G. Foster School of Business, University of Washington, Seattle, Washington 98195)

  • Lydia M. Contreras

    (Department of Chemical Engineering, University of Texas at Austin, Austin, Texas 78712)

  • Warren B. Powell

    (Department of Operations Research and Financial Engineering, Princeton University, Princeton, New Jersey 08544)

Abstract

We present a sparse knowledge gradient (SpKG) algorithm for adaptively selecting the targeted regions within a large RNA molecule to identify which regions are most amenable to interactions with other molecules. Experimentally, such regions can be inferred from fluorescence measurements obtained by binding a complementary probe with fluorescence markers to the targeted regions. We fit a regularized, sparse linear model with a log link function in which the marginal contribution of each nucleotide to the thermodynamic cycle is purely additive. The SpKG algorithm uniquely combines the Bayesian ranking and selection problem with the frequentist ℓ1-regularized regression approach, Lasso. We use this algorithm to identify the sparsity pattern of the linear model and to decide sequentially which regions are best to test before the experimental budget is exhausted. We also develop two new algorithms: batch SpKG and batch SpKG-LM. The first generates multiple suggestions sequentially so that experiments can be run in parallel. The second dynamically adds new alternatives, in the form of types of probes, which are created by inserting, deleting, or mutating nucleotides within existing probes. In simulation, we demonstrate these algorithms on the Tetrahymena Group I intron (a midsize RNA molecule), showing that they efficiently learn the correct sparsity pattern, identify the most accessible region, and outperform several other policies.
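
As a rough illustration of the knowledge gradient idea the abstract builds on, the sketch below computes the textbook knowledge gradient for independent normal beliefs with known measurement noise. It is not the SpKG algorithm from the article: there is no sparse linear belief model and no Lasso step, and the function names (`knowledge_gradient`, `update_belief`) and the example numbers are assumptions made here for illustration only.

```python
import math


def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def knowledge_gradient(means, variances, noise_variance):
    """Knowledge gradient value of measuring each alternative once.

    Assumes independent normal beliefs (hypothetical simplification),
    at least two alternatives, and a known measurement noise variance.
    """
    values = []
    for x, (mu, var) in enumerate(zip(means, variances)):
        # Standard deviation of the change in the posterior mean of x
        # that a single additional measurement would cause.
        sigma_tilde = var / math.sqrt(var + noise_variance)
        if sigma_tilde == 0.0:
            values.append(0.0)  # nothing left to learn about x
            continue
        best_other = max(m for i, m in enumerate(means) if i != x)
        zeta = -abs(mu - best_other) / sigma_tilde
        values.append(sigma_tilde * (zeta * normal_cdf(zeta) + normal_pdf(zeta)))
    return values


def update_belief(mu, var, observation, noise_variance):
    """Conjugate normal update of one alternative after one observation."""
    precision = 1.0 / var + 1.0 / noise_variance
    new_mu = (mu / var + observation / noise_variance) / precision
    return new_mu, 1.0 / precision


if __name__ == "__main__":
    # Hypothetical prior beliefs about three candidate regions.
    means = [1.0, 1.2, 0.5]
    variances = [4.0, 0.25, 1.0]
    kg = knowledge_gradient(means, variances, noise_variance=1.0)
    choice = max(range(len(kg)), key=kg.__getitem__)
    print("KG values:", kg, "-> measure alternative", choice)
```

The policy measures the alternative with the largest value, updates that belief with `update_belief`, and repeats until the budget is spent. Under this simplification, an uncertain alternative can be preferred over one with a slightly higher but well-known mean.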

Suggested Citation

  • Yan Li & Kristofer G. Reyes & Jorge Vazquez-Anderson & Yingfei Wang & Lydia M. Contreras & Warren B. Powell, 2018. "A Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief Model," INFORMS Journal on Computing, INFORMS, vol. 30(4), pages 750-767, November.
  • Handle: RePEc:inm:orijoc:v:30:y:2018:i:4:p:750-767
    DOI: 10.1287/ijoc.2017.0803

    References listed on IDEAS

    1. Marc C. Kennedy & Anthony O'Hagan, 2001. "Bayesian calibration of computer models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(3), pages 425-464.
    2. Diana M. Negoescu & Peter I. Frazier & Warren B. Powell, 2011. "The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery," INFORMS Journal on Computing, INFORMS, vol. 23(3), pages 346-363, August.
    3. Yiliang Ding & Yin Tang & Chun Kit Kwok & Yu Zhang & Philip C. Bevilacqua & Sarah M. Assmann, 2014. "In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features," Nature, Nature, vol. 505(7485), pages 696-700, January.
    4. Michael Kertesz & Yue Wan & Elad Mazor & John L. Rinn & Robert C. Nutter & Howard Y. Chang & Eran Segal, 2010. "Genome-wide measurement of RNA secondary structure in yeast," Nature, Nature, vol. 467(7311), pages 103-107, September.
    5. Peter Frazier & Warren Powell & Savas Dayanik, 2009. "The Knowledge-Gradient Policy for Correlated Normal Beliefs," INFORMS Journal on Computing, INFORMS, vol. 21(4), pages 599-613, November.
    6. Ilya O. Ryzhov & Warren B. Powell & Peter I. Frazier, 2012. "The Knowledge Gradient Algorithm for a General Class of Online Learning Problems," Operations Research, INFORMS, vol. 60(1), pages 180-195, February.

    Citations

    Cited by:

    1. Donghun Lee, 2022. "Knowledge Gradient: Capturing Value of Information in Iterative Decisions under Uncertainty," Mathematics, MDPI, vol. 10(23), pages 1-20, November.
    2. Shan Jiang & Shu-Cherng Fang & Qingwei Jin, 2021. "Sparse Solutions by a Quadratically Constrained ℓq (0 < q < 1) Minimization Model," INFORMS Journal on Computing, INFORMS, vol. 33(2), pages 511-530, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Warren B. Powell, 2016. "Perspectives of approximate dynamic programming," Annals of Operations Research, Springer, vol. 241(1), pages 319-356, June.
    2. Ilya O. Ryzhov & Martijn R. K. Mes & Warren B. Powell & Gerald van den Berg, 2019. "Bayesian Exploration for Approximate Dynamic Programming," Operations Research, INFORMS, vol. 67(1), pages 198-214, January.
    3. Emre Barut & Warren Powell, 2014. "Optimal learning for sequential sampling with non-parametric beliefs," Journal of Global Optimization, Springer, vol. 58(3), pages 517-543, March.
    4. Bin Han & Ilya O. Ryzhov & Boris Defourny, 2016. "Optimal Learning in Linear Regression with Combinatorial Feature Selection," INFORMS Journal on Computing, INFORMS, vol. 28(4), pages 721-735, November.
    5. Jing Xie & Peter I. Frazier, 2013. "Sequential Bayes-Optimal Policies for Multiple Comparisons with a Known Standard," Operations Research, INFORMS, vol. 61(5), pages 1174-1189, October.
    6. Gongwang Yu & Yao Liu & Zizhang Li & Shuyun Deng & Zhuoxing Wu & Xiaoyu Zhang & Wenbo Chen & Junnan Yang & Xiaoshu Chen & Jian-Rong Yang, 2023. "Genome-wide probing of eukaryotic nascent RNA structure elucidates cotranscriptional folding and its antimutagenic effect," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    7. Bolong Cheng & Arta Jamshidi & Warren Powell, 2015. "Optimal learning with a local parametric belief model," Journal of Global Optimization, Springer, vol. 63(2), pages 401-425, October.
    8. Shervin Shahrokhi Tehrani & Andrew T. Ching, 2024. "A Heuristic Approach to Explore: The Value of Perfect Information," Management Science, INFORMS, vol. 70(5), pages 3200-3224, May.
    9. Donghun Lee, 2022. "Knowledge Gradient: Capturing Value of Information in Iterative Decisions under Uncertainty," Mathematics, MDPI, vol. 10(23), pages 1-20, November.
    10. Jeong, Cheoljoon & Byon, Eunshin, 2024. "Calibration of building energy computer models via bias-corrected iteratively reweighted least squares method," Applied Energy, Elsevier, vol. 360(C).
    11. Yixiao Huang & Lei Zhao & Warren B. Powell & Yue Tong & Ilya O. Ryzhov, 2019. "Optimal Learning for Urban Delivery Fleet Allocation," Transportation Science, INFORMS, vol. 53(3), pages 623-641, May.
    12. Matthias Katzfuss & Joseph Guinness & Wenlong Gong & Daniel Zilber, 2020. "Vecchia Approximations of Gaussian-Process Predictions," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(3), pages 383-414, September.
    13. Mark Semelhago & Barry L. Nelson & Eunhye Song & Andreas Wächter, 2021. "Rapid Discrete Optimization via Simulation with Gaussian Markov Random Fields," INFORMS Journal on Computing, INFORMS, vol. 33(3), pages 915-930, July.
    14. Hao Wu & Michael Browne, 2015. "Random Model Discrepancy: Interpretations and Technicalities (A Rejoinder)," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 619-624, September.
    15. Xiaoyu Xiong & Benjamin D. Youngman & Theodoros Economou, 2021. "Data fusion with Gaussian processes for estimation of environmental hazard events," Environmetrics, John Wiley & Sons, Ltd., vol. 32(3), May.
    16. Petropoulos, G. & Wooster, M.J. & Carlson, T.N. & Kennedy, M.C. & Scholze, M., 2009. "A global Bayesian sensitivity analysis of the 1d SimSphere soil–vegetation–atmospheric transfer (SVAT) model using Gaussian model emulation," Ecological Modelling, Elsevier, vol. 220(19), pages 2427-2440.
    17. Drignei, Dorin, 2011. "A general statistical model for computer experiments with time series output," Reliability Engineering and System Safety, Elsevier, vol. 96(4), pages 460-467.
    18. Yuan, Jun & Ng, Szu Hui, 2013. "A sequential approach for stochastic computer model calibration and prediction," Reliability Engineering and System Safety, Elsevier, vol. 111(C), pages 273-286.
    19. Seokhyun Chung & Raed Al Kontar & Zhenke Wu, 2022. "Weakly Supervised Multi-output Regression via Correlated Gaussian Processes," INFORMS Journal on Data Science, INFORMS, vol. 1(2), pages 115-137, October.
    20. Edward Boone & Jan Hannig & Ryad Ghanam & Sujit Ghosh & Fabrizio Ruggeri & Serge Prudhomme, 2022. "Model Validation of a Single Degree-of-Freedom Oscillator: A Case Study," Stats, MDPI, vol. 5(4), pages 1-17, November.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.