IDEAS home Printed from https://ideas.repec.org/a/spr/metcap/v24y2022i3d10.1007_s11009-021-09856-8.html
   My bibliography  Save this article

Duality Between the Local Score of One Sequence and Constrained Hidden Markov Model

Author

Listed:
  • Sabine Mercier

    (Université de Toulouse 2 Jean Jaurès)

  • Grégory Nuel

    (Laboratoire de Probabilités Statistique Modélisation (LPSM), CNRS 8001Sorbonne Université)

Abstract

We are interested here in a theoretical and practical approach for detecting atypical segments in a multi-state sequence. We prove in this article that the segmentation approach through an underlying constrained Hidden Markov Model (HMM) is equivalent to using the maximum scoring subsequence (also called local score), when the latter uses an appropriate rescaled scoring function. This equivalence allows results from both HMM or local score to be transposed into each other. We propose an adaptation of the standard forward-backward algorithm which provides exact estimates of posterior probabilities in a linear time. Additionally it can provide posterior probabilities on the segment length and starting/ending indexes. We explain how this equivalence allows one to manage ambiguous or uncertain sequence letters and to construct relevant scoring functions. We illustrate our approach by considering the TM-tendency scoring function.

Suggested Citation

  • Sabine Mercier & Grégory Nuel, 2022. "Duality Between the Local Score of One Sequence and Constrained Hidden Markov Model," Methodology and Computing in Applied Probability, Springer, vol. 24(3), pages 1411-1438, September.
  • Handle: RePEc:spr:metcap:v:24:y:2022:i:3:d:10.1007_s11009-021-09856-8
    DOI: 10.1007/s11009-021-09856-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11009-021-09856-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11009-021-09856-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bo Zhao & Joseph Glaz, 2017. "Scan statistics for detecting a local change in variance for two-dimensional normal data," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 46(11), pages 5517-5530, June.
    2. Jie Chen & Joseph Glaz, 2016. "Scan statistics for monitoring data modeled by a negative binomial distribution," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 45(6), pages 1632-1642, March.
    3. Claudie Hassenforder & Sabine Mercier, 2007. "Exact Distribution of the Local Score for Markovian Sequences," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 59(4), pages 741-755, December.
    4. Daudin, Jean-Jacques & Etienne, Marie Pierre & Vallois, Pierre, 2003. "Asymptotic behavior of the local score of independent and identically distributed random sequences," Stochastic Processes and their Applications, Elsevier, vol. 107(1), pages 1-28, September.
    5. Wolfsheimer Stefan & Hartmann Alexander & Rabus Ralf & Nuel Gregory, 2012. "Computing Posterior Probabilities for Score-based Alignments Using ppALIGN," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(4), pages 1-37, May.
    6. Lagnoux, Agnès & Mercier, Sabine & Vallois, Pierre, 2019. "Probability density function of the local score position," Stochastic Processes and their Applications, Elsevier, vol. 129(10), pages 3664-3689.
    7. Arribas-Gil Ana & Matias Catherine, 2012. "A Context Dependent Pair Hidden Markov Model for Statistical Alignment," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-29, January.
    8. Chabriac, Claudie & Lagnoux, Agnès & Mercier, Sabine & Vallois, Pierre, 2014. "Elements related to the largest complete excursion of a reflected BM stopped at a fixed time. Application to local score," Stochastic Processes and their Applications, Elsevier, vol. 124(12), pages 4202-4223.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lagnoux, Agnès & Mercier, Sabine & Vallois, Pierre, 2019. "Probability density function of the local score position," Stochastic Processes and their Applications, Elsevier, vol. 129(10), pages 3664-3689.
    2. M. P. Etienne & P. Vallois, 2004. "Approximation of the Distribution of the Supremum of a Centered Random Walk. Application to the Local Score," Methodology and Computing in Applied Probability, Springer, vol. 6(3), pages 255-275, September.
    3. Chabriac, Claudie & Lagnoux, Agnès & Mercier, Sabine & Vallois, Pierre, 2014. "Elements related to the largest complete excursion of a reflected BM stopped at a fixed time. Application to local score," Stochastic Processes and their Applications, Elsevier, vol. 124(12), pages 4202-4223.
    4. Jie Chen & Thomas Ferguson & Paul Jorgensen, 2020. "Using Scan Statistics for Cluster Detection: Recognizing Real Bandwagons," Methodology and Computing in Applied Probability, Springer, vol. 22(4), pages 1481-1491, December.
    5. Alexandru Amarioarei & Cristian Preda, 2020. "One Dimensional Discrete Scan Statistics for Dependent Models and Some Related Problems," Mathematics, MDPI, vol. 8(4), pages 1-11, April.
    6. McGill, Paul, 2006. "An asymptotic estimate for Brownian motion with drift," Statistics & Probability Letters, Elsevier, vol. 76(11), pages 1164-1169, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metcap:v:24:y:2022:i:3:d:10.1007_s11009-021-09856-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.