IDEAS home Printed from https://ideas.repec.org/a/spr/metcap/v13y2011i1d10.1007_s11009-008-9107-1.html
   My bibliography  Save this article

Testing the Order of Markov Dependence in DNA Sequences

Author

Listed:
  • M. L. Menéndez

    (Polytechnical University of Madrid)

  • L. Pardo

    (Complutense University of Madrid)

  • M. C. Pardo

    (Complutense University of Madrid)

  • K. Zografos

    (University of Ioannina)

Abstract

DNA or protein sequences are usually modeled as probabilistic phenomena. The simplest model is created on the assumption that the nucleotides at the various sites are independently distributed. Usually the type of nucleotide at some site depends on the type at another site and therefore the DNA sequence is modeled as a Markov chain of random variables taking on the values A, G, C and T corresponding to the four nucleotides. First order or higher order Markov models provide better fit to a DNA sequence. Based on this remark, the aim of this paper is to present and study a family of test statistics for testing order Markov dependence in DNA sequences. This new family includes as a particular case the classical likelihood ratio test. A simulation study is presented in order to find test statistics, in this family, with a better behaviour than the likelihood ratio test.

Suggested Citation

  • M. L. Menéndez & L. Pardo & M. C. Pardo & K. Zografos, 2011. "Testing the Order of Markov Dependence in DNA Sequences," Methodology and Computing in Applied Probability, Springer, vol. 13(1), pages 59-74, March.
  • Handle: RePEc:spr:metcap:v:13:y:2011:i:1:d:10.1007_s11009-008-9107-1
    DOI: 10.1007/s11009-008-9107-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11009-008-9107-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11009-008-9107-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. L. Pardo & D. Morales & M. Salicrú & M. Menéndez, 1993. "Theϕ-divergence statistic in bivariate multinomial populations including stratification," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 40(1), pages 223-235, December.
    2. Menendez, M.L. & Pardo, J.A. & Pardo, L. & Zografos, K., 2006. "On tests of independence based on minimum [phi]-divergence estimator with constraints: An application to modeling DNA," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1100-1118, November.
    3. P. J. Avery & D. A. Henderson, 1999. "Fitting Markov chain models to discrete state series such as DNA sequences," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 48(1), pages 53-61.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Papapetrou, M. & Kugiumtzis, D., 2013. "Markov chain order estimation with conditional mutual information," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(7), pages 1593-1601.
    2. Papapetrou, M. & Kugiumtzis, D., 2020. "Tsallis conditional mutual information in investigating long range correlation in symbol sequences," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 540(C).
    3. Andreas C. Georgiou & Alexandra Papadopoulou & Pavlos Kolias & Haris Palikrousis & Evanthia Farmakioti, 2021. "On State Occupancies, First Passage Times and Duration in Non-Homogeneous Semi-Markov Chains," Mathematics, MDPI, vol. 9(15), pages 1-17, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Papapetrou, M. & Kugiumtzis, D., 2013. "Markov chain order estimation with conditional mutual information," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(7), pages 1593-1601.
    2. Anastasios N. Arapis & Frosso S. Makri & Zaharias M. Psillakis, 2017. "Joint distribution of k-tuple statistics in zero-one sequences of Markov-dependent trials," Journal of Statistical Distributions and Applications, Springer, vol. 4(1), pages 1-13, December.
    3. J. Besag & D. Mondal, 2013. "Exact Goodness-of-Fit Tests for Markov Chains," Biometrics, The International Biometric Society, vol. 69(2), pages 488-496, June.
    4. repec:aaa:journl:v:3:y:1999:i:1:p:87-100 is not listed on IDEAS
    5. Jonsson, Robert, 2011. "A Markov Chain Model for Analysing the Progression of Patient’s Health States," Research Reports 2011:6, University of Gothenburg, Statistical Research Unit, School of Business, Economics and Law.
    6. Varin, Cristiano & Vidoni, Paolo, 2006. "Pairwise likelihood inference for ordinal categorical time series," Computational Statistics & Data Analysis, Elsevier, vol. 51(4), pages 2365-2373, December.
    7. Menendez, M.L. & Pardo, J.A. & Pardo, L. & Zografos, K., 2006. "On tests of independence based on minimum [phi]-divergence estimator with constraints: An application to modeling DNA," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1100-1118, November.
    8. Aylin Alin & Serdar Kurt, 2008. "Ordinary and penalized minimum power-divergence estimators in two-way contingency tables," Computational Statistics, Springer, vol. 23(3), pages 455-468, July.
    9. Jonsson, Robert, 2011. "Tests of Markov Order and Homogeneity in a Markov Chain," Research Reports 2011:7, University of Gothenburg, Statistical Research Unit, School of Business, Economics and Law.
    10. Gabadinho, Alexis & Ritschard, Gilbert, 2016. "Analyzing State Sequences with Probabilistic Suffix Trees: The PST R Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 72(i03).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metcap:v:13:y:2011:i:1:d:10.1007_s11009-008-9107-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.