IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v14y2015i1p21-34n3.html
   My bibliography  Save this article

A hidden Markov-model for gene mapping based on whole-genome next generation sequencing data

Author

Listed:
  • Claesen Jürgen

    (Interuniversity Institute of Biostatistics and Statistical Bioinformatics, Hasselt University, Martelarenlaan 42, 3500 Hasselt, Belgium)

  • Burzykowski Tomasz

    (I-Biostat, Hasselt University, Martelarenlaan 42, 3500 Hasselt, Belgium)

Abstract

The analysis of polygenic, phenotypic characteristics such as quantitative traits or inheritable diseases requires reliable scoring of many genetic markers covering the entire genome. The advent of high-throughput sequencing technologies provides a new way to evaluate large numbers of single nucleotide polymorphisms as genetic markers. Combining the technologies with pooling of segregants, as performed in bulk segregant analysis, should, in principle, allow the simultaneous mapping of multiple genetic loci present throughout the genome. We propose a hidden Markov-model to analyze the marker data obtained by the bulk segregant next generation sequencing. The model includes several states, each associated with a different probability of observing the same/different nucleotide in an offspring as compared to the parent. The transitions between the molecular markers imply transitions between the states of the model. After estimating the transition probabilities and state-related probabilities of nucleotide (dis)similarity, the most probable state for each SNP is selected. The most probable states can then be used to indicate which genomic regions may be likely to contain trait-related genes. The application of the model is illustrated on the data from a study of ethanol tolerance in yeast. Software is written in R. R-functions, R-scripts and documentation are available on www.ibiostat.be/software/bioinformatics.

Suggested Citation

  • Claesen Jürgen & Burzykowski Tomasz, 2015. "A hidden Markov-model for gene mapping based on whole-genome next generation sequencing data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(1), pages 21-34, February.
  • Handle: RePEc:bpj:sagmbi:v:14:y:2015:i:1:p:21-34:n:3
    DOI: 10.1515/sagmb-2014-0007
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/sagmb-2014-0007
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/sagmb-2014-0007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Héctor Corrada Bravo & Rafael A. Irizarry, 2010. "Model-Based Quality Assessment and Base-Calling for Second-Generation Sequencing Data," Biometrics, The International Biometric Society, vol. 66(3), pages 665-674, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yuan Ji & Yanxun Xu & Qiong Zhang & Kam-Wah Tsui & Yuan Yuan & Clift Norris Jr. & Shoudan Liang & Han Liang, 2011. "BM-Map: Bayesian Mapping of Multireads for Next-Generation Sequencing Data," Biometrics, The International Biometric Society, vol. 67(4), pages 1215-1224, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:14:y:2015:i:1:p:21-34:n:3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.