IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i6p654-d519797.html
   My bibliography  Save this article

GASVeM: A New Machine Learning Methodology for Multi-SNP Analysis of GWAS Data Based on Genetic Algorithms and Support Vector Machines

Author

Listed:
  • Fidel Díez Díaz

    (CTIC Technological Centre, W3C Spain Office Host, Ada Byron 39, 33203 Gijón, Spain)

  • Fernando Sánchez Lasheras

    (Department of Mathematics, Faculty of Sciences, Universidad de Oviedo, 33007 Oviedo, Spain)

  • Víctor Moreno

    (Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), Colorectal Cancer Group, ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, 08908 Barcelona, Spain)

  • Ferran Moratalla-Navarro

    (Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), Colorectal Cancer Group, ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, 08908 Barcelona, Spain)

  • Antonio José Molina de la Torre

    (IBIOMED, University of Leon, Vegazana Campus, 24400 León, Spain)

  • Vicente Martín Sánchez

    (CIBERESP, University of Leon, Vegazana Campus, 24400 León, Spain)

Abstract

Genome-wide association studies (GWAS) are observational studies of a large set of genetic variants in an individual’s sample in order to find if any of these variants are linked to a particular trait. In the last two decades, GWAS have contributed to several new discoveries in the field of genetics. This research presents a novel methodology to which GWAS can be applied to. It is mainly based on two machine learning methodologies, genetic algorithms and support vector machines. The database employed for the study consisted of information about 370,750 single-nucleotide polymorphisms belonging to 1076 cases of colorectal cancer and 973 controls. Ten pathways with different degrees of relationship with the trait under study were tested. The results obtained showed how the proposed methodology is able to detect relevant pathways for a certain trait: in this case, colorectal cancer.

Suggested Citation

  • Fidel Díez Díaz & Fernando Sánchez Lasheras & Víctor Moreno & Ferran Moratalla-Navarro & Antonio José Molina de la Torre & Vicente Martín Sánchez, 2021. "GASVeM: A New Machine Learning Methodology for Multi-SNP Analysis of GWAS Data Based on Genetic Algorithms and Support Vector Machines," Mathematics, MDPI, vol. 9(6), pages 1-19, March.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:6:p:654-:d:519797
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/6/654/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/6/654/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. J. Vilán Vilán & J. Alonso Fernández & P. García Nieto & F. Sánchez Lasheras & F. de Cos Juez & C. Díaz Muñiz, 2013. "Support Vector Machines and Multilayer Perceptron Networks Used to Evaluate the Cyanotoxins Presence from Experimental Cyanobacteria Concentrations in the Trasona Reservoir (Northern Spain)," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 27(9), pages 3457-3476, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chang-ming Ji & Ting Zhou & Hai-tao Huang, 2014. "Operating Rules Derivation of Jinsha Reservoirs System with Parameter Calibrated Support Vector Regression," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 28(9), pages 2435-2451, July.
    2. García Nieto, P.J. & García-Gonzalo, E. & Sánchez Lasheras, F. & de Cos Juez, F.J., 2015. "Hybrid PSO–SVM-based method for forecasting of the remaining useful life for aircraft engines and evaluation of its reliability," Reliability Engineering and System Safety, Elsevier, vol. 138(C), pages 219-231.
    3. Paulino José García-Nieto & Esperanza García-Gonzalo & José Ramón Alonso Fernández & Cristina Díaz Muñiz, 2020. "A New Predictive Model for Evaluating Chlorophyll-a Concentration in Tanes Reservoir by Using a Gaussian Process Regression," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 34(15), pages 4921-4941, December.
    4. Nieto, P.J. García & Fernández, J.R. Alonso & Suárez, V.M. González & Muñiz, C. Díaz & García-Gonzalo, E. & Bayón, R. Mayo, 2015. "A hybrid PSO optimized SVM-based method for predicting of the cyanotoxin content from experimental cyanobacteria concentrations in the Trasona reservoir: A case study in Northern Spain," Applied Mathematics and Computation, Elsevier, vol. 260(C), pages 170-187.
    5. Yunfeng Xu & Chunzi Ma & Shouliang Huo & Dayi Zhang & Zhiping Xu & Guangren Qian & Beidou Xi, 2014. "Establishing Reference Conditions for Lake Water Quality: A Novel Extrapolation Approach," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 28(8), pages 2161-2178, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:6:p:654-:d:519797. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.