IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i21p4134-d964324.html
   My bibliography  Save this article

HSIC CR : A Lightweight Scoring Criterion Based on Measuring the Degree of Causality for the Detection of SNP Interactions

Author

Listed:
  • Junxi Zheng

    (School of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, China)

  • Juan Zeng

    (School of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, China)

  • Xinyang Wang

    (School of Information Science and Technology, Beijing Forestry University, Beijing 100083, China)

  • Gang Li

    (School of Information Technology Engineering, Guangzhou College of Commerce, Guangzhou 511363, China)

  • Jiaxian Zhu

    (School of Computer Science, Zhaoqing University, Zhaoqing 526061, China)

  • Fanghong Wang

    (School of Bussiness, Zhijiang College of Zhengjiang University of Technology, Shaoxin 310024, China)

  • Deyu Tang

    (School of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou 510006, China)

Abstract

Recently, research on detecting SNP interactions has attracted considerable attention, which is of great significance for exploring complex diseases. The formulation of effective swarm intelligence optimization algorithms is a primary resolution to this issue. To achieve this goal, an important problem needs to be solved in advance; that is, designing and selecting lightweight scoring criteria that can be calculated in O ( m ) time and can accurately estimate the degree of association between SNP combinations and disease status. In this study, we propose a high-accuracy scoring criterion (HSIC C R ) by measuring the degree of causality dedicated to assessing the degree. First, we approximate two kinds of dependencies according to the structural equation of the causal relationship between epistasis SNP combination and disease status. Then, inspired by these dependencies, we put forward this scoring criterion that integrates a widely used method of measuring statistical dependencies based on kernel functions (HSIC). However, the computing time complexity of HSIC is O ( m 2 ) , which is too costly to be an integral part of the scoring criterion. Since the sizes of the sample space of the disease status, SNP loci and SNP combination are small enough, we propose an efficient method of computing HSIC for variables with a small sample in O ( m ) time. Eventually, HSIC C R can be computed in O ( m ) time in practice. Finally, we compared HSIC C R with five representative high-accuracy scoring criteria that detect SNP interactions for 49 simulation disease models. The experimental results show that the accuracy of our proposed scoring criterion is, overall, state-of-the-art.

Suggested Citation

  • Junxi Zheng & Juan Zeng & Xinyang Wang & Gang Li & Jiaxian Zhu & Fanghong Wang & Deyu Tang, 2022. "HSIC CR : A Lightweight Scoring Criterion Based on Measuring the Degree of Causality for the Detection of SNP Interactions," Mathematics, MDPI, vol. 10(21), pages 1-17, November.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:21:p:4134-:d:964324
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/21/4134/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/21/4134/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Christopher S. Carlson & Michael A. Eberle & Leonid Kruglyak & Deborah A. Nickerson, 2004. "Mapping complex disease loci in whole-genome association studies," Nature, Nature, vol. 429(6990), pages 446-452, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tomas Drgon & Ping-Wu Zhang & Catherine Johnson & Donna Walther & Judith Hess & Michelle Nino & George R Uhl, 2010. "Genome Wide Association for Addiction: Replicated Results and Comparisons of Two Analytic Approaches," PLOS ONE, Public Library of Science, vol. 5(1), pages 1-13, January.
    2. Xin Li & Lin Zhou & Tao Jia & Ran Peng & Xiongwu Fu & Yuliang Zou, 2020. "Associating COVID-19 Severity with Urban Factors: A Case Study of Wuhan," IJERPH, MDPI, vol. 17(18), pages 1-20, September.
    3. Marika Plöthner & Martin Frank & J.-Matthias Graf Schulenburg, 2017. "Cost analysis of whole genome sequencing in German clinical practice," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 18(5), pages 623-633, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:21:p:4134-:d:964324. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.