IDEAS home Printed from https://ideas.repec.org/a/hin/complx/8917258.html

Building Up a Robust Risk Mathematical Platform to Predict Colorectal Cancer

Author

Listed:
  • Le Zhang
  • Chunqiu Zheng
  • Tian Li
  • Lei Xing
  • Han Zeng
  • Tingting Li
  • Huan Yang
  • Jia Cao
  • Badong Chen
  • Ziyuan Zhou

Abstract

Colorectal cancer (CRC), which results from a multistep process driven by multiple factors, is one of the most common life-threatening cancers worldwide. Identifying "high-risk" populations is critical for early diagnosis and for improving the overall survival rate. Among the complicated genetic and environmental factors, which group contributes most to colorectal carcinogenesis remains contentious. For this reason, this study collects relatively complete information on genetic variations and environmental exposures for both CRC patients and cancer-free controls, and uses these big data to train and test a multimethod ensemble model for CRC-risk prediction. Our results demonstrate that (1) the explored genetic and environmental biomarkers are linked to CRC by biological function-based or population-based evidence, (2) the model can efficiently predict the risk of CRC after its parameters are optimized on the big CRC-related data, and (3) our novel heterogeneous ensemble learning model (HELM) and generalized kernel recursive maximum correntropy (GKRMC) algorithm have high predictive power. Finally, we discuss why HELM and GKRMC can outperform classical regression algorithms, and we outline related subjects for future study.

Suggested Citation

  • Le Zhang & Chunqiu Zheng & Tian Li & Lei Xing & Han Zeng & Tingting Li & Huan Yang & Jia Cao & Badong Chen & Ziyuan Zhou, 2017. "Building Up a Robust Risk Mathematical Platform to Predict Colorectal Cancer," Complexity, Hindawi, vol. 2017, pages 1-14, October.
  • Handle: RePEc:hin:complx:8917258
    DOI: 10.1155/2017/8917258

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/8503/2017/8917258.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/8503/2017/8917258.xml
    Download Restriction: no


    References listed on IDEAS

    1. Chris A. Mattmann, 2013. "A vision for data science," Nature, Nature, vol. 493(7433), pages 473-475, January.
    2. Jiang, Beini & Dai, Weizhong & Khaliq, Abdul & Carey, Michelle & Zhou, Xiaobo & Zhang, Le, 2015. "Novel 3D GPU based numerical parallel diffusion algorithms in cylindrical coordinates for health care simulation," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 109(C), pages 1-19.
    3. Hailiang Huang & Pritam Chanda & Alvaro Alonso & Joel S Bader & Dan E Arking, 2011. "Gene-Based Tests of Association," PLOS Genetics, Public Library of Science, vol. 7(7), pages 1-15, July.

    Citations


    Cited by:

    1. Le Zhang & Wanyu Bai & Na Yuan & Zhenglin Du, 2019. "Comprehensively benchmarking applications for detecting copy number variation," PLOS Computational Biology, Public Library of Science, vol. 15(5), pages 1-12, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daphne R. Raban & Avishag Gordon, 2020. "The evolution of data science and big data research: A bibliometric analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(3), pages 1563-1581, March.
    2. Charlotte Wang & Wen-Hsin Kao & Chuhsing Kate Hsiao, 2015. "Using Hamming Distance as Information for SNP-Sets Clustering and Testing in Disease Association Studies," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-24, August.
    3. Diana Chang & Feng Gao & Andrea Slavney & Li Ma & Yedael Y Waldman & Aaron J Sams & Paul Billing-Ross & Aviv Madar & Richard Spritz & Alon Keinan, 2014. "Accounting for eXentricities: Analysis of the X Chromosome in GWAS Reveals X-Linked Genes Implicated in Autoimmune Diseases," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-31, December.
    4. Klimeš, Lubomír & Mauder, Tomáš & Charvát, Pavel & Štětina, Josef, 2018. "Front tracking in modelling of latent heat thermal energy storage: Assessment of accuracy and efficiency, benchmarking and GPU-based acceleration," Energy, Elsevier, vol. 155(C), pages 297-311.
    5. Joyce de Souza Zanirato Maia & Ana Paula Arantes Bueno & Joao Ricardo Sato, 2023. "Applications of Artificial Intelligence Models in Educational Analytics and Decision Making: A Systematic Review," World, MDPI, vol. 4(2), pages 1-26, May.
    6. Yan, Li & Cao, Huiying & Gao, Chao & Wang, Zhen & Li, Xuelong, 2023. "Mining of book-loan behavior based on coupling relationship analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 613(C).
    7. Pallav Bhatnagar & Emily Barron-Casella & Christopher J Bean & Jacqueline N Milton & Clinton T Baldwin & Martin H Steinberg & Michael DeBaun & James F Casella & Dan E Arking, 2013. "Genome-Wide Meta-Analysis of Systolic Blood Pressure in Children with Sickle Cell Disease," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-1, September.
    8. Alberto Fernández & Sara Río & Abdullah Bawakid & Francisco Herrera, 2017. "Fuzzy rule based classification systems for big data with MapReduce: granularity analysis," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(4), pages 711-730, December.
    9. Emily Mathieu, 2016. "AGGrEGATOr: A Gene-based GEne-Gene interActTiOn test for case-control association studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(2), pages 151-171, April.
    10. Zheng Xu, 2023. "Association Testing of a Group of Genetic Markers Based on Next-Generation Sequencing Data and Continuous Response Using a Linear Model Framework," Mathematics, MDPI, vol. 11(6), pages 1-32, March.
