IDEAS home Printed from https://ideas.repec.org/a/hin/jjmath/9014764.html
   My bibliography  Save this article

A Group Feature Screening Procedure Based on Pearson Chi-Square Statistic for Biology Data with Categorical Response

Author

Listed:
  • Hanji He
  • Jianfeng He
  • Guangming Deng
  • Nian-Sheng Tang

Abstract

The analysis of biogenetic data makes an important contribution to the understanding of disease mechanisms and the diagnosis of rare diseases. In this analysis, the selection of significant features affecting the disease provides an effective basis for subsequent disease judgment and treatment direction. However, this is not a simple task as biogenetic data have challenges such as ultra-high dimensionality of potential features, imbalance of response variables, and genetic associations. This study focuses on the group structure in feature screening with biogenetic data. Specifically, group structure exists for biogenetic data, so we need to analyze the entire genome rather than individual strongly correlated genes. This study proposes a group feature screening method that considers group correlations using adjusted Pearson’s cardinality statistic to address this issue. The method can be applied to both continuous and discrete covariates. The performance of the proposed method is illustrated by simulation studies, where the proposed method performs well with imbalanced data and multicategorical responses. In the application of lung cancer diagnosis, the proposed method for imbalanced data categorization is impressive, and the dimension reduction using linear discriminant is still good.

Suggested Citation

  • Hanji He & Jianfeng He & Guangming Deng & Nian-Sheng Tang, 2024. "A Group Feature Screening Procedure Based on Pearson Chi-Square Statistic for Biology Data with Categorical Response," Journal of Mathematics, Hindawi, vol. 2024, pages 1-21, September.
  • Handle: RePEc:hin:jjmath:9014764
    DOI: 10.1155/2024/9014764
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/jmath/2024/9014764.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/jmath/2024/9014764.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2024/9014764?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:jjmath:9014764. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.