IDEAS home Printed from https://ideas.repec.org/a/gam/jagris/v15y2025i4p409-d1591989.html
   My bibliography  Save this article

HGATGS: Hypergraph Attention Network for Crop Genomic Selection

Author

Listed:
  • Xuliang He

    (College of Information and Electrical Engineering, Shenyang Agricultural University, Shenyang 110866, China
    National Innovation Center for Digital Seed Industry, Beijing 100097, China)

  • Kaiyi Wang

    (National Innovation Center for Digital Seed Industry, Beijing 100097, China)

  • Liyang Zhang

    (SDIC Seed Technology Co., Ltd., Beijing 100034, China)

  • Dongfeng Zhang

    (National Innovation Center for Digital Seed Industry, Beijing 100097, China)

  • Feng Yang

    (National Innovation Center for Digital Seed Industry, Beijing 100097, China)

  • Qiusi Zhang

    (National Innovation Center for Digital Seed Industry, Beijing 100097, China)

  • Shouhui Pan

    (National Innovation Center for Digital Seed Industry, Beijing 100097, China
    Beijing PAIDE Science and Technology Development Co., Ltd., Beijing 100097, China)

  • Jinlong Li

    (State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China)

  • Longpeng Bai

    (National Innovation Center for Digital Seed Industry, Beijing 100097, China
    Key Laboratory of Fisheries Information, Ministry of Agriculture and Rural Affairs, Shanghai Ocean University, Shanghai 201306, China)

  • Jiahao Sun

    (National Innovation Center for Digital Seed Industry, Beijing 100097, China
    College of Agronomy, Northwest A&F University, Yangling 712100, China)

  • Zhongqiang Liu

    (National Innovation Center for Digital Seed Industry, Beijing 100097, China)

Abstract

Many important plants’ agronomic traits, such as crop yield, stress tolerance, and other traits, are controlled by multiple genes and exhibit complex inheritance patterns. Traditional breeding methods often encounter difficulties in dealing with these traits due to their complexity. However, genomic selection (GS), which utilizes high-density molecular markers across the entire genome to facilitate selection in breeding programs, excels in capturing the genetic variation associated with these traits. This enables more accurate and efficient selection in breeding. The traditional crop genome selection model, based on statistical methods or machine learning models, often treats samples as independent entities while neglecting the abundance latent relational information among them. Consequently, this limitation hampers their predictive performance. In this study, we proposed a novel crop genome selection model based on hypergraph attention networks for genomic prediction (HGATGS). This model incorporates dynamic hyperedges that are designed based on sample similarity to validate the efficacy of high-order relationships between samples for phenotypic prediction. By introducing an attention mechanism, it assigns weights to different hyperedges and nodes, thereby enhancing the ability to capture kinship relationships among samples. Additionally, residual connections are incorporated between hypergraph convolutional layers to further improve model stability and performance. The model was validated on datasets for multiple crops, including wheat, corn, and rice. The results showed that HGATGS significantly outperformed traditional statistical methods and machine learning models on the Wheat 599, Rice 299, and G2F 2017 datasets. On Wheat 599, HGATGS achieved a correlation coefficient of 0.54, a 14.9% improvement over methods like R-BLUP and BayesA (0.47). On Rice 299, HGATGS reached 0.45, a 66.7% increase compared to other models like R-BLUP and SVR (0.27). On G2F 2017, HGATGS attained 0.88, slightly surpassing other models like R-BLUP and BayesA (0.87). We conducted ablation experiments to compare the model’s performance across three datasets, and found that the model integrating hypergraph attention and residual connections performed optimally. Subsequent comparisons of the model’s prediction performance with dynamically selected different k values revealed optimal performance when K = (3,4). The model’s prediction performance was also compared across different single nucleotide polymorphisms (SNPs) and sample sizes in various datasets, with HGATGS consistently outperforming the comparison models. Finally, visualizations of the constructed hypergraph structures showed that certain nodes have high connection densities with hyperedges. These nodes often represent varieties or genotypes with significant impacts on traits. During feature aggregation, these high-connectivity nodes contribute significantly to the prediction results and demonstrate better prediction performance across multiple traits in multiple crops. This demonstrates that the method of constructing hypergraphs through correlation relationships for prediction is highly effective.

Suggested Citation

  • Xuliang He & Kaiyi Wang & Liyang Zhang & Dongfeng Zhang & Feng Yang & Qiusi Zhang & Shouhui Pan & Jinlong Li & Longpeng Bai & Jiahao Sun & Zhongqiang Liu, 2025. "HGATGS: Hypergraph Attention Network for Crop Genomic Selection," Agriculture, MDPI, vol. 15(4), pages 1-23, February.
  • Handle: RePEc:gam:jagris:v:15:y:2025:i:4:p:409-:d:1591989
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2077-0472/15/4/409/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2077-0472/15/4/409/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jagris:v:15:y:2025:i:4:p:409-:d:1591989. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.