IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i4p589-d1588381.html
   My bibliography  Save this article

BTFBS: Binding Prediction of Bacterial Transcription Factors and Binding Sites Based on Deep Learning

Author

Listed:
  • Bingbing Jin

    (College of Sciences, Nanjing Agricultural University, Nanjing 210095, China)

  • Song Liang

    (College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China)

  • Xiaoqian Liu

    (College of Sciences, Nanjing Agricultural University, Nanjing 210095, China)

  • Rui Zhang

    (College of Sciences, Nanjing Agricultural University, Nanjing 210095, China)

  • Yun Zhu

    (College of Sciences, Nanjing Agricultural University, Nanjing 210095, China)

  • Yuanyuan Chen

    (College of Sciences, Nanjing Agricultural University, Nanjing 210095, China)

  • Guangjin Liu

    (College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China)

  • Tao Yang

    (College of Sciences, Nanjing Agricultural University, Nanjing 210095, China)

Abstract

The binding of transcription factors (TFs) to TF binding sites plays a vital role in the process of regulating gene expression and evolution. With the development of machine learning and deep learning, some successes have been achieved in predicting transcription factors and binding sites. In this paper, we develop a model, BTFBS, which predicts whether the bacterial transcription factors and binding sites combine or not. The model takes both the amino acid sequences of bacterial transcription factors and the nucleotide sequences of binding sites as inputs, and extracts features through convolutional neural network and MultiheadAttention. For the model inputs, we use two negative sample sampling methods: RS and EE. On the test dataset of RS, the accuracy, sensitivity, specificity, F1-score, and MCC of BTFBS are 0.91446, 0.89746, 0.93134, 0.91264, and 0.82946, respectively. Furthermore, on the test dataset of EE, the accuracy, sensitivity, specificity, F1-score and MCC of BTFBS are 0.87868, 0.89354, 0.86394, 0.87996, and 0.75796, respectively. Meanwhile, our findings indicate that the optimal approach for obtaining negative samples in the context of bacterial research is to utilize the whole genome sequences of the corresponding bacteria, as opposed to the shuffling method. The above results on the test dataset have shown that the proposed BTFBS model has a good performance and it can provide an experimental guide.

Suggested Citation

  • Bingbing Jin & Song Liang & Xiaoqian Liu & Rui Zhang & Yun Zhu & Yuanyuan Chen & Guangjin Liu & Tao Yang, 2025. "BTFBS: Binding Prediction of Bacterial Transcription Factors and Binding Sites Based on Deep Learning," Mathematics, MDPI, vol. 13(4), pages 1-15, February.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:4:p:589-:d:1588381
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/4/589/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/4/589/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:4:p:589-:d:1588381. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.