IDEAS home Printed from https://ideas.repec.org/p/cdl/ucscec/qt6sz4v0nc.html
   My bibliography  Save this paper

Combined spectral and speech features for pig speech recognition

Author

Listed:
  • Wu, Xuan
  • Zhou, Silong
  • Chen, Mingwei
  • Zhao, Yihang
  • Wang, Yifei
  • Zhao, Xianmeng
  • Li, Danyang
  • Pu, Haibo

Abstract

The sound of the pig is one of its important signs, which can reflect various states such as hunger, pain or emotional state, and directly indicates the growth and health status of the pig. Existing speech recognition methods usually start with spectral features. The use of spectrograms to achieve classification of different speech sounds, while working well, may not be the best approach for solving such tasks with single-dimensional feature input. Based on the above assumptions, in order to more accurately grasp the situation of pigs and take timely measures to ensure the health status of pigs, this paper proposes a pig sound classification method based on the dual role of signal spectrum and speech. Spectrograms can visualize information about the characteristics of the sound under different time periods. The audio data are introduced, and the spectrogram features of the model input as well as the audio time-domain features are complemented with each other and passed into a pre-designed parallel network structure. The network model with the best results and the classifier were selected for combination. An accuracy of 93.39% was achieved on the pig speech classification task, while the AUC also reached 0.99163, demonstrating the superiority of the method. This study contributes to the direction of computer vision and acoustics by recognizing the sound of pigs. In addition, a total of 4,000 pig sound datasets in four categories are established in this paper to provide a research basis for later research scholars.

Suggested Citation

  • Wu, Xuan & Zhou, Silong & Chen, Mingwei & Zhao, Yihang & Wang, Yifei & Zhao, Xianmeng & Li, Danyang & Pu, Haibo, 2022. "Combined spectral and speech features for pig speech recognition," Santa Cruz Department of Economics, Working Paper Series qt6sz4v0nc, Department of Economics, UC Santa Cruz.
  • Handle: RePEc:cdl:ucscec:qt6sz4v0nc
    as

    Download full text from publisher

    File URL: https://www.escholarship.org/uc/item/6sz4v0nc.pdf;origin=repeccitec
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cdl:ucscec:qt6sz4v0nc. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Lisa Schiff (email available below). General contact details of provider: https://edirc.repec.org/data/ecucsus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.