Author
Listed:
- Yun-Chia Liang
(Department of Industrial Engineering and Management, Yuan Ze University, No. 135, Yuan-Tung Rd., Chung-Li Dist., Taoyuan City 32003, Taiwan)
- Iven Wijaya
(Department of Industrial Engineering and Management, Yuan Ze University, No. 135, Yuan-Tung Rd., Chung-Li Dist., Taoyuan City 32003, Taiwan)
- Ming-Tao Yang
(Department of Chemical Engineering and Materials Science, Yuan Ze University, No. 135, Yuan-Tung Rd., Chung-Li Dist., Taoyuan City 32003, Taiwan
Far Eastern Memorial Hospital, No. 21, Sec. 2, Nanya S. Rd., Banciao Dist., New Taipei City 22000, Taiwan)
- Josue Rodolfo Cuevas Juarez
(Department of Industrial Engineering and Management, Yuan Ze University, No. 135, Yuan-Tung Rd., Chung-Li Dist., Taoyuan City 32003, Taiwan)
- Hou-Tai Chang
(Department of Industrial Engineering and Management, Yuan Ze University, No. 135, Yuan-Tung Rd., Chung-Li Dist., Taoyuan City 32003, Taiwan
Far Eastern Memorial Hospital, No. 21, Sec. 2, Nanya S. Rd., Banciao Dist., New Taipei City 22000, Taiwan)
Abstract
Recognizing why an infant cries is challenging as babies cannot communicate verbally with others to express their wishes or needs. This leads to difficulties for parents in identifying the needs and the health of their infants. This study used deep learning (DL) algorithms such as the convolutional neural network (CNN) and long short-term memory (LSTM) to recognize infants’ necessities such as hunger/thirst, need for a diaper change, emotional needs (e.g., need for touch/holding), and pain caused by medical treatment (e.g., injection). The classical artificial neural network (ANN) was also used for comparison. The inputs of ANN, CNN, and LSTM were the features extracted from 1607 10 s audio recordings of infants using mel-frequency cepstral coefficients (MFCC). Results showed that CNN and LSTM both provided decent performance, around 95% in accuracy, precision, and recall, in differentiating healthy and sick infants. For recognizing infants’ specific needs, CNN reached up to 60% accuracy, outperforming LSTM and ANN in almost all measures. These results could be applied as indicators for future applications to help parents understand their infant’s condition and needs.
Suggested Citation
Yun-Chia Liang & Iven Wijaya & Ming-Tao Yang & Josue Rodolfo Cuevas Juarez & Hou-Tai Chang, 2022.
"Deep Learning for Infant Cry Recognition,"
IJERPH, MDPI, vol. 19(10), pages 1-10, May.
Handle:
RePEc:gam:jijerp:v:19:y:2022:i:10:p:6311-:d:821539
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:10:p:6311-:d:821539. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.