Author
Listed:
- REEMA G. AL-ANAZI
(Department of Arabic Language and Literature, College of Humanities and Social Sciences, Princess Nourah bint Abdulrahman University, P. O. Box 84428, Riyadh, 11671, Saudi Arabia)
- ABDULLAH SAAD AL-DOBAIAN
(��Department of English Language, College of Language Sciences, King Saud University, P. O. Box 145111, Riyadh, Saudi Arabia)
- ASMA ABBAS HASSAN
(��Computer Science Department, Applied College at Mahayil, King Khalid University, Muhayel Aseer, 62529, Saudi Arabia)
- MANAR ALMANEA
(�Department of English, College of Languages and Translation, Imam Mohammad Ibn Saud Islamic University, Riyadh 11432, Saudi Arabia)
- AYMAN AHMAD ALGHAMDI
(�Department of Arabic Teaching, Arabic Language Institute, Umm Al-qura University, Saudi Arabia)
- SOMIA A. ASKLANY
(��Department of Computer Science and Information Technology, Faculty of Sciences and Arts, Turaif, Northern Border University, Arar 91431, Kingdom of Saudi Arabia)
- HANAN AL SULTAN
(*Department of English, College of Arts, King Faisal University, Saudi Arabia)
- JIHEN MAJDOUBI
(��†Department of Computer Science, College of Science and Humanities at Alghat, Majmaah University, Al-Majmaah 11952, Saudi Arabia)
Abstract
Humans prefer to convey information through speech utilizing similar language. Speech detection is the capability to recognize the spoken words of the speaking person. Recent work demonstrates the increased attention among researcher workers in this field specially in Brain-Like computing applications and emphasizes the real-world usability of speech for speaker recognition across different applications. Automatic speech recognition (ASR) is the method of identifying human speech and converting it into text. This study has gained much popularity in recent times. It is a crucial area of research for human-to-machine interaction. Pioneer methods are concerned with manual feature extraction and classical algorithms including Hidden Markov Models (HMM), Gaussian Mixture Model (GMM), and the Dynamic Time Warping (DTW) model. In recent years, neural networks, namely convolutional neural networks (CNN), recurrent neural networks (RNN), and Transformers, have been utilized in the context of ASR and reached outstanding performance over the past few years. This study introduces Intelligent Speech Recognition using the Fractal Amended Grasshopper Optimization Algorithm with Deep Learning (ISR-AGODL) approach. The presented ISR-AGODL technique correctly identifies and recognizes speech signals. In the ISR-AGODL technique, the speech signals are transformed into spectrograms. Besides, the features are derived using the deep convolutional neural networks (DCNN) model. Followed by the Fractals AGO technique is utilized for the choosing of hyperparameters. Finally, the recognition of speech signals is achieved using the extreme gradient boosting (XGBoost) model. The simulation outcomes of the ISR-AGODL method can be validated using a benchmark dataset. The experimental results of the ISR-AGODL method portrayed a superior accuracy outcome of 96.34% over other models.
Suggested Citation
Reema G. Al-Anazi & Abdullah Saad Al-Dobaian & Asma Abbas Hassan & Manar Almanea & Ayman Ahmad Alghamdi & Somia A. Asklany & Hanan Al Sultan & Jihen Majdoubi, 2024.
"Intelligent Speech Recognition Using Fractal Amended Grasshopper Optimization Algorithm With Deep Learning Approach,"
FRACTALS (fractals), World Scientific Publishing Co. Pte. Ltd., vol. 32(09n10), pages 1-12.
Handle:
RePEc:wsi:fracta:v:32:y:2024:i:09n10:n:s0218348x25400298
DOI: 10.1142/S0218348X25400298
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:fracta:v:32:y:2024:i:09n10:n:s0218348x25400298. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: https://www.worldscientific.com/worldscinet/fractals .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.