
Spoken Digit Recognition using the k-Nearest-Neighbor method and Mel Frequency Cepstral Coefficients

Author

Listed:
  • Sorin MURARU
  • Catalina Lucia COCIANU

Abstract

This study investigates the use of the k-nearest-neighbor (kNN) algorithm for speech recognition. The evaluations use the AudioMNIST dataset, in which the model predicts the spoken digit, from 0 to 9. Two training-to-test splits of the dataset are used, 70%-30% and 80%-20%, while the k parameter ranges from 1 to 12. To better adapt the prediction model, the Mel-frequency cepstral coefficients are extracted from each audio sample, and the 13 filters are averaged over 25 ms frame windows with 10 ms frame overlap. In both training-to-test configurations, the value of k that obtained the highest accuracy (> 95%) is k=5, while the easiest digit to predict was "7". These findings underscore the efficacy of k-nearest-neighbor in speech recognition tasks and highlight the importance of parameter selection and feature extraction techniques in optimizing model performance. Further exploration of kNN's applicability in diverse speech recognition contexts holds promise for advancing the field's understanding and practical implementations.
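
The evaluation procedure described in the abstract (one averaged 13-coefficient vector per clip, a train/test split, and a sweep of k from 1 to 12) can be illustrated with a minimal sketch. This is not the authors' code. It uses only the Python standard library, it does not extract MFCCs from audio, and it replaces AudioMNIST with synthetic, well-separated clusters produced by a hypothetical helper, `make_toy_clips`. All function names, the Euclidean distance, and the tie-breaking rule are assumptions made for illustration, and the accuracies it produces say nothing about the paper's results.

```python
import math
import random
from collections import Counter


def average_frames(frames):
    """Collapse per-frame coefficient vectors into one mean vector per clip."""
    return [sum(column) / len(frames) for column in zip(*frames)]


def knn_predict(train_x, train_y, query, k):
    """Majority label among the k training vectors closest to `query`."""
    # Assumption: Euclidean distance; the abstract does not name the metric.
    order = sorted(range(len(train_x)), key=lambda i: math.dist(train_x[i], query))
    votes = Counter(train_y[i] for i in order[:k])
    # On a tied vote, the label encountered first (nearest neighbor) wins.
    return votes.most_common(1)[0][0]


def accuracy(train_x, train_y, test_x, test_y, k):
    """Fraction of test vectors whose predicted label matches the true label."""
    hits = sum(knn_predict(train_x, train_y, x, k) == y for x, y in zip(test_x, test_y))
    return hits / len(test_x)


def split(features, labels, train_fraction, seed=0):
    """Shuffle indices with a fixed seed, then cut into train and test parts."""
    indices = list(range(len(features)))
    random.Random(seed).shuffle(indices)
    cut = round(len(indices) * train_fraction)
    train, test = indices[:cut], indices[cut:]
    return ([features[i] for i in train], [labels[i] for i in train],
            [features[i] for i in test], [labels[i] for i in test])


def make_toy_clips(clips_per_digit=20, n_coeffs=13, n_frames=5, seed=0):
    """Hypothetical stand-in for MFCC frames: one synthetic cluster per digit."""
    rng = random.Random(seed)
    features, labels = [], []
    for digit in range(10):
        for _ in range(clips_per_digit):
            frames = [[10.0 * digit + rng.gauss(0.0, 1.0) for _ in range(n_coeffs)]
                      for _ in range(n_frames)]
            features.append(average_frames(frames))
            labels.append(digit)
    return features, labels


def sweep_k(features, labels, train_fraction, k_values=range(1, 13), seed=0):
    """Map each k to its test accuracy for one train/test split."""
    train_x, train_y, test_x, test_y = split(features, labels, train_fraction, seed)
    return {k: accuracy(train_x, train_y, test_x, test_y, k) for k in k_values}
```

Calling `sweep_k(*make_toy_clips(), train_fraction=0.7)` and again with `train_fraction=0.8` mirrors the two split configurations; with real data, the vectors returned by `make_toy_clips` would be replaced by per-clip averaged MFCC vectors from an audio feature library.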

Suggested Citation

  • Sorin MURARU & Catalina Lucia COCIANU, 2024. "Spoken Digit Recognition using the k-Nearest-Neighbor method and Mel Frequency Cepstral Coefficients," Informatica Economica, Academy of Economic Studies - Bucharest, Romania, vol. 28(2), pages 5-16.
  • Handle: RePEc:aes:infoec:v:28:y:2024:i:2:p:5-16

    Download full text from publisher

    File URL: https://revistaie.ase.ro/content/110/01%20-%20muraru,%20cocianu.pdf
    Download Restriction: no
