IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i6p1380-d1095144.html
   My bibliography  Save this article

Word Game Modeling Using Character-Level N-Gram and Statistics

Author

Listed:
  • Jamolbek Mattiev

    (Information Technologies Department, Urgench State University, Khamid Alimdjan 14, Urgench 220100, Uzbekistan)

  • Ulugbek Salaev

    (Information Technologies Department, Urgench State University, Khamid Alimdjan 14, Urgench 220100, Uzbekistan)

  • Branko Kavsek

    (Department of Information Sciences and Technologies, University of Primorska, Glagoljaška 8, 6000 Koper, Slovenia
    AI Laboratory, Jožef Stefan Institute, Jamova Cesta 39, 1000 Ljubljana, Slovenia)

Abstract

Word games are one of the most essential factors of vocabulary learning and matching letters to form words for children aged 5–12. These games help children to improve letter and word recognition, memory-building, and vocabulary retention skills. Since Uzbek is a low-resource language, there has not been enough research into designing word games for the Uzbek language. In this paper, we develop two models for designing the cubic-letter game, also known as the matching-letter game, in the Uzbek language, consisting of a predefined number of cubes, with a letter on each side of each six-sided cube, and word cards to form words using a combination of the cubes. More precisely, we provide the opportunity to form as many words as possible from the dataset, while minimizing the number of cubes. The proposed methods were created using a combination of a character-level n-gram model and letter position frequency in words at the level of vowels and consonants. To perform the experiments, a novel dataset, consisting of 4.5 k 3–5 letter words, was created by filtering based on child age groups for the Uzbek language, and three more datasets were generated, based on the support of experts for the Russian, English, and Slovenian languages. Experimental evaluations showed that both models achieved good results in terms of average coverage. In particular, the Vowel Priority ( VL ) approach obtained reasonably high coverage with 95.9% in Uzbek, 96.8% in English, and 94.2% in the Slovenian language in the case of eight cubes, based on the five-fold cross-validation method. Both models covered around 85% of five letter words in Uzbek, English, and Slovenian datasets, while this coverage was even higher (99%) in three letter words in the case of eight cubes.

Suggested Citation

  • Jamolbek Mattiev & Ulugbek Salaev & Branko Kavsek, 2023. "Word Game Modeling Using Character-Level N-Gram and Statistics," Mathematics, MDPI, vol. 11(6), pages 1-15, March.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:6:p:1380-:d:1095144
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/6/1380/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/6/1380/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mofareh Alqahtani, 2015. "The importance of vocabulary in language learning and how to be taught," International Journal of Teaching and Education, International Institute of Social and Economic Sciences, vol. 3(3), pages 21-34, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ehsan Namaziandost & Murad Hassan Mohammed Sawalmeh & Shouket Ahmad Tilwani & Meisam Ziafar & Arin Arianti & Ronald M. Hernández & Oleg Anatolevich Razzhivin & Yolvi Ocaña-Fernández & Doris Fuster-, 2021. "Manipulation of the Involvement Load of L2 Reading Tasks: A Useful Heuristic for Enhanced L2 Vocabulary Development," SAGE Open, , vol. 11(4), pages 21582440211, October.
    2. Natalia S. Intja & Arminda D. Henda & Secilia N. Kangodi, 2022. "The Effect of Mother Tongue on Grade 4 Learners when Learning English as a Second Language: Case Study of Kavango East Region in Namibia," International Journal of Research and Innovation in Social Science, International Journal of Research and Innovation in Social Science (IJRISS), vol. 6(10), pages 820-824, October.
    3. Du Thanh Tran & Hanh Thi Le, 2023. "The Impact of Quizlet on Vocabulary Improvement: A Case Study in Binh Duong Province Secondary Schools," World Journal of English Language, Sciedu Press, vol. 13(6), pages 235-235, July.
    4. Roland Happ & Susanne Schmidt & Olga Zlatkin-Troitschanskaia & William Walstad, 2023. "How Gender and Primary Language Influence the Acquisition of Economic Knowledge of Secondary School Students in the United States and Germany," JRFM, MDPI, vol. 16(3), pages 1-14, March.
    5. Naginder Kaur, 2020. "Metacognitive Awareness in Lexical Learning among Malaysian Students," International Journal of English Language and Literature Studies, Asian Economic and Social Society, vol. 9(3), pages 161-171, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:6:p:1380-:d:1095144. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.