IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i16p2553-d1458846.html
   My bibliography  Save this article

Effective Methods of Categorical Data Encoding for Artificial Intelligence Algorithms

Author

Listed:
  • Furkat Bolikulov

    (Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 461-701, Gyeonggi-do, Republic of Korea)

  • Rashid Nasimov

    (Department of Information Systems and Technologies, Tashkent State University of Economics, Tashkent 100066, Uzbekistan)

  • Akbar Rashidov

    (Department of Artificial Intelligence and Information Systems, Samarkand State University Named after Sharof Rashidov, Samarkand 140100, Uzbekistan)

  • Farkhod Akhmedov

    (Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 461-701, Gyeonggi-do, Republic of Korea)

  • Young-Im Cho

    (Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 461-701, Gyeonggi-do, Republic of Korea)

Abstract

It is known that artificial intelligence algorithms are based on calculations performed using various mathematical operations. In order for these calculation processes to be carried out correctly, some types of data cannot be fed directly into the algorithms. In other words, numerical data should be input to these algorithms, but not all data in datasets collected for artificial intelligence algorithms are always numerical. These data may not be quantitative but may be important for the study under consideration. That is, these data cannot be thrown away. In such a case, it is necessary to transfer categorical data to numeric type. In this research work, 14 encoding methods of transforming of categorical data were considered. At the same time, conclusions are given about the general conditions of using these methods. During the research, categorical data in the dataset that were collected in order to assess whether it is possible to give credit to customers will be transformed based on 14 methods. After applying each encoding method, experimental tests are conducted based on the classification algorithm, and they are evaluated. At the end of the study, the results of the experimental tests are discussed and research conclusions are presented.

Suggested Citation

  • Furkat Bolikulov & Rashid Nasimov & Akbar Rashidov & Farkhod Akhmedov & Young-Im Cho, 2024. "Effective Methods of Categorical Data Encoding for Artificial Intelligence Algorithms," Mathematics, MDPI, vol. 12(16), pages 1-22, August.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:16:p:2553-:d:1458846
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/16/2553/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/16/2553/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:16:p:2553-:d:1458846. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.