IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i17p3093-d899925.html
   My bibliography  Save this article

(SDGFI) Student’s Demographic and Geographic Feature Identification Using Machine Learning Techniques for Real-Time Automated Web Applications

Author

Listed:
  • Chaman Verma

    (Department of Media and Educational Informatics, Faculty of Informatics, Eötvös Loránd University, 1053 Budapest, Hungary)

  • Zoltán Illés

    (Department of Media and Educational Informatics, Faculty of Informatics, Eötvös Loránd University, 1053 Budapest, Hungary)

  • Deepak Kumar

    (Apex Institute of Technology, Chandigarh University, Mohali 140413, Punjab, India)

Abstract

Nowadays, Google Forms is becoming a cutting-edge tool for gathering research data in the educational domain. Several researchers are using real-time web applications to collect the responses of respondents. Demographic and geographic features are the most important in the researcher’s study. Identifying students’ demographics (gender, age-group, course, institution, or university) and geographic features (locality and country) is a challenging problem in machine learning. We proposed a novel predictive algorithm, Student Demographic Identification (SDI), to identify a student’s demographic features (age-group, course) with the highest accuracy. SDI has been tested on primary reliable samples. SDI has also been compared with the traditional machine algorithms Random Forest (RF), and Logistic Regression (LR), and Radial Support Vector Machine (R–SVM). The proposed algorithm significantly improved the performance metrics such as accuracy, F1-score, precision, recall, and Matthews Correlation Coefficient (MCC) of these classifiers. We also proposed significant features to identify students’ age-group, course, and gender. SDI has identified the student’s age group with an accuracy of 96% and the course with an accuracy of 97%. Gradient Boosting (GB) has improved the accuracy of LR, R-SVM, and RF to predict the student’s gender. Also, the RF algorithm with the support of GB attained the highest accuracy of 98% to identify the gender of the students. All three classifiers have also identified the student’s locality and institution with an identical accuracy of 99%. Our proposed SDI algorithm may be useful for real-time survey applications to predict students’ demographic features.

Suggested Citation

  • Chaman Verma & Zoltán Illés & Deepak Kumar, 2022. "(SDGFI) Student’s Demographic and Geographic Feature Identification Using Machine Learning Techniques for Real-Time Automated Web Applications," Mathematics, MDPI, vol. 10(17), pages 1-21, August.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:17:p:3093-:d:899925
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/17/3093/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/17/3093/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Neda Sharifi Asadi Malafe & Masoud Ahmadi & Fahime Baei, 2017. "The Relationship between Demographic Characteristics with Information and Communication Technology and Empowerment in General Organizations (Case Study: Sari Municipality)," International Review of Management and Marketing, Econjournals, vol. 7(2), pages 71-75.
    2. Tarik SEVINDI, 2020. "Investigation of Social Appearance Anxiety of Students of Faculty of Sport Sciences and Faculty of Education in Terms of Some Variables," Asian Journal of Education and Training, Asian Online Journal Publishing Group, vol. 6(3), pages 541-545.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Deepak Kumar & Chaman Verma & Pradeep Kumar Singh & Maria Simona Raboaca & Raluca-Andreea Felseghi & Kayhan Zrar Ghafoor, 2021. "Computational Statistics and Machine Learning Techniques for Effective Decision Making on Student’s Employment for Real-Time," Mathematics, MDPI, vol. 9(11), pages 1-29, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:17:p:3093-:d:899925. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.