IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0112987.html
   My bibliography  Save this article

A Novel Hybrid Classification Model of Genetic Algorithms, Modified k-Nearest Neighbor and Developed Backpropagation Neural Network

Author

Listed:
  • Nader Salari
  • Shamarina Shohaimi
  • Farid Najafi
  • Meenakshii Nallappan
  • Isthrinayagy Karishnarajah

Abstract

Among numerous artificial intelligence approaches, k-Nearest Neighbor algorithms, genetic algorithms, and artificial neural networks are considered as the most common and effective methods in classification problems in numerous studies. In the present study, the results of the implementation of a novel hybrid feature selection-classification model using the above mentioned methods are presented. The purpose is benefitting from the synergies obtained from combining these technologies for the development of classification models. Such a combination creates an opportunity to invest in the strength of each algorithm, and is an approach to make up for their deficiencies. To develop proposed model, with the aim of obtaining the best array of features, first, feature ranking techniques such as the Fisher's discriminant ratio and class separability criteria were used to prioritize features. Second, the obtained results that included arrays of the top-ranked features were used as the initial population of a genetic algorithm to produce optimum arrays of features. Third, using a modified k-Nearest Neighbor method as well as an improved method of backpropagation neural networks, the classification process was advanced based on optimum arrays of the features selected by genetic algorithms. The performance of the proposed model was compared with thirteen well-known classification models based on seven datasets. Furthermore, the statistical analysis was performed using the Friedman test followed by post-hoc tests. The experimental findings indicated that the novel proposed hybrid model resulted in significantly better classification performance compared with all 13 classification methods. Finally, the performance results of the proposed model was benchmarked against the best ones reported as the state-of-the-art classifiers in terms of classification accuracy for the same data sets. The substantial findings of the comprehensive comparative study revealed that performance of the proposed model in terms of classification accuracy is desirable, promising, and competitive to the existing state-of-the-art classification models.

Suggested Citation

  • Nader Salari & Shamarina Shohaimi & Farid Najafi & Meenakshii Nallappan & Isthrinayagy Karishnarajah, 2014. "A Novel Hybrid Classification Model of Genetic Algorithms, Modified k-Nearest Neighbor and Developed Backpropagation Neural Network," PLOS ONE, Public Library of Science, vol. 9(11), pages 1-50, November.
  • Handle: RePEc:plo:pone00:0112987
    DOI: 10.1371/journal.pone.0112987
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0112987
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0112987&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0112987?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Shapiro, Arnold F., 2002. "The merging of neural networks, fuzzy logic, and genetic algorithms," Insurance: Mathematics and Economics, Elsevier, vol. 31(1), pages 115-131, August.
    2. Borra, Simone & Di Ciaccio, Agostino, 2010. "Measuring the prediction error. A comparison of cross-validation, bootstrap and covariance penalty methods," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2976-2989, December.
    3. Chakraborty, Sounak, 2009. "Simultaneous cancer classification and gene selection with Bayesian nearest neighbor method: An integrated approach," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1462-1474, February.
    4. Fermín Segovia & Christine Bastin & Eric Salmon & Juan Manuel Górriz & Javier Ramírez & Christophe Phillips, 2014. "Combining PET Images and Neuropsychological Test Data for Automatic Diagnosis of Alzheimer's Disease," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-8, February.
    5. Giuseppe Jurman & Samantha Riccadonna & Cesare Furlanello, 2012. "A Comparison of MCC and CEN Error Measures in Multi-Class Prediction," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-8, August.
    6. Jay M Ver Hoef & Hailemariam Temesgen, 2013. "A Comparison of the Spatial Linear Model to Nearest Neighbor (k-NN) Methods for Forestry Applications," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-13, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuofen Hsu & Chaohsin Lin & Yaling Yang, 2008. "Integrating Neural Networks for Risk‐Adjustment Models," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 75(3), pages 617-642, September.
    2. Shapiro, Arnold F., 2004. "Fuzzy logic in insurance," Insurance: Mathematics and Economics, Elsevier, vol. 35(2), pages 399-424, October.
    3. Jing Li & Kuei-Ying Huang & Jionghua Jin & Jianjun Shi, 2008. "A survey on statistical methods for health care fraud detection," Health Care Management Science, Springer, vol. 11(3), pages 275-287, September.
    4. Abbasabadi, Narjes & Ashayeri, Mehdi & Azari, Rahman & Stephens, Brent & Heidarinejad, Mohammad, 2019. "An integrated data-driven framework for urban energy use modeling (UEUM)," Applied Energy, Elsevier, vol. 253(C), pages 1-1.
    5. Bergmeir, Christoph & Costantini, Mauro & Benítez, José M., 2014. "On the usefulness of cross-validation for directional forecast evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 76(C), pages 132-143.
    6. Bode, Gerrit & Thul, Simon & Baranski, Marc & Müller, Dirk, 2020. "Real-world application of machine-learning-based fault detection trained with experimental data," Energy, Elsevier, vol. 198(C).
    7. Stephan Birle & Mohamed Ahmed Hussein & Thomas Becker, 2016. "Management of Uncertainty by Statistical Process Control and a Genetic Tuned Fuzzy System," Discrete Dynamics in Nature and Society, Hindawi, vol. 2016, pages 1-11, July.
    8. Md. Shafiul Alam & Tanzi Ahmed Chowdhury & Abhishak Dhar & Fahad Saleh Al-Ismail & M. S. H. Choudhury & Md Shafiullah & Md. Ismail Hossain & Md. Alamgir Hossain & Aasim Ullah & Syed Masiur Rahman, 2023. "Solar and Wind Energy Integrated System Frequency Control: A Critical Review on Recent Developments," Energies, MDPI, vol. 16(2), pages 1-31, January.
    9. Dalkilic, Turkan Erbay & Tank, Fatih & Kula, Kamile Sanli, 2009. "Neural networks approach for determining total claim amounts in insurance," Insurance: Mathematics and Economics, Elsevier, vol. 45(2), pages 236-241, October.
    10. Yun Jiang & Li Chen & Hai Zhang & Xiao Xiao, 2019. "Breast cancer histopathological image classification using convolutional neural networks with small SE-ResNet module," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-21, March.
    11. Melissa Adelman & Francisco Haimovich & Andres Ham & Emmanuel Vazquez, 2018. "Predicting school dropout with administrative data: new evidence from Guatemala and Honduras," Education Economics, Taylor & Francis Journals, vol. 26(4), pages 356-372, July.
    12. Belles-Sampera, Jaume & Merigó, José M. & Guillén, Montserrat & Santolino, Miguel, 2013. "The connection between distortion risk measures and ordered weighted averaging operators," Insurance: Mathematics and Economics, Elsevier, vol. 52(2), pages 411-420.
    13. Sancho Salcedo-Sanz & Leo Carro-Calvo & Mercè Claramunt & Ana Castañer & Maite Mármol, 2014. "Effectively Tackling Reinsurance Problems by Using Evolutionary and Swarm Intelligence Algorithms," Risks, MDPI, vol. 2(2), pages 1-14, April.
    14. Kong, Hyeongwoo & Yun, Wonje & Kim, Woo Chang, 2023. "Tracking customer risk aversion," Finance Research Letters, Elsevier, vol. 54(C).
    15. Usta, Ilhan & Kantar, Yeliz Mert, 2011. "On the performance of the flexible maximum entropy distributions within partially adaptive estimation," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2172-2182, June.
    16. Roberto Patuelli & Peter Nijkamp & Simonetta Longhi & Aura Reggiani, 2008. "Neural Networks and Genetic Algorithms as Forecasting Tools: A Case Study on German Regions," Environment and Planning B, , vol. 35(4), pages 701-722, August.
    17. Mehdi Neshat & Ali Akbar Pourahmad & Mohammad Reza Hasani, 2016. "Designing an Adaptive Neuro Fuzzy Inference System for Prediction of Customers Satisfaction," Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 15(04), pages 1-21, December.
    18. Keunhyun Park & Sadegh Sabouri & Torrey Lyons & Guang Tian & Reid Ewing, 2020. "Intrazonal or interzonal? Improving intrazonal travel forecast in a four-step travel demand model," Transportation, Springer, vol. 47(5), pages 2087-2108, October.
    19. Sancho Salcedo-Sanz & L. Carro-Calvo & Mercè Claramunt & Anna Castañer & Maite Marmol, 2013. "An Analysis of Black-box Optimization Problems in Reinsurance: Evolutionary-based Approaches," Working Papers XREAP2013-04, Xarxa de Referència en Economia Aplicada (XREAP), revised May 2013.
    20. Bergmeir, Christoph & Hyndman, Rob J. & Koo, Bonsoo, 2018. "A note on the validity of cross-validation for evaluating autoregressive time series prediction," Computational Statistics & Data Analysis, Elsevier, vol. 120(C), pages 70-83.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0112987. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.