Author
Listed:
- Muhammed Kürşad Uçar
- Majid Nour
- Hatem Sindi
- Kemal Polat
Abstract
Training and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally theoretical. Besides, there is no useful model for how to select samples in the training and testing process. Therefore, there is a need for resources in machine learning that discuss the training and testing process in detail and offer new recommendations. This article provides a detailed analysis of the training and testing process in machine learning. The article has the following sections. The third section describes how to prepare the datasets. Four balanced datasets were used for the application. The fourth section describes the rate and how to select samples at the training and testing stage. The fundamental sampling theorem is the subject of statistics. It shows how to select samples. In this article, it has been proposed to use sampling methods in machine learning training and testing process. The fourth section covers the theoretic expression of four different sampling theorems. Besides, the results section has the results of the performance of sampling theorems. The fifth section describes the methods by which training and pretest features can be selected. In the study, three different classifiers control the performance. The results section describes how the results should be analyzed. Additionally, this article proposes performance evaluation methods to evaluate its results. This article examines the effect of the training and testing process on performance in machine learning in detail and proposes the use of sampling theorems for the training and testing process. According to the results, datasets, feature selection algorithms, classifiers, training, and test ratio are the criteria that directly affect performance. However, the methods of selecting samples at the training and testing stages are vital for the system to work correctly. In order to design a stable system, it is recommended that samples should be selected with a stratified systematic sampling theorem.
Suggested Citation
Muhammed Kürşad Uçar & Majid Nour & Hatem Sindi & Kemal Polat, 2020.
"The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets,"
Mathematical Problems in Engineering, Hindawi, vol. 2020, pages 1-17, May.
Handle:
RePEc:hin:jnlmpe:2836236
DOI: 10.1155/2020/2836236
Download full text from publisher
Citations
Citations are extracted by the
CitEc Project, subscribe to its
RSS feed for this item.
Cited by:
- Wei Dong & Hongzhen Liu, 2024.
"Distributed Sparse Precision Matrix Estimation via Alternating Block-Based Gradient Descent,"
Mathematics, MDPI, vol. 12(5), pages 1-15, February.
- Haojie Lian & Pengfei Sun & Zhuxuan Meng & Shengze Li & Peng Wang & Yilin Qu, 2023.
"LIDAR Point Cloud Augmentation for Dusty Weather Based on a Physical Simulation,"
Mathematics, MDPI, vol. 12(1), pages 1-15, December.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:jnlmpe:2836236. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.