Printed from https://ideas.repec.org/a/eee/ejores/v293y2021i1p24-35.html

A novel embedded min-max approach for feature selection in nonlinear Support Vector Machine classification

Author

Listed:
  • Jiménez-Cordero, Asunción
  • Morales, Juan Miguel
  • Pineda, Salvador

Abstract

In recent years, feature selection has become a challenging problem in several machine learning fields, such as classification problems. Support Vector Machine (SVM) is a well-known technique applied in classification tasks. Various methodologies have been proposed in the literature to select the most relevant features in SVM. Unfortunately, all of them either deal with the feature selection problem in the linear classification setting or propose ad-hoc approaches that are difficult to implement in practice. In contrast, we propose an embedded feature selection method based on a min-max optimization problem, where a trade-off between model complexity and classification accuracy is sought. By leveraging duality theory, we equivalently reformulate the min-max problem and solve it without further ado using off-the-shelf software for nonlinear optimization. The efficiency and usefulness of our approach are tested on several benchmark data sets in terms of accuracy, number of selected features and interpretability.
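The abstract does not spell out the formulation, so the following is only a minimal sketch under stated assumptions, not the authors' model. It assumes that feature selection is embedded in a nonlinear SVM by giving each feature its own nonnegative weight inside a Gaussian kernel, so that a weight of zero removes the feature from the classifier. The inner function is the standard soft-margin SVM dual objective. The names `weighted_gaussian_kernel`, `svm_dual_objective` and `selected_features`, and the tolerance `tol`, are hypothetical and chosen for illustration.

```python
import math


def weighted_gaussian_kernel(x, z, gamma):
    """Gaussian kernel with one nonnegative weight per feature (assumed form).

    K(x, z) = exp(-sum_j gamma[j] * (x[j] - z[j])**2)
    """
    if any(g < 0 for g in gamma):
        raise ValueError("feature weights must be nonnegative")
    return math.exp(-sum(g * (a - b) ** 2 for g, a, b in zip(gamma, x, z)))


def svm_dual_objective(alpha, y, X, gamma):
    """Standard SVM dual objective evaluated with the weighted kernel.

    sum_i alpha_i - 1/2 * sum_{i,j} alpha_i alpha_j y_i y_j K(x_i, x_j)
    """
    n = len(alpha)
    quad = sum(
        alpha[i] * alpha[j] * y[i] * y[j]
        * weighted_gaussian_kernel(X[i], X[j], gamma)
        for i in range(n)
        for j in range(n)
    )
    return sum(alpha) - 0.5 * quad


def selected_features(gamma, tol=1e-6):
    """Indices of features whose weight exceeds the (hypothetical) tolerance."""
    return [j for j, g in enumerate(gamma) if g > tol]


# Toy data: two points that differ strongly in the second feature.
X = [[0.0, 0.0], [1.0, 5.0]]
y = [1, -1]
gamma = [1.0, 0.0]   # second feature switched off by a zero weight
alpha = [1.0, 1.0]   # illustrative dual variables, not an optimal solution

print(selected_features(gamma))
print(svm_dual_objective(alpha, y, X, gamma))
```

In a min-max scheme of the kind the abstract describes, an outer problem would choose `gamma` to balance a complexity penalty against classification performance, while the inner problem maximizes the dual objective over `alpha`. The sketch only evaluates the pieces and does not solve either problem.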

Suggested Citation

  • Jiménez-Cordero, Asunción & Morales, Juan Miguel & Pineda, Salvador, 2021. "A novel embedded min-max approach for feature selection in nonlinear Support Vector Machine classification," European Journal of Operational Research, Elsevier, vol. 293(1), pages 24-35.
  • Handle: RePEc:eee:ejores:v:293:y:2021:i:1:p:24-35
    DOI: 10.1016/j.ejor.2020.12.009

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221720310195
    Download Restriction: Full text for ScienceDirect subscribers only

    References listed on IDEAS

    1. Ghaddar, Bissan & Naoum-Sawaya, Joe, 2018. "High dimensional data classification and feature selection using support vector machines," European Journal of Operational Research, Elsevier, vol. 265(3), pages 993-1004.
    2. Blanquero, R. & Carrizosa, E. & Jiménez-Cordero, A. & Martín-Barragán, B., 2019. "Functional-bandwidth kernel for Support Vector Machine with Functional Data: An alternating optimization algorithm," European Journal of Operational Research, Elsevier, vol. 275(1), pages 195-207.
    3. Bertolazzi, P. & Felici, G. & Festa, P. & Fiscon, G. & Weitschek, E., 2016. "Integer programming models for feature selection: New extensions and a randomized solution algorithm," European Journal of Operational Research, Elsevier, vol. 250(2), pages 389-399.
    4. Li, An-Da & He, Zhen & Wang, Qing & Zhang, Yang, 2019. "Key quality characteristics selection for imbalanced production data using a two-phase bi-objective feature selection method," European Journal of Operational Research, Elsevier, vol. 274(3), pages 978-989.

    Citations

    Cited by:

    1. Ozcan, Erhan C. & Görgülü, Berk & Baydogan, Mustafa G., 2024. "Column generation-based prototype learning for optimizing area under the receiver operating characteristic curve," European Journal of Operational Research, Elsevier, vol. 314(1), pages 297-307.
    2. Mi, Yunlong & Quan, Pei & Shi, Yong & Wang, Zongrun, 2022. "Concept-cognitive computing system for dynamic classification," European Journal of Operational Research, Elsevier, vol. 301(1), pages 287-299.
    3. Fajemisin, Adejuyigbe O. & Maragno, Donato & den Hertog, Dick, 2024. "Optimization with constraint learning: A framework and survey," European Journal of Operational Research, Elsevier, vol. 314(1), pages 1-14.
    4. Yang, Dongchuan & Guo, Ju-e & Li, Yanzhao & Sun, Shaolong & Wang, Shouyang, 2023. "Short-term load forecasting with an improved dynamic decomposition-reconstruction-ensemble approach," Energy, Elsevier, vol. 263(PA).
    5. Labbé, Martine & Landete, Mercedes & Leal, Marina, 2023. "Dendrograms, minimum spanning trees and feature selection," European Journal of Operational Research, Elsevier, vol. 308(2), pages 555-567.
    6. Goodell, John W. & Ben Jabeur, Sami & Saâdaoui, Foued & Nasir, Muhammad Ali, 2023. "Explainable artificial intelligence modeling to forecast bitcoin prices," International Review of Financial Analysis, Elsevier, vol. 88(C).
    7. Lin, Fengming & Fang, Shu-Cherng & Fang, Xiaolei & Gao, Zheming & Luo, Jian, 2024. "A distributionally robust chance-constrained kernel-free quadratic surface support vector machine," European Journal of Operational Research, Elsevier, vol. 316(1), pages 46-60.
    8. Díaz, Verónica & Montoya, Ricardo & Maldonado, Sebastián, 2023. "Preference estimation under bounded rationality: Identification of attribute non-attendance in stated-choice data using a support vector machines approach," European Journal of Operational Research, Elsevier, vol. 304(2), pages 797-812.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, An-Da & He, Zhen & Wang, Qing & Zhang, Yang, 2019. "Key quality characteristics selection for imbalanced production data using a two-phase bi-objective feature selection method," European Journal of Operational Research, Elsevier, vol. 274(3), pages 978-989.
    2. Zhang, Yishi & Zhu, Ruilin & Chen, Zhijun & Gao, Jie & Xia, De, 2021. "Evaluating and selecting features via information theoretic lower bounds of feature inner correlations for high-dimensional data," European Journal of Operational Research, Elsevier, vol. 290(1), pages 235-247.
    3. Jiang, He & Tao, Changqi & Dong, Yao & Xiong, Ren, 2021. "Robust low-rank multiple kernel learning with compound regularization," European Journal of Operational Research, Elsevier, vol. 295(2), pages 634-647.
    4. Gao, Zheming & Fang, Shu-Cherng & Luo, Jian & Medhin, Negash, 2021. "A kernel-free double well potential support vector machine with applications," European Journal of Operational Research, Elsevier, vol. 290(1), pages 248-262.
    5. Manlio Gaudioso & Giovanni Giallombardo & Giovanna Miglionico, 2023. "Sparse optimization via vector k-norm and DC programming with an application to feature selection for support vector machines," Computational Optimization and Applications, Springer, vol. 86(2), pages 745-766, November.
    6. Ni, Ji & Chen, Bowei & Allinson, Nigel M. & Ye, Xujiong, 2020. "A hybrid model for predicting human physical activity status from lifelogging data," European Journal of Operational Research, Elsevier, vol. 281(3), pages 532-542.
    7. Basna Mohammed Salih Hasan & Nawzat Sadiq Ahmed, 2021. "Feature selection technique applied in Medical application by Supervised algorithm: A Review," International Journal of Science and Business, IJSAB International, vol. 5(3), pages 190-203.
    8. Davila-Pena, Laura & García-Jurado, Ignacio & Casas-Méndez, Balbina, 2022. "Assessment of the influence of features on a classification problem: An application to COVID-19 patients," European Journal of Operational Research, Elsevier, vol. 299(2), pages 631-641.
    9. Daehan Won & Hasan Manzour & Wanpracha Chaovalitwongse, 2020. "Convex Optimization for Group Feature Selection in Networked Data," INFORMS Journal on Computing, INFORMS, vol. 32(1), pages 182-198, January.
    10. Pi, J. & Wang, Honggang & Pardalos, Panos M., 2021. "A dual reformulation and solution framework for regularized convex clustering problems," European Journal of Operational Research, Elsevier, vol. 290(3), pages 844-856.
    11. He Jiang, 2023. "Robust forecasting in spatial autoregressive model with total variation regularization," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(2), pages 195-211, March.
    12. Jimenez-Marquez, Jose Luis & Gonzalez-Carrasco, Israel & Lopez-Cuadrado, Jose Luis & Ruiz-Mezcua, Belen, 2019. "Towards a big data framework for analyzing social media content," International Journal of Information Management, Elsevier, vol. 44(C), pages 1-12.
    13. Emilio Carrizosa & Cristina Molero-Río & Dolores Romero Morales, 2021. "Mathematical optimization in classification and regression trees," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(1), pages 5-33, April.
    14. Liangjun Wu & Lihui Yang & Yabin Li & Jian Shi & Xiaochen Zhu & Yan Zeng, 2024. "Evaluation of the Habitat Suitability for Zhuji Torreya Based on Machine Learning Algorithms," Agriculture, MDPI, vol. 14(7), pages 1-17, July.
    15. Ghaddar, Bissan & Naoum-Sawaya, Joe, 2018. "High dimensional data classification and feature selection using support vector machines," European Journal of Operational Research, Elsevier, vol. 265(3), pages 993-1004.
    16. Giovanni Felici & Kumar Parijat Tripathi & Daniela Evangelista & Mario Rosario Guarracino, 2017. "A mixed integer programming-based global optimization framework for analyzing gene expression data," Journal of Global Optimization, Springer, vol. 69(3), pages 727-744, November.
    17. Bottmer, Lea & Croux, Christophe & Wilms, Ines, 2022. "Sparse regression for large data sets with outliers," European Journal of Operational Research, Elsevier, vol. 297(2), pages 782-794.
    18. Jiang, He & Luo, Shihua & Dong, Yao, 2021. "Simultaneous feature selection and clustering based on square root optimization," European Journal of Operational Research, Elsevier, vol. 289(1), pages 214-231.
    19. Douek-Pinkovich, Yifat & Ben-Gal, Irad & Raviv, Tal, 2022. "The stochastic test collection problem: Models, exact and heuristic solution approaches," European Journal of Operational Research, Elsevier, vol. 299(3), pages 945-959.
    20. Zhang, Yucheng & Xu, Shan & Zhang, Long & Yang, Mengxi, 2021. "Big data and human resource management research: An integrative review and new directions for future research," Journal of Business Research, Elsevier, vol. 133(C), pages 34-50.
