IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v276y2019i1d10.1007_s10479-018-2972-2.html
   My bibliography  Save this article

Feature importance ranking for classification in mixed online environments

Author

Listed:
  • Alaleh Razmjoo

    (University of Central Florida)

  • Petros Xanthopoulos

    (Stetson University)

  • Qipeng Phil Zheng

    (University of Central Florida)

Abstract

Online learning is a growing branch of machine learning with applications in many domains. One of the less studied topics in this area is development of strategies for online feature importance ranking. In this paper we present two methods for incremental ranking of features in classification tasks. Our ranking strategies are based on measurement of the sensitivity of the classification outcome with respect to individual features. The two methods work for different types of classification environments with discrete, continuous and mixed feature types with minimum prior assumptions. The second method, which is a modification of the original method, is designed to handle concept drift while avoiding cumbersome computations. Concept drift is described as sudden or slow changes in characteristics of the learning features which happens in many online learning tasks such as online marketing analysis. If the rankings are not adaptable, during the time, these changes will make the rankings obsolete. Moreover, we investigate different feature selection schemes for feature reduction in online environments to effectively remove irrelevant features from the classification model. Finally, we present experimental results which verify the efficacy of our methods against currently available online feature ranking algorithms.

Suggested Citation

  • Alaleh Razmjoo & Petros Xanthopoulos & Qipeng Phil Zheng, 2019. "Feature importance ranking for classification in mixed online environments," Annals of Operations Research, Springer, vol. 276(1), pages 315-330, May.
  • Handle: RePEc:spr:annopr:v:276:y:2019:i:1:d:10.1007_s10479-018-2972-2
    DOI: 10.1007/s10479-018-2972-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-018-2972-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-018-2972-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hoai An Le Thi & Manh Cuong Nguyen, 2017. "DCA based algorithms for feature selection in multi-class support vector machine," Annals of Operations Research, Springer, vol. 249(1), pages 273-300, February.
    2. Ya-Ju Fan & Wanpracha Chaovalitwongse, 2010. "Optimizing feature selection to improve medical diagnosis," Annals of Operations Research, Springer, vol. 174(1), pages 169-183, February.
    3. Ying Liu & Hong Li & Geng Peng & Benfu Lv & Chong Zhang, 2015. "Online purchaser segmentation and promotion strategy selection: evidence from Chinese E-commerce market," Annals of Operations Research, Springer, vol. 233(1), pages 263-279, October.
    4. Hsinchun Chen, 2003. "Introduction to the JASIST Special Topic issue on web retrieval and mining: A machine learning perspective," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 54(7), pages 621-624, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jie Wu & Zhixin Chen & Xiang Ji, 2020. "Sustainable trade promotion decisions under demand disruption in manufacturer-retailer supply chains," Annals of Operations Research, Springer, vol. 290(1), pages 115-143, July.
    2. Haoyu Liu & Kim Hua Tan & Xianfeng Wu, 2023. "Who’s watching? Classifying sports viewers on social live streaming services," Annals of Operations Research, Springer, vol. 325(1), pages 743-765, June.
    3. Qing Zhu & Renxian Zuo & Shan Liu & Fan Zhang, 2020. "Online dynamic group-buying community analysis based on high frequency time series simulation," Electronic Commerce Research, Springer, vol. 20(1), pages 81-118, March.
    4. Kamyab Karimi & Ali Ghodratnama & Reza Tavakkoli-Moghaddam, 2023. "Two new feature selection methods based on learn-heuristic techniques for breast cancer prediction: a comprehensive analysis," Annals of Operations Research, Springer, vol. 328(1), pages 665-700, September.
    5. Tomasz Hachaj & Marek R. Ogiela & Katarzyna Koptyra, 2018. "Human actions recognition from motion capture recordings using signal resampling and pattern recognition methods," Annals of Operations Research, Springer, vol. 265(2), pages 223-239, June.
    6. Talayeh Razzaghi & Ilya Safro & Joseph Ewing & Ehsan Sadrfaridpour & John D. Scott, 2019. "Predictive models for bariatric surgery risks with imbalanced medical datasets," Annals of Operations Research, Springer, vol. 280(1), pages 1-18, September.
    7. Erfan Mehmanchi & Andrés Gómez & Oleg A. Prokopyev, 2021. "Solving a class of feature selection problems via fractional 0–1 programming," Annals of Operations Research, Springer, vol. 303(1), pages 265-295, August.
    8. Kazim Topuz & Hasmet Uner & Asil Oztekin & Mehmet Bayram Yildirim, 2018. "Predicting pediatric clinic no-shows: a decision analytic framework using elastic net and Bayesian belief network," Annals of Operations Research, Springer, vol. 263(1), pages 479-499, April.
    9. Hui Yuan & Wei Xu & Qian Li & Raymond Lau, 2018. "Topic sentiment mining for sales performance prediction in e-commerce," Annals of Operations Research, Springer, vol. 270(1), pages 553-576, November.
    10. Şenay Yaşar Sağlam & W. Nick Street, 2018. "Distant diversity in dynamic class prediction," Annals of Operations Research, Springer, vol. 263(1), pages 5-19, April.
    11. Xinghua Fang & Jian Zhou & Hongya Zhao & Yizeng Chen, 2022. "A biclustering-based heterogeneous customer requirement determination method from customer participation in product development," Annals of Operations Research, Springer, vol. 309(2), pages 817-835, February.
    12. Sylvie Tchumtchoua & Dipak Dey, 2012. "Modeling Associations Among Multivariate Longitudinal Categorical Variables in Survey Data: A Semiparametric Bayesian Approach," Psychometrika, Springer;The Psychometric Society, vol. 77(4), pages 670-692, October.
    13. Farid Huseynov & Sevgi Özkan Yıldırım, 2019. "Online Consumer Typologies and Their Shopping Behaviors in B2C E-Commerce Platforms," SAGE Open, , vol. 9(2), pages 21582440198, May.
    14. Ning Wang & Zhuo Zhang & Jiao Zhao & Dawei Hu, 2022. "Recognition method of equipment state with the FLDA based Mahalanobis–Taguchi system," Annals of Operations Research, Springer, vol. 311(1), pages 417-435, April.
    15. Nan Hu & Kevin E. Dow & Alain Yee Loong Chong & Ling Liu, 2018. "Double learning or double blinding: an investigation of vendor private information acquisition and consumer learning via online reviews," Annals of Operations Research, Springer, vol. 270(1), pages 213-234, November.
    16. Daniel Gartner & Rainer Kolisch & Daniel B. Neill & Rema Padman, 2015. "Machine Learning Approaches for Early DRG Classification and Resource Allocation," INFORMS Journal on Computing, INFORMS, vol. 27(4), pages 718-734, November.
    17. Madjid Tavana & Debora Di Caprio & Francisco J. Santos-Arteaga, 2016. "Loyal customer bases as innovation disincentives for duopolistic firms using strategic signaling and Bayesian analysis," Annals of Operations Research, Springer, vol. 244(2), pages 647-676, September.
    18. Lei Li & Ting Chi & Tongtong Hao & Tao Yu, 2018. "Customer demand analysis of the electronic commerce supply chain using Big Data," Annals of Operations Research, Springer, vol. 268(1), pages 113-128, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:276:y:2019:i:1:d:10.1007_s10479-018-2972-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.