IDEAS home Printed from https://ideas.repec.org/a/eee/jomega/v59y2016ipap40-46.html
   My bibliography  Save this article

Constrained subspace classifier for high dimensional datasets

Author

Listed:
  • Panagopoulos, Orestis P.
  • Pappu, Vijay
  • Xanthopoulos, Petros
  • Pardalos, Panos M.

Abstract

Datasets with significantly larger number of features, compared to samples, pose a serious challenge in supervised learning. Such datasets arise in various areas including business analytics. In this paper, a new binary classification method called constrained subspace classifier (CSC) is proposed for such high dimensional datasets. CSC improves on an earlier proposed classification method called local subspace classifier (LSC) by accounting for the relative angle between subspaces while approximating the classes with individual subspaces. CSC is formulated as an optimization problem and can be solved by an efficient alternating optimization technique. Classification performance is tested in publicly available datasets. The improvement in classification accuracy over LSC shows the importance of considering the relative angle between the subspaces while approximating the classes. Additionally, CSC appears to be a robust classifier, compared to traditional two step methods that perform feature selection and classification in two distinct steps.

Suggested Citation

  • Panagopoulos, Orestis P. & Pappu, Vijay & Xanthopoulos, Petros & Pardalos, Panos M., 2016. "Constrained subspace classifier for high dimensional datasets," Omega, Elsevier, vol. 59(PA), pages 40-46.
  • Handle: RePEc:eee:jomega:v:59:y:2016:i:pa:p:40-46
    DOI: 10.1016/j.omega.2015.05.009
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0305048315001188
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.omega.2015.05.009?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "High-Dimensional Methods and Inference on Structural and Treatment Effects," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 29-50, Spring.
    2. K. Kampa & S. Mehta & C. Chou & W. Chaovalitwongse & T. Grabowski, 2014. "Sparse optimization in feature selection: application in neuroimaging," Journal of Global Optimization, Springer, vol. 59(2), pages 439-457, July.
    3. Unler, Alper & Murat, Alper, 2010. "A discrete particle swarm optimization method for feature selection in binary classification problems," European Journal of Operational Research, Elsevier, vol. 206(3), pages 528-539, November.
    4. Claudio Cifarelli & Mario R. Guarracino & Onur Seref & Salvatore Cuciniello & Panos M. Pardalos, 2007. "Incremental Classification with Generalized Eigenvalues," Journal of Classification, Springer;The Classification Society, vol. 24(2), pages 205-219, September.
    5. Michael B. Fenn & Vijay Pappu, 2012. "Data Mining for Cancer Biomarkers with Raman Spectroscopy," Springer Optimization and Its Applications, in: Panos M. Pardalos & Petros Xanthopoulos & Michalis Zervakis (ed.), Data Mining for Biomarker Discovery, edition 127, chapter 0, pages 143-168, Springer.
    6. Li, Baibing & Martin, Elaine B. & Morris, A. Julian, 2002. "On principal component analysis in L1," Computational Statistics & Data Analysis, Elsevier, vol. 40(3), pages 471-474, September.
    7. Petros Xanthopoulos & Mario Guarracino & Panos Pardalos, 2014. "Robust generalized eigenvalue classifier with ellipsoidal uncertainty," Annals of Operations Research, Springer, vol. 216(1), pages 327-342, May.
    8. Wang, F. K. & Du, T. C. T., 2000. "Using principal component analysis in process performance for multivariate data," Omega, Elsevier, vol. 28(2), pages 185-194, April.
    9. Shanmugam, Ramalingam & Johnson, Charles, 2007. "At a crossroad of data envelopment and principal component analyses," Omega, Elsevier, vol. 35(4), pages 351-364, August.
    10. Garci'a Lopez, Felix & Garci'a Torres, Miguel & Melian Batista, Belen & Moreno Perez, Jose A. & Moreno-Vega, J. Marcos, 2006. "Solving feature subset selection problem by a Parallel Scatter Search," European Journal of Operational Research, Elsevier, vol. 169(2), pages 477-489, March.
    11. (Bill) Tseng, Tzu-Liang & Huang, Chun-Che, 2007. "Rough set-based approach to feature selection in customer relationship management," Omega, Elsevier, vol. 35(4), pages 365-383, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Orestis P. Panagopoulos & Petros Xanthopoulos & Talayeh Razzaghi & Onur Şeref, 2019. "Relaxed support vector regression," Annals of Operations Research, Springer, vol. 276(1), pages 191-210, May.
    2. Carrizosa, Emilio & Nogales-Gómez, Amaya & Romero Morales, Dolores, 2017. "Clustering categories in support vector machines," Omega, Elsevier, vol. 66(PA), pages 28-37.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lei Wang & Yan Yan & Xiaoteng Li & Xiaosong Chen, 2018. "General Component Analysis (GCA): A new approach to identify Chinese corporate bond market structures," PLOS ONE, Public Library of Science, vol. 13(7), pages 1-18, July.
    2. Orestis P. Panagopoulos & Petros Xanthopoulos & Talayeh Razzaghi & Onur Şeref, 2019. "Relaxed support vector regression," Annals of Operations Research, Springer, vol. 276(1), pages 191-210, May.
    3. Onur Şeref & Talayeh Razzaghi & Petros Xanthopoulos, 2017. "Weighted relaxed support vector machines," Annals of Operations Research, Springer, vol. 249(1), pages 235-271, February.
    4. Juan Carlos Chávez & Felipe J. Fonseca & Manuel Gómez-Zaldívar, 2017. "Resoluciones de disputas comerciales y desempeño económico regional en México. (Commercial Disputes Resolution and Regional Economic Performance in Mexico)," Ensayos Revista de Economia, Universidad Autonoma de Nuevo Leon, Facultad de Economia, vol. 0(1), pages 79-93, May.
    5. Chen, Ray-Bing & Chen, Ying & Härdle, Wolfgang K., 2014. "TVICA—Time varying independent component analysis and its application to financial data," Computational Statistics & Data Analysis, Elsevier, vol. 74(C), pages 95-109.
    6. Gonzalez, Felipe & Prem, Mounu & von Dessauer, Cristine, 2023. "Empowerment or Indoctrination? Women Centers Under Dictatorship," SocArXiv 64mf9, Center for Open Science.
    7. Yan Yu Chen & Chun-Cheih Chao & Fu-Chen Liu & Po-Chen Hsu & Hsueh-Fen Chen & Shih-Chi Peng & Yung-Jen Chuang & Chung-Yu Lan & Wen-Ping Hsieh & David Shan Hill Wong, 2013. "Dynamic Transcript Profiling of Candida albicans Infection in Zebrafish: A Pathogen-Host Interaction Study," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-16, September.
    8. Plat, Richard, 2009. "Stochastic portfolio specific mortality and the quantification of mortality basis risk," Insurance: Mathematics and Economics, Elsevier, vol. 45(1), pages 123-132, August.
    9. Kondylis, Athanassios & Whittaker, Joe, 2008. "Spectral preconditioning of Krylov spaces: Combining PLS and PC regression," Computational Statistics & Data Analysis, Elsevier, vol. 52(5), pages 2588-2603, January.
    10. Simplice A. Asongu & Nicholas M. Odhiambo, 2019. "Governance, capital flight and industrialisation in Africa," Journal of Economic Structures, Springer;Pan-Pacific Association of Input-Output Studies (PAPAIOS), vol. 8(1), pages 1-22, December.
    11. M. J. Aziakpono & S. Kleimeier & H. Sander, 2012. "Banking market integration in the SADC countries: evidence from interest rate analyses," Applied Economics, Taylor & Francis Journals, vol. 44(29), pages 3857-3876, October.
    12. Anil Kumar, 2018. "Do Restrictions on Home Equity Extraction Contribute to Lower Mortgage Defaults? Evidence from a Policy Discontinuity at the Texas Border," American Economic Journal: Economic Policy, American Economic Association, vol. 10(1), pages 268-297, February.
    13. Bianca Maria Colosimo & Luca Pagani & Marco Grasso, 2024. "Modeling spatial point processes in video-imaging via Ripley’s K-function: an application to spatter analysis in additive manufacturing," Journal of Intelligent Manufacturing, Springer, vol. 35(1), pages 429-447, January.
    14. Ay, Jean-Sauveur & Le Gallo, Julie, 2021. "The Signaling Values of Nested Wine Names," Working Papers 321851, American Association of Wine Economists.
    15. Ouyang, Yaofu & Li, Peng, 2018. "On the nexus of financial development, economic growth, and energy consumption in China: New perspective from a GMM panel VAR approach," Energy Economics, Elsevier, vol. 71(C), pages 238-252.
    16. Fan, Cheng & Sun, Yongjun & Zhao, Yang & Song, Mengjie & Wang, Jiayuan, 2019. "Deep learning-based feature engineering methods for improved building energy prediction," Applied Energy, Elsevier, vol. 240(C), pages 35-45.
    17. Ionela Munteanu & Adriana Grigorescu & Elena Condrea & Elena Pelinescu, 2020. "Convergent Insights for Sustainable Development and Ethical Cohesion: An Empirical Study on Corporate Governance in Romanian Public Entities," Sustainability, MDPI, vol. 12(7), pages 1-17, April.
    18. Daniel Boss & Annick Hoffmann & Benjamin Rappaz & Christian Depeursinge & Pierre J Magistretti & Dimitri Van de Ville & Pierre Marquet, 2012. "Spatially-Resolved Eigenmode Decomposition of Red Blood Cells Membrane Fluctuations Questions the Role of ATP in Flickering," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-10, August.
    19. Doukas, Haris & Papadopoulou, Alexandra & Savvakis, Nikolaos & Tsoutsos, Theocharis & Psarras, John, 2012. "Assessing energy sustainability of rural communities using Principal Component Analysis," Renewable and Sustainable Energy Reviews, Elsevier, vol. 16(4), pages 1949-1957.
    20. Yuexin Li & Xiaoyin Ma & Luc Renneboog, 2024. "In Art We Trust," Management Science, INFORMS, vol. 70(1), pages 98-127, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jomega:v:59:y:2016:i:pa:p:40-46. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/375/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.