IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v50y2023i3p934-961.html
   My bibliography  Save this article

Sparse concordance‐based ordinal classification

Author

Listed:
  • Yiwei Fan
  • Jiaqi Gu
  • Guosheng Yin

Abstract

Ordinal classification is an important area in statistical machine learning, where labels exhibit a natural order. One of the major goals in ordinal classification is to correctly predict the relative order of instances. We develop a novel concordance‐based approach to ordinal classification, where a concordance function is introduced and a penalized smoothed method for optimization is designed. Variable selection using the L1$$ {L}_1 $$ penalty is incorporated for sparsity considerations. Within the set of classification rules that maximize the concordance function, we find optimal thresholds to predict labels by minimizing a loss function. After building the classifier, we derive nonparametric estimation of class conditional probabilities. The asymptotic properties of the estimators as well as the variable selection consistency are established. Extensive simulations and real data applications show the robustness and advantage of the proposed method in terms of classification accuracy, compared with other existing methods.

Suggested Citation

  • Yiwei Fan & Jiaqi Gu & Guosheng Yin, 2023. "Sparse concordance‐based ordinal classification," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(3), pages 934-961, September.
  • Handle: RePEc:bla:scjsta:v:50:y:2023:i:3:p:934-961
    DOI: 10.1111/sjos.12606
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/sjos.12606
    Download Restriction: no

    File URL: https://libkey.io/10.1111/sjos.12606?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yuanshan Wu & Yanyuan Ma & Guosheng Yin, 2015. "Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1670-1683, December.
    2. Chun Li & Bryan E. Shepherd, 2012. "A new residual for ordinal outcomes," Biometrika, Biometrika Trust, vol. 99(2), pages 473-480.
    3. Yee, Thomas W., 2010. "The VGAM Package for Categorical Data Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i10).
    4. Wang, Junhui & Shen, Xiaotong & Pan, Wei, 2009. "On Large Margin Hierarchical Classification With Multiple Paths," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1213-1223.
    5. Bercedis Peterson & Frank E. Harrell, 1990. "Partial Proportional Odds Models for Ordinal Response Variables," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 39(2), pages 205-217, June.
    6. Janitza, Silke & Tutz, Gerhard & Boulesteix, Anne-Laure, 2016. "Random forest for ordinal responses: Prediction and variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 96(C), pages 57-73.
    7. Sherman, Robert P, 1993. "The Limiting Distribution of the Maximum Rank Correlation Estimator," Econometrica, Econometric Society, vol. 61(1), pages 123-137, January.
    8. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    9. Rainer Hirk & Kurt Hornik & Laura Vana, 2019. "Multivariate ordinal regression models: an analysis of corporate credit ratings," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(3), pages 507-539, September.
    10. Lin, Huazhen & Peng, Heng, 2013. "Smoothed rank correlation of the linear transformation regression model," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 615-630.
    11. Caiyun Fan & Wenbin Lu & Rui Song & Yong Zhou, 2017. "Concordance-assisted learning for estimating optimal individualized treatment regimes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(5), pages 1565-1582, November.
    12. Han, Aaron K., 1987. "Non-parametric analysis of a generalized regression model : The maximum rank correlation estimator," Journal of Econometrics, Elsevier, vol. 35(2-3), pages 303-316, July.
    13. Horowitz, Joel L, 1992. "A Smoothed Maximum Score Estimator for the Binary Response Model," Econometrica, Econometric Society, vol. 60(3), pages 505-531, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Haixiang & Huang, Jian & Sun, Liuquan, 2020. "A rank-based approach to estimating monotone individualized two treatment regimes," Computational Statistics & Data Analysis, Elsevier, vol. 151(C).
    2. Xiao Song & Shuangge Ma, 2010. "Penalised variable selection with U-estimates," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 22(4), pages 499-515.
    3. Lin, Xiefang & Fang, Fang, 2024. "Variable selection of Kolmogorov-Smirnov maximization with a penalized surrogate loss," Computational Statistics & Data Analysis, Elsevier, vol. 195(C).
    4. Patrick Bajari & Jeremy Fox & Stephen Ryan, 2008. "Evaluating wireless carrier consolidation using semiparametric demand estimation," Quantitative Marketing and Economics (QME), Springer, vol. 6(4), pages 299-338, December.
    5. repec:hal:wpspec:info:hdl:2441/3vl5fe4i569nbr005tctlc8ll5 is not listed on IDEAS
    6. Shakeeb Khan & Arnaud Maurel & Yichong Zhang, 2023. "Informational Content of Factor Structures in Simultaneous Binary Response Models," Advances in Econometrics, in: Essays in Honor of Joon Y. Park: Econometric Methodology in Empirical Applications, volume 45, pages 385-410, Emerald Group Publishing Limited.
    7. Sokbae Lee & Myung Hwan Seo & Youngki Shin, 2017. "Correction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 883-883, April.
    8. Coppejans, Mark, 2001. "Estimation of the binary response model using a mixture of distributions estimator (MOD)," Journal of Econometrics, Elsevier, vol. 102(2), pages 231-269, June.
    9. Tan, Xin Lu, 2019. "Optimal estimation of slope vector in high-dimensional linear transformation models," Journal of Multivariate Analysis, Elsevier, vol. 169(C), pages 179-204.
    10. Jochmans, Koen, 2015. "Multiplicative-error models with sample selection," Journal of Econometrics, Elsevier, vol. 184(2), pages 315-327.
    11. repec:hal:spmain:info:hdl:2441/3vl5fe4i569nbr005tctlc8ll5 is not listed on IDEAS
    12. Gerhard Tutz & Moritz Berger, 2022. "Sparser Ordinal Regression Models Based on Parametric and Additive Location‐Shift Approaches," International Statistical Review, International Statistical Institute, vol. 90(2), pages 306-327, August.
    13. Chen, Songnian, 2000. "Efficient estimation of binary choice models under symmetry," Journal of Econometrics, Elsevier, vol. 96(1), pages 183-199, May.
    14. Horowitz, Joel L., 2002. "Bootstrap critical values for tests based on the smoothed maximum score estimator," Journal of Econometrics, Elsevier, vol. 111(2), pages 141-167, December.
    15. Chen, Songnian & Zhou, Yahong, 2007. "Estimating a generalized correlation coefficient for a generalized bivariate probit model," Journal of Econometrics, Elsevier, vol. 141(2), pages 1100-1114, December.
    16. Yan, Jin & Yoo, Hong Il, 2019. "Semiparametric estimation of the random utility model with rank-ordered choice data," Journal of Econometrics, Elsevier, vol. 211(2), pages 414-438.
    17. repec:spo:wpmain:info:hdl:2441/3vl5fe4i569nbr005tctlc8ll5 is not listed on IDEAS
    18. Shakeeb Khan & Fu Ouyang & Elie Tamer, 2021. "Inference on semiparametric multinomial response models," Quantitative Economics, Econometric Society, vol. 12(3), pages 743-777, July.
    19. Takahiro ITO, 2024. "Binary and Ordered Response Models in Randomized Experiments: Applications of the Resampling-Based Maximum Likelihood Method," GSICS Working Paper Series 42, Graduate School of International Cooperation Studies, Kobe University.
    20. repec:spo:wpecon:info:hdl:2441/3vl5fe4i569nbr005tctlc8ll5 is not listed on IDEAS
    21. Gallant, A. Ronald & Hong, Han & Leung, Michael P. & Li, Jessie, 2022. "Constrained estimation using penalization and MCMC," Journal of Econometrics, Elsevier, vol. 228(1), pages 85-106.
    22. Chen, Songnian, 2000. "Rank estimation of a location parameter in the binary choice model," Journal of Econometrics, Elsevier, vol. 98(2), pages 317-334, October.
    23. Chen, Songnian & Zhang, Hanghui, 2020. "n-prediction of generalized heteroscedastic transformation regression models," Journal of Econometrics, Elsevier, vol. 215(2), pages 305-340.
    24. Fu Ouyang & Thomas Tao Yang, 2020. "Semiparametric Discrete Choice Models for Bundles," Discussion Papers Series 625, School of Economics, University of Queensland, Australia.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:50:y:2023:i:3:p:934-961. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.