IDEAS home Printed from https://ideas.repec.org/a/spr/jclass/v38y2021i3d10.1007_s00357-021-09388-3.html
   My bibliography  Save this article

A Model-Free Subject Selection Method for Active Learning Classification Procedures

Author

Listed:
  • Bo-Shiang Ke

    (National Chiao Tung University)

  • Yuan-chin Ivan Chang

    (Academia Sinica)

Abstract

To construct a classification rule via an active learning method, during the learning process, users select training subjects sequentially, without knowing their labels, based on the model learned at the current stage. For a parametric-model-based classification rule, methods of statistical experimental design are popular guidelines for selecting new learning subjects. However, there is a lack of a counterpart for non-parametric-model-based classifiers, such as support vector machines. Thus, we propose a subject selection scheme via an extended influential index for the area under a receiver operating characteristic curve, which is applicable to general classifiers with continuous scores.

Suggested Citation

  • Bo-Shiang Ke & Yuan-chin Ivan Chang, 2021. "A Model-Free Subject Selection Method for Active Learning Classification Procedures," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 544-555, October.
  • Handle: RePEc:spr:jclass:v:38:y:2021:i:3:d:10.1007_s00357-021-09388-3
    DOI: 10.1007/s00357-021-09388-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00357-021-09388-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00357-021-09388-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Deng, Xinwei & Joseph, V. Roshan & Sudjianto, Agus & Wu, C. F. Jeff, 2009. "Active Learning Through Sequential Design, With Applications to Detection of Money Laundering," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 969-981.
    2. Margaret Sullivan Pepe & Tianxi Cai, 2004. "The Analysis of Placement Values for Evaluating Discriminatory Measures," Biometrics, The International Biometric Society, vol. 60(2), pages 528-535, June.
    3. Zimu Chen & Zhanfeng Wang & Yuan‐chin Ivan Chang, 2020. "Sequential adaptive variables and subject selection for GEE methods," Biometrics, The International Biometric Society, vol. 76(2), pages 496-507, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Elisa–María Molanes-López & Ricardo Cao, 2008. "Relative density estimation for left truncated and right censored data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 20(8), pages 693-720.
    2. Chen, Xiwei & Vexler, Albert & Markatou, Marianthi, 2015. "Empirical likelihood ratio confidence interval estimation of best linear combinations of biomarkers," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 186-198.
    3. Holly Janes & Gary Longton & Margaret S. Pepe, 2009. "Accommodating covariates in receiver operating characteristic analysis," Stata Journal, StataCorp LP, vol. 9(1), pages 17-39, March.
    4. Hsu, Hsiang-Ling & Chang, Yuan-chin Ivan & Chen, Ray-Bing, 2019. "Greedy active learning algorithm for logistic regression models," Computational Statistics & Data Analysis, Elsevier, vol. 129(C), pages 119-134.
    5. Margaret S. Pepe & Gary Longton & Holly Janes, 2009. "Estimation and comparison of receiver operating characteristic curves," Stata Journal, StataCorp LP, vol. 9(1), pages 1-16, March.
    6. Wang, Suohong & Zhang, Biao, 2014. "Semiparametric empirical likelihood confidence intervals for AUC under a density ratio model," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 101-115.
    7. Tianxi Cai & Yingye Zheng, 2007. "Model Checking for ROC Regression Analysis," Biometrics, The International Biometric Society, vol. 63(1), pages 152-163, March.
    8. Gengsheng Qin & Xiao-Hua Zhou, 2006. "Empirical Likelihood Inference for the Area under the ROC Curve," Biometrics, The International Biometric Society, vol. 62(2), pages 613-622, June.
    9. Letón, Emilio & Molanes, Elisa M., 2009. "Adjusted empirical likelihood estimation of the youden index and associated threshold for the bigamma model," DES - Working Papers. Statistics and Econometrics. WS ws091907, Universidad Carlos III de Madrid. Departamento de Estadística.
    10. Li, Jingjing & Chen, Zimu & Wang, Zhanfeng & Chang, Yuan-chin Ivan, 2020. "Active learning in multiple-class classification problems via individualized binary models," Computational Statistics & Data Analysis, Elsevier, vol. 145(C).
    11. Y. Huang & M. S. Pepe, 2009. "A Parametric ROC Model-Based Approach for Evaluating the Predictiveness of Continuous Markers in Case–Control Studies," Biometrics, The International Biometric Society, vol. 65(4), pages 1133-1144, December.
    12. Daniel R. Cavagnaro & Richard Gonzalez & Jay I. Myung & Mark A. Pitt, 2013. "Optimal Decision Stimuli for Risky Choice Experiments: An Adaptive Approach," Management Science, INFORMS, vol. 59(2), pages 358-375, February.
    13. Soutik Ghosal & Zhen Chen, 2022. "Discriminatory Capacity of Prenatal Ultrasound Measures for Large-for-Gestational-Age Birth: A Bayesian Approach to ROC Analysis Using Placement Values," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 14(1), pages 1-22, April.
    14. Rodríguez-Álvarez, María Xosé & Roca-Pardiñas, Javier & Cadarso-Suárez, Carmen, 2011. "A new flexible direct ROC regression model: Application to the detection of cardiovascular risk factors by anthropometric measures," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3257-3270, December.
    15. Eleni Verykouki & Christos T. Nakas, 2023. "Adaptations on the Use of p -Values for Statistical Inference: An Interpretation of Messages from Recent Public Discussions," Stats, MDPI, vol. 6(2), pages 1-13, April.
    16. Rodríguez-Álvarez, María Xosé & Tahoces, Pablo G. & Cadarso-Suárez, Carmen & Lado, María José, 2011. "Comparative study of ROC regression techniques--Applications for the computer-aided diagnostic system in breast cancer detection," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 888-902, January.
    17. Rebecca Graziani & Michele Guindani & Peter F. Thall, 2015. "Bayesian nonparametric estimation of targeted agent effects on biomarker change to predict clinical outcome," Biometrics, The International Biometric Society, vol. 71(1), pages 188-197, March.
    18. Ma, Hua & Bandos, Andriy I. & Gur, David, 2018. "Informativeness of diagnostic marker values and the impact of data grouping," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 76-89.
    19. Adimari Gianfranco & Chiogna Monica, 2010. "Simple Nonparametric Confidence Regions for the Evaluation of Continuous-Scale Diagnostic Tests," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-20, July.
    20. Wan, Shuwen, 2012. "An empirical likelihood confidence interval for the volume under ROC surface," Statistics & Probability Letters, Elsevier, vol. 82(7), pages 1463-1467.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jclass:v:38:y:2021:i:3:d:10.1007_s00357-021-09388-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.