
Regularization Paths for Conditional Logistic Regression: The clogitL1 Package

Author

Listed:
  • Reid, Stephen
  • Tibshirani, Rob

Abstract

We apply the cyclic coordinate descent algorithm of Friedman, Hastie, and Tibshirani (2010) to fit a conditional logistic regression model with lasso (ℓ1) and elastic net penalties. The algorithm also uses the sequential strong rules of Tibshirani, Bien, Hastie, Friedman, Taylor, Simon, and Tibshirani (2012), which are shown to offer a considerable speedup over standard coordinate descent with warm starts. Once implemented, the algorithm is used in simulation studies to compare the variable selection and prediction performance of the conditional logistic regression model against that of its unconditional (standard) counterpart. We find that the conditional model performs admirably on datasets drawn from a suitable conditional distribution, outperforming its unconditional counterpart at variable selection. The conditional model is also fit to a small real-world dataset, demonstrating how we obtain regularization paths for the parameters of the model and how we apply cross-validation for this method, where natural unconditional prediction rules are hard to come by.
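
The ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the clogitL1 implementation: it assumes exactly one case per stratum (1:m matching), an unscaled objective of the form −ℓ(β) + λ‖β‖₁, and the lasso case of the sequential strong rule. The function names `soft_threshold`, `cond_loglik_and_grad`, and `strong_rule_keep` are hypothetical and chosen only for illustration.

```python
import math


def soft_threshold(z, gamma):
    """Soft-thresholding operator S(z, gamma) = sign(z) * max(|z| - gamma, 0),
    the building block of lasso coordinate descent updates."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0


def cond_loglik_and_grad(strata, beta):
    """Conditional log-likelihood and its gradient.

    Assumption: each stratum is (case_row, [control_rows]) with one case;
    every row is a list of floats of the same length as beta.
    """
    p = len(beta)
    loglik = 0.0
    grad = [0.0] * p
    for case, controls in strata:
        rows = [case] + controls
        etas = [sum(b * x for b, x in zip(beta, row)) for row in rows]
        shift = max(etas)  # log-sum-exp shift for numerical stability
        weights = [math.exp(e - shift) for e in etas]
        total = sum(weights)
        # Contribution: eta_case - log(sum over the stratum of exp(eta)).
        loglik += etas[0] - (shift + math.log(total))
        for j in range(p):
            expected = sum(w * row[j] for w, row in zip(weights, rows)) / total
            grad[j] += case[j] - expected
    return loglik, grad


def strong_rule_keep(grad_prev, lam_prev, lam_new):
    """Sequential strong rule, lasso case: when moving from lam_prev to
    lam_new, keep predictor j only if |grad_j| >= 2 * lam_new - lam_prev,
    where grad_prev is the gradient at the lam_prev solution.
    Discarded predictors must still be checked against the KKT conditions
    after fitting, since the rule can occasionally discard in error."""
    cutoff = 2.0 * lam_new - lam_prev
    return [j for j, g in enumerate(grad_prev) if abs(g) >= cutoff]


# Toy data: two strata, two predictors; the second predictor never
# separates a case from its controls.
strata = [
    ([1.0, 0.0], [[0.0, 0.0]]),
    ([2.0, 1.0], [[0.0, 1.0], [1.0, 1.0]]),
]
loglik, grad = cond_loglik_and_grad(strata, [0.0, 0.0])
lam_max = max(abs(g) for g in grad)  # smallest lambda giving beta = 0
kept = strong_rule_keep(grad, lam_max, 0.8 * lam_max)
```

At β = 0 the largest absolute gradient entry gives the start of the regularization path, and the strong rule then restricts coordinate descent at the next λ to the predictors in `kept`.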

Suggested Citation

  • Reid, Stephen & Tibshirani, Rob, 2014. "Regularization Paths for Conditional Logistic Regression: The clogitL1 Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 58(i12).
  • Handle: RePEc:jss:jstsof:v:058:i12
    DOI: http://hdl.handle.net/10.18637/jss.v058.i12

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v058i12/v58i12.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v058i12/clogitL1_1.4.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v058i12/v58i12.R
    Download Restriction: no


    References listed on IDEAS

    1. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    2. Simon, Noah & Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2011. "Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 39(i05).

    Citations


    Cited by:

    1. Shi, Haolun & Yin, Guosheng, 2018. "Boosting conditional logit model," Journal of choice modelling, Elsevier, vol. 26(C), pages 48-63.
    2. E. Ollier & V. Viallon, 2017. "Regression modelling on stratified data with the lasso," Biometrika, Biometrika Trust, vol. 104(1), pages 83-96.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benedicte Sjo Tislevoll & Monica Hellesøy & Oda Helen Eck Fagerholt & Stein-Erik Gullaksen & Aashish Srivastava & Even Birkeland & Dimitrios Kleftogiannis & Pilar Ayuda-Durán & Laure Piechaczyk & Dagi, 2023. "Early response evaluation by single cell signaling profiling in acute myeloid leukemia," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    2. Matthew F Dixon, 2017. "A High Frequency Trade Execution Model for Supervised Learning," Papers 1710.03870, arXiv.org, revised Dec 2017.
    3. Zhixuan Fu & Shuangge Ma & Haiqun Lin & Chirag R. Parikh & Bingqing Zhou, 2017. "Penalized Variable Selection for Multi-center Competing Risks Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 379-405, December.
    4. Andreas Groll & Gerhard Tutz, 2017. "Variable selection in discrete survival models including heterogeneity," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(2), pages 305-338, April.
    5. Matthew F Dixon, 2017. "Sequence Classification of the Limit Order Book using Recurrent Neural Networks," Papers 1707.05642, arXiv.org.
    6. Jie Xiong & Zhitong Bing & Yanlin Su & Defeng Deng & Xiaoning Peng, 2014. "An Integrated mRNA and microRNA Expression Signature for Glioblastoma Multiforme Prognosis," PLOS ONE, Public Library of Science, vol. 9(5), pages 1-8, May.
    7. Liao Zhu & Robert A. Jarrow & Martin T. Wells, 2021. "Time-Invariance Coefficients Tests with the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 11(04), pages 1-30, December.
    8. Shengzhi Huang & Jiajia Qian & Yong Huang & Wei Lu & Yi Bu & Jinqing Yang & Qikai Cheng, 2022. "Disclosing the relationship between citation structure and future impact of a publication," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(7), pages 1025-1042, July.
    9. Gal Dinstag & David Amar & Erik Ingelsson & Euan Ashley & Ron Shamir, 2019. "Personalized prediction of adverse heart and kidney events using baseline and longitudinal data from SPRINT and ACCORD," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-12, August.
    10. Paul Ghelasi & Florian Ziel, 2024. "From day-ahead to mid and long-term horizons with econometric electricity price forecasting models," Papers 2406.00326, arXiv.org, revised Aug 2024.
    11. Shikhar Uttam & Andrew M. Stern & Christopher J. Sevinsky & Samantha Furman & Filippo Pullara & Daniel Spagnolo & Luong Nguyen & Albert Gough & Fiona Ginty & D. Lansing Taylor & S. Chakra Chennubhotla, 2020. "Spatial domain analysis predicts risk of colorectal cancer recurrence and infers associated tumor microenvironment networks," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    12. Robert A. Jarrow & Rinald Murataj & Martin T. Wells & Liao Zhu, 2023. "The Low-Volatility Anomaly And The Adaptive Multi-Factor Model," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 26(04n05), pages 1-33, August.
    13. Liao Zhu, 2021. "The Adaptive Multi-Factor Model and the Financial Market," Papers 2107.14410, arXiv.org, revised Aug 2021.
    14. Kou Fujimori, 2022. "The variable selection by the Dantzig selector for Cox’s proportional hazards model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 515-537, June.
    15. Jacek Białek & Maciej Beręsewicz, 2020. "Scanner data in inflation measurement: from raw data to price indices," Papers 2005.11233, arXiv.org.
    16. Liao Zhu & Sumanta Basu & Robert A. Jarrow & Martin T. Wells, 2020. "High-Dimensional Estimation, Basis Assets, and the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 10(04), pages 1-52, December.
    17. Wu, Tong Tong & He, Xin, 2012. "Coordinate ascent for penalized semiparametric regression on high-dimensional panel count data," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 25-33, January.
    18. Marton Gosztonyi, 2023. "Comparative Analysis of X-Y-Z Generation Entrepreneurs in a Semi-Peripheral EU Member Country: Insights from Regularized Regression Techniques," European Research Studies Journal, European Research Studies Journal, vol. 0(4), pages 191-217.
    19. Sill, Martin & Hielscher, Thomas & Becker, Natalia & Zucknick, Manuela, 2014. "c060: Extended Inference with Lasso and Elastic-Net Regularized Cox and Generalized Linear Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 62(i05).
    20. repec:jss:jstsof:47:i09 is not listed on IDEAS
    21. Yoonsuh Jung, 2018. "Multiple predicting K-fold cross-validation for model selection," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 30(1), pages 197-215, January.
