IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v115y2017icp155-171.html
   My bibliography  Save this article

Optimal scaling for survival analysis with ordinal data

Author

Listed:
  • Willems, S.J.W.
  • Fiocco, M.
  • Meulman, J.J.

Abstract

Medical and psychological studies often involve the collection and analysis of categorical data with nominal or ordinal category levels. Nominal categories have no ordering property, e.g. gender, with the two unordered covariates male and female. Ordinal category levels, however, have an ordering, e.g. when subjects are classified according to their education level, often categorized as low, medium or high education. When analyzing survival data, currently two methods can be chosen to include ordinal covariates in the Cox proportional hazard model. Dummy covariates can be used to indicate category memberships, as is usually done for nominal covariates. Estimated parameters for each category indicate the increase or decrease in risk of experiencing the event of interest compared to the reference category. Since these parameters are estimated independently from each other, the ordering property of the categories is lost in the process. To keep the ordinal property, integer values can be given to the covariate’s categories (e.g. low = 0, medium = 1, high = 2), and the variable is included in the model as a numeric covariate. However, since the ordinal data are now interpreted as numeric data, the property of equal distances between consecutive categories is introduced. This assumption is too strict for this data type; distances between consecutive categories do not necessarily have to be equal. A method is described to include ordinal data in the Cox model. The method implements optimal scaling to find optimal quantifications for the ordinal category levels. These quantifications are chosen such that they preserve the categories’ ordering, and do not force equal distances between consecutive category levels. A simulation study is carried out to compare the performance of optimal scaling with the performance of the two currently used methods described above. Results show that the optimal scaling method increases the model fit if ordinal covariates are included in the model.

Suggested Citation

  • Willems, S.J.W. & Fiocco, M. & Meulman, J.J., 2017. "Optimal scaling for survival analysis with ordinal data," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 155-171.
  • Handle: RePEc:eee:csdana:v:115:y:2017:i:c:p:155-171
    DOI: 10.1016/j.csda.2017.05.008
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947317301032
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2017.05.008?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Simon, Noah & Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2011. "Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 39(i05).
    2. Moeschberger M.L., 2003. "Statistical Methods for the Analysis of Repeated Measurements," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 248-249, January.
    3. J. Kruskal, 1964. "Nonmetric multidimensional scaling: A numerical method," Psychometrika, Springer;The Psychometric Society, vol. 29(2), pages 115-129, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Roberto Louis Forestal & Shih-Ming Pi, 2021. "Using Artificial Neural networks and Optimal Scaling Model to Forecast Agriculture Commodity Price: An Ecological-economic Approach," Advances in Management and Applied Economics, SCIENPRESS Ltd, vol. 11(3), pages 1-3.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Samuel Shye, 2010. "The Motivation to Volunteer: A Systemic Quality of Life Theory," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 98(2), pages 183-200, September.
    2. Soave, David & Lawless, Jerald F., 2023. "Regularized regression for two phase failure time studies," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
    3. Muñoz-Mas, Rafael & Vezza, Paolo & Alcaraz-Hernández, Juan Diego & Martínez-Capel, Francisco, 2016. "Risk of invasion predicted with support vector machines: A case study on northern pike (Esox Lucius, L.) and bleak (Alburnus alburnus, L.)," Ecological Modelling, Elsevier, vol. 342(C), pages 123-134.
    4. Zemin Zheng & Jie Zhang & Yang Li, 2022. "L 0 -Regularized Learning for High-Dimensional Additive Hazards Regression," INFORMS Journal on Computing, INFORMS, vol. 34(5), pages 2762-2775, September.
    5. Simon Bussy & Mokhtar Z. Alaya & Anne‐Sophie Jannot & Agathe Guilloux, 2022. "Binacox: automatic cut‐point detection in high‐dimensional Cox model with applications in genetics," Biometrics, The International Biometric Society, vol. 78(4), pages 1414-1426, December.
    6. Biagini, Francesca & Groll, Andreas & Widenmann, Jan, 2013. "Intensity-based premium evaluation for unemployment insurance products," Insurance: Mathematics and Economics, Elsevier, vol. 53(1), pages 302-316.
    7. Benedicte Sjo Tislevoll & Monica Hellesøy & Oda Helen Eck Fagerholt & Stein-Erik Gullaksen & Aashish Srivastava & Even Birkeland & Dimitrios Kleftogiannis & Pilar Ayuda-Durán & Laure Piechaczyk & Dagi, 2023. "Early response evaluation by single cell signaling profiling in acute myeloid leukemia," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    8. Leandro C. Hermida & E. Michael Gertz & Eytan Ruppin, 2022. "Predicting cancer prognosis and drug response from the tumor microbiome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    9. Karim Abou-Moustafa & Frank P. Ferrie, 2018. "Local generalized quadratic distance metrics: application to the k-nearest neighbors classifier," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(2), pages 341-363, June.
    10. Camacho, Maximo & Perez-Quiros, Gabriel & Saiz, Lorena, 2006. "Are European business cycles close enough to be just one?," Journal of Economic Dynamics and Control, Elsevier, vol. 30(9-10), pages 1687-1706.
    11. Takumi Saegusa & Tianzhou Ma & Gang Li & Ying Qing Chen & Mei-Ling Ting Lee, 2020. "Variable Selection in Threshold Regression Model with Applications to HIV Drug Adherence Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 12(3), pages 376-398, December.
    12. Mingxu Zhao & Nalaka Geekiyanage & Jianchu Xu & Myo Myo Khin & Dian Ridwan Nurdiana & Ekananda Paudel & Rhett Daniel Harrison, 2015. "Structure of the Epiphyte Community in a Tropical Montane Forest in SW China," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-19, April.
    13. Willem Heiser, 1991. "A generalized majorization method for least souares multidimensional scaling of pseudodistances that may be negative," Psychometrika, Springer;The Psychometric Society, vol. 56(1), pages 7-27, March.
    14. Wenhua Liang & Jianhua Yao & Ailan Chen & Qingquan Lv & Mark Zanin & Jun Liu & SookSan Wong & Yimin Li & Jiatao Lu & Hengrui Liang & Guoqiang Chen & Haiyan Guo & Jun Guo & Rong Zhou & Limin Ou & Niyun, 2020. "Early triage of critically ill COVID-19 patients using deep learning," Nature Communications, Nature, vol. 11(1), pages 1-7, December.
    15. Tatyana Deryugina & Garth Heutel & Nolan H. Miller & David Molitor & Julian Reif, 2019. "The Mortality and Medical Costs of Air Pollution: Evidence from Changes in Wind Direction," American Economic Review, American Economic Association, vol. 109(12), pages 4178-4219, December.
    16. Luís Francisco Aguiar & Pedro C. Magalhães & Maria Joana Soares, 2010. "Synchronism in Electoral Cycles: How United are the United States?," NIPE Working Papers 17/2010, NIPE - Universidade do Minho.
    17. Andreas Groll & Gerhard Tutz, 2017. "Variable selection in discrete survival models including heterogeneity," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(2), pages 305-338, April.
    18. Vicente Gallego & Ramon Oller, 2024. "A Kernel approach for extending nonparametric multivariate analysis of variance in high-dimensional settings," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 33(5), pages 1315-1335, November.
    19. Kennen, Jonathan G. & Kauffman, Leon J. & Ayers, Mark A. & Wolock, David M. & Colarullo, Susan J., 2008. "Use of an integrated flow model to estimate ecologically relevant hydrologic characteristics at stream biomonitoring sites," Ecological Modelling, Elsevier, vol. 211(1), pages 57-76.
    20. Kevin He & Yue Wang & Xiang Zhou & Han Xu & Can Huang, 2019. "An improved variable selection procedure for adaptive Lasso in high-dimensional survival analysis," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 25(3), pages 569-585, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:115:y:2017:i:c:p:155-171. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.