IDEAS home Printed from https://ideas.repec.org/a/gam/jstats/v5y2022i2p21-384d794755.html
   My bibliography  Save this article

ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R

Author

Listed:
  • Kellie J. Archer

    (Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH 43210, USA)

  • Anna Eames Seffernick

    (Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH 43210, USA)

  • Shuai Sun

    (Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH 43210, USA)

  • Yiran Zhang

    (Amgen Inc., 1 Amgen Center Dr, Thousand Oaks, CA 91320, USA)

Abstract

The stage of cancer is a discrete ordinal response that indicates the aggressiveness of disease and is often used by physicians to determine the type and intensity of treatment to be administered. For example, the FIGO stage in cervical cancer is based on the size and depth of the tumor as well as the level of spread. It may be of clinical relevance to identify molecular features from high-throughput genomic assays that are associated with the stage of cervical cancer to elucidate pathways related to tumor aggressiveness, identify improved molecular features that may be useful for staging, and identify therapeutic targets. High-throughput RNA-Seq data and corresponding clinical data (including stage) for cervical cancer patients have been made available through The Cancer Genome Atlas Project (TCGA). We recently described penalized Bayesian ordinal response models that can be used for variable selection for over-parameterized datasets, such as the TCGA-CESC dataset. Herein, we describe our ordinalbayes R package, available from the Comprehensive R Archive Network (CRAN), which enhances the runjags R package by enabling users to easily fit cumulative logit models when the outcome is ordinal and the number of predictors exceeds the sample size, P > N , such as for TCGA and other high-throughput genomic data. We demonstrate the use of this package by applying it to the TCGA cervical cancer dataset. Our ordinalbayes package can be used to fit models to high-dimensional datasets, and it effectively performs variable selection.

Suggested Citation

  • Kellie J. Archer & Anna Eames Seffernick & Shuai Sun & Yiran Zhang, 2022. "ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R," Stats, MDPI, vol. 5(2), pages 1-14, April.
  • Handle: RePEc:gam:jstats:v:5:y:2022:i:2:p:21-384:d:794755
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/5/2/21/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/5/2/21/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Chris Hans, 2009. "Bayesian lasso regression," Biometrika, Biometrika Trust, vol. 96(4), pages 835-845.
    2. Denwood, Matthew J., 2016. "runjags: An R Package Providing Interface Utilities, Model Templates, Parallel Computing Methods and Additional Distributions for MCMC Models in JAGS," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 71(i09).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Merkle, Edgar C. & Steyvers, Mark & Mellers, Barbara & Tetlock, Philip E., 2017. "A neglected dimension of good forecasting judgment: The questions we choose also matter," International Journal of Forecasting, Elsevier, vol. 33(4), pages 817-832.
    2. Ji, Yonggang & Lin, Nan & Zhang, Baoxue, 2012. "Model selection in binary and tobit quantile regression using the Gibbs sampler," Computational Statistics & Data Analysis, Elsevier, vol. 56(4), pages 827-839.
    3. Aghabazaz, Zeynab & Kazemi, Iraj, 2023. "Under-reported time-varying MINAR(1) process for modeling multivariate count series," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
    4. Guangbao Guo & Guoqi Qian & Lu Lin & Wei Shao, 2021. "Parallel inference for big data with the group Bayesian method," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(2), pages 225-243, February.
    5. Badri Padhukasahasram & Chandan K Reddy & Yan Li & David E Lanfear, 2015. "Joint Impact of Clinical and Behavioral Variables on the Risk of Unplanned Readmission and Death after a Heart Failure Hospitalization," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-11, June.
    6. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    7. Philip Kostov & Thankom Arun & Samuel Annim, 2014. "Financial Services to the Unbanked: the case of the Mzansi intervention in South Africa," Contemporary Economics, University of Economics and Human Sciences in Warsaw., vol. 8(2), June.
    8. Ruggieri, Eric & Lawrence, Charles E., 2012. "On efficient calculations for Bayesian variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1319-1332.
    9. Adam N. Smith & Jim E. Griffin, 2023. "Shrinkage priors for high-dimensional demand estimation," Quantitative Marketing and Economics (QME), Springer, vol. 21(1), pages 95-146, March.
    10. Daniel F. Schmidt & Enes Makalic, 2013. "Estimation of stationary autoregressive models with the Bayesian LASSO," Journal of Time Series Analysis, Wiley Blackwell, vol. 34(5), pages 517-531, September.
    11. Minerva Mukhopadhyay & Tapas Samanta, 2017. "A mixture of g-priors for variable selection when the number of regressors grows with the sample size," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(2), pages 377-404, June.
    12. Bergersen Linn Cecilie & Glad Ingrid K. & Lyng Heidi, 2011. "Weighted Lasso with Data Integration," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-29, August.
    13. van Erp, Sara & Oberski, Daniel L. & Mulder, Joris, 2018. "Shrinkage priors for Bayesian penalized regression," OSF Preprints cg8fq, Center for Open Science.
    14. Zhao, Lu & Sun, Zhongkui & Tang, Ming & Guan, Shuguang & Zou, Yong, 2023. "Learning successive weak synchronization transitions and coupling directions by reservoir computing," Chaos, Solitons & Fractals, Elsevier, vol. 168(C).
    15. Gelper, Sarah & Stremersch, Stefan, 2014. "Variable selection in international diffusion models," International Journal of Research in Marketing, Elsevier, vol. 31(4), pages 356-367.
    16. Gehong Zhang & Junming Li & Sijin Li & Yang Wang, 2018. "Exploring Spatial Trends and Influencing Factors for Gastric Cancer Based on Bayesian Statistics: A Case Study of Shanxi, China," IJERPH, MDPI, vol. 15(9), pages 1-17, August.
    17. Charles Miller & Benjamin Barber & Shuvo Bakar, 2018. "Indoctrination and coercion in agent motivation: Evidence from Nazi Germany," Rationality and Society, , vol. 30(2), pages 189-219, May.
    18. Cao, Wen-Rui & Huang, Qiu-Ru & Zhang, Nan & Liang, Hui-Juan & Xian, Ben-Song & Gan, Xiao-Fang & Xu, Dong Roman & Lai, Ying-Si, 2022. "Mapping the travel modes and acceptable travel time to primary healthcare institutions: A case study in Inner Mongolia Autonomous Region, China," Journal of Transport Geography, Elsevier, vol. 102(C).
    19. Alexina J. Mason & Manuel Gomes & James Carpenter & Richard Grieve, 2021. "Flexible Bayesian longitudinal models for cost‐effectiveness analyses with informative missing data," Health Economics, John Wiley & Sons, Ltd., vol. 30(12), pages 3138-3158, December.
    20. Baragatti, M. & Pommeret, D., 2012. "A study of variable selection using g-prior distribution with ridge parameter," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1920-1934.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:5:y:2022:i:2:p:21-384:d:794755. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.