IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v72y2014icp219-226.html
   My bibliography  Save this article

Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases

Author

Listed:
  • Franklin, Jessica M.
  • Schneeweiss, Sebastian
  • Polinski, Jennifer M.
  • Rassen, Jeremy A.

Abstract

Longitudinal healthcare claim databases are frequently used for studying the comparative safety and effectiveness of medications, but results from these studies may be biased due to residual confounding. It is unclear whether methods for confounding adjustment that have been shown to perform well in small, simple nonrandomized studies are applicable to the large, complex pharmacoepidemiologic studies created from secondary healthcare data. Ordinary simulation approaches for evaluating the performance of statistical methods do not capture important features of healthcare claims. A statistical framework for creating replicated simulation datasets from an empirical cohort study in electronic healthcare claims data is developed and validated. The approach relies on resampling from the observed covariate and exposure data without modification in all simulated datasets to preserve the associations among these variables. Repeated outcomes are simulated using a true treatment effect of the investigator’s choice and the baseline hazard function estimated from the empirical data. As an example, this framework is applied to a study of high versus low-intensity statin use and cardiovascular outcomes. Simulated data is based on real data drawn from Medicare Parts A and B linked with a prescription drug insurance claims database maintained by Caremark. Properties of the data simulated using this framework are compared with the empirical data on which the simulations were based. In addition, the simulated datasets are used to compare variable selection strategies for confounder adjustment via the propensity score, including high-dimensional approaches that could not be evaluated with ordinary simulation methods. The simulated datasets are found to closely resemble the observed complex data structure but have the advantage of an investigator-specified exposure effect.

Suggested Citation

  • Franklin, Jessica M. & Schneeweiss, Sebastian & Polinski, Jennifer M. & Rassen, Jeremy A., 2014. "Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 219-226.
  • Handle: RePEc:eee:csdana:v:72:y:2014:i:c:p:219-226
    DOI: 10.1016/j.csda.2013.10.018
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947313003721
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2013.10.018?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gary L Gadbury & Qinfang Xiang & Lin Yang & Stephen Barnes & Grier P Page & David B Allison, 2008. "Evaluating Statistical Methods Using Plasmode Data Sets in the Age of Massive Public Databases: An Illustration Using False Discovery Rates," PLOS Genetics, Public Library of Science, vol. 4(6), pages 1-8, June.
    2. Fatih Safa Erenay & Oguzhan Alagoz & Ritesh Banerjee & Robert R. Cima, 2011. "Estimating the Unknown Parameters of the Natural History of Metachronous Colorectal Cancer Using Discrete-Event Simulation," Medical Decision Making, , vol. 31(4), pages 611-624, July.
    3. Vaughan, Laura K. & Divers, Jasmin & Padilla, Miguel A. & Redden, David T. & Tiwari, Hemant K. & Pomp, Daniel & Allison, David B., 2009. "The use of plasmodes as a supplement to simulations: A simple example evaluating individual admixture estimation methodologies," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1755-1766, March.
    4. Mai A Elobeid & Miguel A Padilla & Theresa McVie & Olivia Thomas & David W Brock & Bret Musser & Kaifeng Lu & Christopher S Coffey & Renee A Desmond & Marie-Pierre St-Onge & Kishore M Gadde & Steven B, 2009. "Missing Data in Randomized Clinical Trials for Weight Loss: Scope of the Problem, State of the Field, and Performance of Statistical Methods," PLOS ONE, Public Library of Science, vol. 4(8), pages 1-11, August.
    5. Clifford M. Hurvich & Jeffrey S. Simonoff & Chih‐Ling Tsai, 1998. "Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 271-293.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pang Menglan & Schuster Tibor & Filion Kristian B. & Schnitzer Mireille E. & Eberg Maria & Platt Robert W., 2016. "Effect Estimation in Point-Exposure Studies with Binary Outcomes and High-Dimensional Covariate Data – A Comparison of Targeted Maximum Likelihood Estimation and Inverse Probability of Treatment Weigh," The International Journal of Biostatistics, De Gruyter, vol. 12(2), pages 1-12, November.
    2. Mingyang Shan & Douglas Faries & Andy Dang & Xiang Zhang & Zhanglin Cui & Kristin M. Sheffield, 2022. "A Simulation-Based Evaluation of Statistical Methods for Hybrid Real-World Control Arms in Clinical Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 14(2), pages 259-284, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    2. Don Harding, 2010. "Applying shape and phase restrictions in generalized dynamic categorical models of the business cycle," NCER Working Paper Series 58, National Centre for Econometric Research.
    3. Michael S. Delgado & Daniel J. Henderson & Christopher F. Parmeter, 2014. "Does Education Matter for Economic Growth?," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 76(3), pages 334-359, June.
    4. Juan Manuel Julio & Norberto Rodríguez & Héctor Manuel Zárate, 2005. "Estimating the COP Exchange Rate Volatility Smile and the Market Effect of Central Bank Interventions: A CHARN Approach," Borradores de Economia 2605, Banco de la Republica.
    5. Malloy, Elizabeth J. & Spiegelman, Donna & Eisen, Ellen A., 2009. "Comparing measures of model selection for penalized splines in Cox models," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2605-2616, May.
    6. Karimu, Amin & Brännlund, Runar, 2013. "Functional form and aggregate energy demand elasticities: A nonparametric panel approach for 17 OECD countries," Energy Economics, Elsevier, vol. 36(C), pages 19-27.
    7. Liao, Jun & Zou, Guohua, 2020. "Corrected Mallows criterion for model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    8. Lu, Jun & Lin, Lu, 2018. "Feature screening for multi-response varying coefficient models with ultrahigh dimensional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 242-254.
    9. Chu, Chi-Yang & Henderson, Daniel J. & Parmeter, Christopher F., 2017. "On discrete Epanechnikov kernel functions," Computational Statistics & Data Analysis, Elsevier, vol. 116(C), pages 79-105.
    10. Salvatore Ingrassia & Simona Minotti & Giorgio Vittadini, 2012. "Local Statistical Modeling via a Cluster-Weighted Approach with Elliptical Distributions," Journal of Classification, Springer;The Classification Society, vol. 29(3), pages 363-401, October.
    11. Maria Sassi, 2010. "OLS and GWR Approaches to Agricultural Convergence in the EU-15," International Advances in Economic Research, Springer;International Atlantic Economic Society, vol. 16(1), pages 96-108, February.
    12. Nagler Thomas & Schellhase Christian & Czado Claudia, 2017. "Nonparametric estimation of simplified vine copula models: comparison of methods," Dependence Modeling, De Gruyter, vol. 5(1), pages 99-120, January.
    13. Arturo Bujanda & Thomas M. Fullerton, 2017. "Impacts of transportation infrastructure on single-family property values," Applied Economics, Taylor & Francis Journals, vol. 49(51), pages 5183-5199, November.
    14. Costas Milas & Ruthira Naraidoo, 2009. "Financial Market Conditions, Real Time, Nonlinearity and European Central Bank Monetary Policy: In-Sample and Out-of-Sample Assessment," Working Papers 200923, University of Pretoria, Department of Economics.
    15. Cai, Zongwu & Xu, Xiaoping, 2009. "Nonparametric Quantile Estimations for Dynamic Smooth Coefficient Models," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 371-383.
    16. Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2017. "The finite sample performance of semi- and non-parametric estimators for treatment effects and policy evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 91-102.
    17. Víctor M. Guerrero & Daniela Cortés Toto & Hortensia J. Reyes Cervantes, 2018. "Effect of autocorrelation when estimating the trend of a time series via penalized least squares with controlled smoothness," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(1), pages 109-130, March.
    18. Yanagihara, Hirokazu & Satoh, Kenichi, 2010. "An unbiased Cp criterion for multivariate ridge regression," Journal of Multivariate Analysis, Elsevier, vol. 101(5), pages 1226-1238, May.
    19. Centorrino Samuele & Feve Frederique & Florens Jean-Pierre, 2017. "Additive Nonparametric Instrumental Regressions: A Guide to Implementation," Journal of Econometric Methods, De Gruyter, vol. 6(1), pages 1-25, January.
    20. Simon Lineu Umbach, 2020. "Forecasting with supervised factor models," Empirical Economics, Springer, vol. 58(1), pages 169-190, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:72:y:2014:i:c:p:219-226. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.