IDEAS home Printed from https://ideas.repec.org/a/eee/stapro/v81y2011i7p852-860.html
   My bibliography  Save this article

Power, sample size and sampling costs for clustered data

Author

Listed:
  • Tokola, K.
  • Larocque, D.
  • Nevalainen, J.
  • Oja, H.

Abstract

The data collected in epidemiological or clinical studies are frequently clustered. In such settings, appropriate variance adjustments must be made in order to estimate the sufficient sample size correctly. This paper works through the sample size calculations for clustered data. Importantly, our explicit variance expressions also enable us to optimize the design with respect to the number of clusters and number of subjects; the objective could be either to maximize the power or to minimize the costs with given costs on the clusters and on the individuals. In our approach, units on different levels and treatment groups can have different costs, but the members of the same cluster are assumed to belong to the same treatment group. Design considerations in the health coaching project TERVA are used as motivating examples. R-functions for carrying out the computations presented are provided.

Suggested Citation

  • Tokola, K. & Larocque, D. & Nevalainen, J. & Oja, H., 2011. "Power, sample size and sampling costs for clustered data," Statistics & Probability Letters, Elsevier, vol. 81(7), pages 852-860, July.
  • Handle: RePEc:eee:stapro:v:81:y:2011:i:7:p:852-860
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-7152(11)00044-7
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ronald E. Gangnon, 2004. "Sample-size formula for clustered survival data using weighted log-rank statistics," Biometrika, Biometrika Trust, vol. 91(2), pages 263-275, June.
    2. Moonseong Heo & Andrew C. Leon, 2008. "Statistical Power and Sample Size Requirements for Three Level Hierarchical Cluster Randomized Trials," Biometrics, The International Biometric Society, vol. 64(4), pages 1256-1262, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kari Tokola & Andreas Lundell & Jaakko Nevalainen & Hannu Oja, 2014. "Design and cost optimization for hierarchical data," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(2), pages 130-148, May.
    2. Md Abu Manju & Math J. J. M. Candel & Gerard J. P. van Breukelen, 2019. "SamP2CeT: an interactive computer program for sample size and power calculation for two-level cost-effectiveness trials," Computational Statistics, Springer, vol. 34(1), pages 47-70, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kari Tokola & Andreas Lundell & Jaakko Nevalainen & Hannu Oja, 2014. "Design and cost optimization for hierarchical data," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(2), pages 130-148, May.
    2. Jianghao Li & Sin-Ho Jung, 2022. "Sample size calculation for clustered survival data under subunit randomization," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 28(1), pages 40-67, January.
    3. Mirjam Moerbeek & Maryam Safarkhani, 2018. "The Design of Cluster Randomized Trials With Random Cross-Classifications," Journal of Educational and Behavioral Statistics, , vol. 43(2), pages 159-181, April.
    4. Heo, Moonseong & Xue, Xiaonan & Kim, Mimi Y., 2013. "Sample size requirements to detect an intervention by time interaction in longitudinal cluster randomized clinical trials with random slopes," Computational Statistics & Data Analysis, Elsevier, vol. 60(C), pages 169-178.
    5. Satoshi Usami, 2017. "Generalized SAMPLE SIZE Determination Formulas for Investigating Contextual Effects by a Three-Level Random Intercept Model," Psychometrika, Springer;The Psychometric Society, vol. 82(1), pages 133-157, March.
    6. Steven Teerenstra & Bing Lu & John S. Preisser & Theo van Achterberg & George F. Borm, 2010. "Sample Size Considerations for GEE Analyses of Three-Level Cluster Randomized Trials," Biometrics, The International Biometric Society, vol. 66(4), pages 1230-1237, December.
    7. Kosorok, Michael R. & Gangnon, Ronald E., 2006. "Resolving the tail instability in weighted log-rank statistics for clustered survival data," Statistics & Probability Letters, Elsevier, vol. 76(3), pages 304-309, February.
    8. Kendra Davis‐Plourde & Monica Taljaard & Fan Li, 2023. "Sample size considerations for stepped wedge designs with subclusters," Biometrics, The International Biometric Society, vol. 79(1), pages 98-112, March.
    9. Su, Pei-Fang & Li, Chung-I & Shyr, Yu, 2014. "Sample size determination for paired right-censored data based on the difference of Kaplan–Meier estimates," Computational Statistics & Data Analysis, Elsevier, vol. 74(C), pages 39-51.
    10. Micha Mandel & Rebecca A. Betensky, 2007. "Testing Goodness of Fit of a Uniform Truncation Model," Biometrics, The International Biometric Society, vol. 63(2), pages 405-412, June.
    11. Rui Song & Michael R. Kosorok & Jianwen Cai, 2008. "Robust Covariate‐Adjusted Log‐Rank Statistics and Corresponding Sample Size Formula for Recurrent Events Data," Biometrics, The International Biometric Society, vol. 64(3), pages 741-750, September.
    12. Casper T. Charles & Cook Thomas D., 2012. "Estimation of the Mean Frequency Function for Recurrent Events when Ascertainment of Events Is Delayed," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-20, February.
    13. S.P. Singh & S. Mukhopadhyay & A. Roy, 2015. "Comparison of three-level cluster randomized trials using quantile dispersion graphs," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(8), pages 1792-1812, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:stapro:v:81:y:2011:i:7:p:852-860. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.