IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v73y2017i4p1189-1198.html
   My bibliography  Save this article

Modeling and analyzing respondent‐driven sampling as a counting process

Author

Listed:
  • Yakir Berchenko
  • Jonathan D. Rosenblatt
  • Simon D. W. Frost

Abstract

Respondent‐driven sampling (RDS) is an approach to sampling design and analysis which utilizes the networks of social relationships that connect members of the target population, using chain‐referral. RDS sampling will typically oversample participants with many acquaintances. Naïve estimators, such as the sample average, will thus be biased towards the state of the most highly connected individuals. Current methodology cannot estimate population size from RDS, and promotes inverse probability weighted estimators for population parameters such as HIV prevalence. We propose to use the timing of recruitment, typically collected and discarded, in order to estimate the population size via a counting process model. Once population size and degree frequencies are made available, prevalence can be debiased in a post‐stratified framework. We adapt methods developed for inference in epidemiology and software reliability to estimate the population size, degree counts and frequencies. A fundamental advantage of our approach is that it makes the assumptions of the sampling design explicit. This enables verification of the assumptions, maximum likelihood estimation, extension with covariates, and model selection. We develop large‐sample theory, proving consistency and asymptotic normality. We further compare our estimators to other estimators in the RDS literature, through simulation and real‐world data. In both cases, we find our estimators to outperform current methods. The likelihood problem in the model we present is separable, and thus efficiently solvable. We implement these estimators in an accompanying R package, chords, available on CRAN.

Suggested Citation

  • Yakir Berchenko & Jonathan D. Rosenblatt & Simon D. W. Frost, 2017. "Modeling and analyzing respondent‐driven sampling as a counting process," Biometrics, The International Biometric Society, vol. 73(4), pages 1189-1198, December.
  • Handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1189-1198
    DOI: 10.1111/biom.12678
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.12678
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.12678?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gile, Krista J., 2011. "Improved Inference for Respondent-Driven Sampling Data With Application to HIV Prevalence Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 135-146.
    2. Traud, Amanda L. & Mucha, Peter J. & Porter, Mason A., 2012. "Social structure of Facebook networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(16), pages 4165-4180.
    3. T. Britton, 1998. "Estimation in multitype epidemics," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(4), pages 663-679.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Barash Vladimir D. & Cameron Christopher J. & Heckathorn Douglas D. & Spiller Michael W., 2016. "Respondent-Driven Sampling – Testing Assumptions: Sampling with Replacement," Journal of Official Statistics, Sciendo, vol. 32(1), pages 29-73, March.
    2. Xin Xu & Yang Lu & Yupeng Zhou & Zhiguo Fu & Yanjie Fu & Minghao Yin, 2021. "An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks," Mathematics, MDPI, vol. 9(15), pages 1-14, July.
    3. Ian E. Fellows & Mark S. Handcock, 2023. "Modeling of networked populations when data is sampled or missing," METRON, Springer;Sapienza Università di Roma, vol. 81(1), pages 21-35, April.
    4. Demiris, Nikolaos & Kypraios, Theodore & Smith, L. Vanessa, 2012. "On the epidemic of financial crises," MPRA Paper 46693, University Library of Munich, Germany.
    5. Jiashun Jin & Zheng Tracy Ke & Shengming Luo, 2022. "Improvements on SCORE, Especially for Weak Signals," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 127-162, June.
    6. Fatemi, Samira & Salehi, Mostafa & Veisi, Hadi & Jalili, Mahdi, 2018. "A fuzzy logic based estimator for respondent driven sampling of complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 510(C), pages 42-51.
    7. Luca Braghieri & Ro'ee Levy & Alexey Makarin, 2022. "Social Media and Mental Health," American Economic Review, American Economic Association, vol. 112(11), pages 3660-3693, November.
    8. Huggins, Richard M. & Yip, Paul S. F. & Lau, Eric H. Y., 2004. "A note on the estimation of the initial number of susceptible individuals in the general epidemic model," Statistics & Probability Letters, Elsevier, vol. 67(4), pages 321-330, May.
    9. Han, Kevin & Basse, Guillaume & Bojinov, Iavor, 2024. "Population interference in panel experiments," Journal of Econometrics, Elsevier, vol. 238(1).
    10. Marya Bazzi & Mason A. Porter & Stacy Williams & Mark McDonald & Daniel J. Fenn & Sam D. Howison, 2014. "Community detection in temporal multilayer networks, with an application to correlation networks," Papers 1501.00040, arXiv.org, revised Dec 2017.
    11. Christopher Avery & William Bossert & Adam Clark & Glenn Ellison & Sara Fisher Ellison, 2020. "An Economist's Guide to Epidemiology Models of Infectious Disease," Journal of Economic Perspectives, American Economic Association, vol. 34(4), pages 79-104, Fall.
    12. Toshihiko Matsuoka & Tomoki Sato & Tomoyuki Akita & Jiturou Yanagida & Hiroki Ohge & Masao Kuwabara & Junko Tanaka, 2016. "High Vaccination Coverage among Children during Influenza A(H1N1)pdm09 as a Potential Factor of Herd Immunity," IJERPH, MDPI, vol. 13(10), pages 1-17, October.
    13. Saxena, Rakhi & Kaur, Sharanjit & Bhatnagar, Vasudha, 2019. "Identifying similar networks using structural hierarchy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
    14. Nicky McCreesh & Andrew Copas & Janet Seeley & Lisa G Johnston & Pam Sonnenberg & Richard J Hayes & Simon D W Frost & Richard G White, 2013. "Respondent Driven Sampling: Determinants of Recruitment and a Method to Improve Point Estimation," PLOS ONE, Public Library of Science, vol. 8(10), pages 1-9, October.
    15. Aronow, Peter M. & Crawford, Forrest W., 2015. "Nonparametric identification for respondent-driven sampling," Statistics & Probability Letters, Elsevier, vol. 106(C), pages 100-102.
    16. He, Dongxiao & Wang, Hongcui & Jin, Di & Liu, Baolin, 2016. "A model framework for the enhancement of community detection in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 461(C), pages 602-612.
    17. Chien-Min Huang & F. Jay Breidt, 2023. "A dual-frame approach for estimation with respondent-driven samples," METRON, Springer;Sapienza Università di Roma, vol. 81(1), pages 65-81, April.
    18. Vicente, R. & Susemihl, A. & Jericó, J.P. & Caticha, N., 2014. "Moral foundations in an interacting neural networks society: A statistical mechanics analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 400(C), pages 124-138.
    19. Ali Mirzazadeh & Yea-Hung Chen & Jess Lin & Katie Burk & Erin C Wilson & Desmond Miller & Danielle Veloso & Willi McFarland & Meghan D Morris, 2021. "Progress toward closing gaps in the hepatitis C virus cascade of care for people who inject drugs in San Francisco," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-11, April.
    20. Taghavian, Fatemeh & Salehi, Mostafa & Teimouri, Mehdi, 2017. "A local immunization strategy for networks with overlapping community structure," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 467(C), pages 148-156.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1189-1198. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.