IDEAS home Printed from https://ideas.repec.org/a/bla/jorssc/v71y2022i5p1471-1502.html
   My bibliography  Save this article

Investigating the association of a sensitive attribute with a random variable using the Christofides generalised randomised response design and Bayesian methods

Author

Listed:
  • Shen‐Ming Lee
  • Truong‐Nhat Le
  • Phuoc‐Loc Tran
  • Chin‐Shang Li

Abstract

In empirical studies involving sensitive topics, in addition to the problem of estimating the population proportion with a sensitive characteristic, a question arises as to whether or not there is heterogeneity in the distribution of an auxiliary random variable representing the information of subjects collected from a sensitive group and a non‐sensitive group. That is, it is of interest to investigate the influence of sensitive attribute on the auxiliary random variable of interest. Finite mixture models are utilised to evaluate the association. A proposed Bayesian method through data augmentation and Markov chain Monte Carlo is applied to estimate unknown parameters of interest. Deviance information criterion and marginal likelihood are employed to select a suitable model to describe the association of the sensitive characteristic with the auxiliary random variable. Simulation and real data studies are conducted to assess the performance of and illustrate applications of the proposed methodology.

Suggested Citation

  • Shen‐Ming Lee & Truong‐Nhat Le & Phuoc‐Loc Tran & Chin‐Shang Li, 2022. "Investigating the association of a sensitive attribute with a random variable using the Christofides generalised randomised response design and Bayesian methods," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1471-1502, November.
  • Handle: RePEc:bla:jorssc:v:71:y:2022:i:5:p:1471-1502
    DOI: 10.1111/rssc.12585
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssc.12585
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssc.12585?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Liu, Chun & Liu, Qing, 2012. "Marginal likelihood calculation for the Gelfand–Dey and Chib methods," Economics Letters, Elsevier, vol. 115(2), pages 200-203.
    2. Migon, Helio S. & Tachibana, Vilma M., 1997. "Bayesian approximations in randomized response model," Computational Statistics & Data Analysis, Elsevier, vol. 24(4), pages 401-409, June.
    3. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    4. Li, Yong & Yu, Jun & Zeng, Tao, 2020. "Deviance information criterion for latent variable models and misspecified models," Journal of Econometrics, Elsevier, vol. 216(2), pages 450-493.
    5. Nicholas Tierney & Dianne Cook, 2018. "Expanding tidy data principles to facilitate missing data exploration, visualization and assessment of imputations," Monash Econometrics and Business Statistics Working Papers 14/18, Monash University, Department of Econometrics and Business Statistics.
    6. Pei-Chieh Chang & Kim-Hung Pho & Shen-Ming Lee & Chin-Shang Li, 2021. "Estimation of parameters of logistic regression for two-stage randomized response technique," Computational Statistics, Springer, vol. 36(3), pages 2111-2133, September.
    7. Shiow-Lan Gau & Jean Dieu Tapsoba & Shen-Ming Lee, 2014. "Bayesian approach for mixture models with grouped data," Computational Statistics, Springer, vol. 29(5), pages 1025-1043, October.
    8. Heiko Groenitz, 2015. "Using prior information in privacy-protecting survey designs for categorical sensitive variables," Statistical Papers, Springer, vol. 56(1), pages 167-189, February.
    9. Balgobin Nandram & Yuan Yu, 2019. "Bayesian Analysis of Sparse Counts Obtained From the Unrelated Question Design," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 8(5), pages 66-84, September.
    10. Hsieh, S.H. & Lee, S.M. & Shen, P.S., 2009. "Semiparametric analysis of randomized response data with missing covariates in logistic regression," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2673-2692, May.
    11. Graeme Blair & Kosuke Imai & Yang-Yang Zhou, 2015. "Design and Analysis of the Randomized Response Technique," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1304-1319, September.
    12. James Abernathy & Bernard Greenberg & Daniel Horvitz, 1970. "Estimates of induced abortion in urban North Carolina," Demography, Springer;Population Association of America (PAA), vol. 7(1), pages 19-29, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shen-Ming Lee & Phuoc-Loc Tran & Truong-Nhat Le & Chin-Shang Li, 2023. "Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model," Mathematics, MDPI, vol. 11(2), pages 1-21, January.
    2. Truong-Nhat Le & Shen-Ming Lee & Phuoc-Loc Tran & Chin-Shang Li, 2023. "Randomized Response Techniques: A Systematic Review from the Pioneering Work of Warner (1965) to the Present," Mathematics, MDPI, vol. 11(7), pages 1-26, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Truong-Nhat Le & Shen-Ming Lee & Phuoc-Loc Tran & Chin-Shang Li, 2023. "Randomized Response Techniques: A Systematic Review from the Pioneering Work of Warner (1965) to the Present," Mathematics, MDPI, vol. 11(7), pages 1-26, April.
    2. Shen-Ming Lee & Phuoc-Loc Tran & Truong-Nhat Le & Chin-Shang Li, 2023. "Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model," Mathematics, MDPI, vol. 11(2), pages 1-21, January.
    3. Kazuhiko Kakamu, 2022. "Bayesian analysis of mixtures of lognormal distribution with an unknown number of components from grouped data," Papers 2210.05115, arXiv.org, revised Sep 2023.
    4. Yang, Kai & Yu, Xinyang & Zhang, Qingqing & Dong, Xiaogang, 2022. "On MCMC sampling in self-exciting integer-valued threshold time series models," Computational Statistics & Data Analysis, Elsevier, vol. 169(C).
    5. Ye Yang & Osman Doğan & Süleyman Taşpınar, 2023. "Observed-data DIC for spatial panel data models," Empirical Economics, Springer, vol. 64(3), pages 1281-1314, March.
    6. Ye Yang & Osman Dogan & Suleyman Taspinar & Fei Jin, 2023. "A Review of Cross-Sectional Matrix Exponential Spatial Models," Papers 2311.14813, arXiv.org.
    7. Marco Gregori & Martijn G. Jong & Rik Pieters, 2024. "The Crosswise Model for Surveys on Sensitive Topics: A General Framework for Item Selection and Statistical Analysis," Psychometrika, Springer;The Psychometric Society, vol. 89(3), pages 1007-1033, September.
    8. Iseringhausen, Martin, 2020. "The time-varying asymmetry of exchange rate returns: A stochastic volatility – stochastic skewness model," Journal of Empirical Finance, Elsevier, vol. 58(C), pages 275-292.
    9. Arnab Kumar Maity & Sanjib Basu & Santu Ghosh, 2021. "Bayesian criterion‐based variable selection," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(4), pages 835-857, August.
    10. Chan, Joshua C.C., 2023. "Comparing stochastic volatility specifications for large Bayesian VARs," Journal of Econometrics, Elsevier, vol. 235(2), pages 1419-1446.
    11. Kim-Hung Pho & Michael McAleer, 2021. "Specification and Estimation of a Logistic Function, with Applications in the Sciences and Social Sciences," Advances in Decision Sciences, Asia University, Taiwan, vol. 25(2), pages 74-104, June.
    12. Fang Liu & Xiaojing Wang & Roeland Hancock & Ming-Hui Chen, 2022. "Bayesian Model Assessment for Jointly Modeling Multidimensional Response Data with Application to Computerized Testing," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1290-1317, December.
    13. Balgobin Nandram & Yuan Yu, 2019. "Bayesian Analysis of Sparse Counts Obtained From the Unrelated Question Design," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 8(5), pages 66-84, September.
    14. Liu, Xiaobin & Li, Yong & Yu, Jun & Zeng, Tao, 2022. "Posterior-based Wald-type statistics for hypothesis testing," Journal of Econometrics, Elsevier, vol. 230(1), pages 83-113.
    15. Buddhavarapu, Prasad & Bansal, Prateek & Prozzi, Jorge A., 2021. "A new spatial count data model with time-varying parameters," Transportation Research Part B: Methodological, Elsevier, vol. 150(C), pages 566-586.
    16. Mumtaz, Haroon & Theodoridis, Konstantinos, 2017. "Common and country specific economic uncertainty," Journal of International Economics, Elsevier, vol. 105(C), pages 205-216.
    17. Jesse Elliott & Zemin Bai & Shu-Ching Hsieh & Shannon E Kelly & Li Chen & Becky Skidmore & Said Yousef & Carine Zheng & David J Stewart & George A Wells, 2020. "ALK inhibitors for non-small cell lung cancer: A systematic review and network meta-analysis," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-18, February.
    18. Christina Leuker & Thorsten Pachur & Ralph Hertwig & Timothy J. Pleskac, 2019. "Do people exploit risk–reward structures to simplify information processing in risky choice?," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 5(1), pages 76-94, August.
    19. Francois Olivier & Laval Guillaume, 2011. "Deviance Information Criteria for Model Selection in Approximate Bayesian Computation," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-25, July.
    20. Raggi, Davide & Bordignon, Silvano, 2012. "Long memory and nonlinearities in realized volatility: A Markov switching approach," Computational Statistics & Data Analysis, Elsevier, vol. 56(11), pages 3730-3742.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssc:v:71:y:2022:i:5:p:1471-1502. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.