IDEAS home Printed from https://ideas.repec.org/a/sae/jedbes/v48y2023i3p349-383.html
   My bibliography  Save this article

Assessing Inter-rater Reliability With Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables

Author

Listed:
  • Patrícia Martinková

    (Institute of Computer Science of the Czech Academy of Sciences, Charles University)

  • FrantiÅ¡ek BartoÅ¡

    (Institute of Computer Science of the Czech Academy of Sciences, University of Amsterdam)

  • Marek Brabec

    (Institute of Computer Science of the Czech Academy of Sciences)

Abstract

Inter-rater reliability (IRR), which is a prerequisite of high-quality ratings and assessments, may be affected by contextual variables, such as the rater’s or ratee’s gender, major, or experience. Identification of such heterogeneity sources in IRR is important for the implementation of policies with the potential to decrease measurement error and to increase IRR by focusing on the most relevant subgroups. In this study, we propose a flexible approach for assessing IRR in cases of heterogeneity due to covariates by directly modeling differences in variance components. We use Bayes factors (BFs) to select the best performing model, and we suggest using Bayesian model averaging as an alternative approach for obtaining IRR and variance component estimates, allowing us to account for model uncertainty. We use inclusion BFs considering the whole model space to provide evidence for or against differences in variance components due to covariates. The proposed method is compared with other Bayesian and frequentist approaches in a simulation study, and we demonstrate its superiority in some situations. Finally, we provide real data examples from grant proposal peer review, demonstrating the usefulness of this method and its flexibility in the generalization of more complex designs.

Suggested Citation

  • Patrícia Martinková & FrantiÅ¡ek BartoÅ¡ & Marek Brabec, 2023. "Assessing Inter-rater Reliability With Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables," Journal of Educational and Behavioral Statistics, , vol. 48(3), pages 349-383, June.
  • Handle: RePEc:sae:jedbes:v:48:y:2023:i:3:p:349-383
    DOI: 10.3102/10769986221150517
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.3102/10769986221150517
    Download Restriction: no

    File URL: https://libkey.io/10.3102/10769986221150517?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Patrícia Martinková & Dan Goldhaber & Elena Erosheva, 2018. "Disparities in ratings of internal and external applicants: A case for model-based inter-rater reliability," PLOS ONE, Public Library of Science, vol. 13(10), pages 1-17, October.
    2. Tiago M. Fragoso & Wesley Bertoli & Francisco Louzada, 2018. "Bayesian Model Averaging: A Systematic Review and Conceptual Classification," International Statistical Review, International Statistical Institute, vol. 86(1), pages 1-28, April.
    3. Goldhaber, Dan & Grout, Cyrus & Wolff, Malcolm & Martinková, Patrícia, 2021. "Evidence on the Dimensionality and Reliability of Professional References’ Ratings of Teacher Applicants," Economics of Education Review, Elsevier, vol. 83(C).
    4. Rüdiger Mutz & Lutz Bornmann & Hans-Dieter Daniel, 2012. "Heterogeneity of Inter-Rater Reliabilities of Grant Peer Reviews and Its Determinants: A General Estimating Equations Approach," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-10, October.
    5. Jeffrey N. Rouder & Richard D. Morey, 2019. "Teaching Bayes’ Theorem: Strength of Evidence as Predictive Accuracy," The American Statistician, Taylor & Francis Journals, vol. 73(2), pages 186-190, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roland Brown & Yingling Fan & Kirti Das & Julian Wolfson, 2021. "Iterated multisource exchangeability models for individualized inference with an application to mobile sensor data," Biometrics, The International Biometric Society, vol. 77(2), pages 401-412, June.
    2. Hyemin Han, 2024. "Bayesian Model Averaging and Regularized Regression as Methods for Data-Driven Model Exploration, with Practical Considerations," Stats, MDPI, vol. 7(3), pages 1-13, July.
    3. Emanuel Kopp, 2018. "Determinants of U.S. Business Investment," IMF Working Papers 2018/139, International Monetary Fund.
    4. Liao, Jun & Zou, Guohua, 2020. "Corrected Mallows criterion for model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    5. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    6. Goldhaber, Dan & Grout, Cyrus & Wolff, Malcolm & Martinková, Patrícia, 2021. "Evidence on the Dimensionality and Reliability of Professional References’ Ratings of Teacher Applicants," Economics of Education Review, Elsevier, vol. 83(C).
    7. Díaz, Juan D. & Hansen, Erwin & Cabrera, Gabriel, 2021. "Economic drivers of commodity volatility: The case of copper," Resources Policy, Elsevier, vol. 73(C).
    8. Mihai MUTASCU & Nicolae-Bogdan IANC & ALBERT LESSOUA, 2021. "Public debt and inequality in Sub-Saharan Africa: the case of EMCCA and WAEMU countries," LEO Working Papers / DR LEO 2909, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    9. Marcin Błażejowski & Jacek Kwiatkowski & Paweł Kufel, 2020. "BACE and BMA Variable Selection and Forecasting for UK Money Demand and Inflation with Gretl," Econometrics, MDPI, vol. 8(2), pages 1-29, May.
    10. Huihang Liu & Xinyu Zhang, 2023. "Frequentist model averaging for undirected Gaussian graphical models," Biometrics, The International Biometric Society, vol. 79(3), pages 2050-2062, September.
    11. Francisco Alonso & Sergio A. Useche & Eliseo Valle & Cristina Esteban & Javier Gene-Morales, 2021. "Could Road Safety Education (RSE) Help Parents Protect Children? Examining Their Driving Crashes with Children on Board," IJERPH, MDPI, vol. 18(7), pages 1-13, March.
    12. Grover,Arti Goswami & Lall,Somik V. & Timmis,Jonathan David, 2021. "Agglomeration Economies in Developing Countries : A Meta-Analysis," Policy Research Working Paper Series 9730, The World Bank.
    13. Yin-Wong Cheung & Wenhao Wang, 2020. "A Jackknife Model Averaging Analysis of RMB Misalignment Estimates," Journal of International Commerce, Economics and Policy (JICEP), World Scientific Publishing Co. Pte. Ltd., vol. 11(02), pages 1-45, June.
    14. Marwah Soliman & Vyacheslav Lyubchich & Yulia R. Gel, 2020. "Ensemble forecasting of the Zika space‐time spread with topological data analysis," Environmetrics, John Wiley & Sons, Ltd., vol. 31(7), November.
    15. Enrique Labrada & Luis Huesca, "undated". "Data management in household income and expenditure surveys: Working with extended families using Stata," Mexican Stata Conference 2023 19, Stata Users Group.
    16. Jonathan Berrisch & Florian Ziel, 2021. "CRPS Learning," Papers 2102.00968, arXiv.org, revised Nov 2021.
    17. Marzieh Khajehali & Hamid R. Safavi & Mohammad Reza Nikoo & Mahmood Fooladi, 2024. "A fusion-based framework for daily flood forecasting in multiple-step-ahead and near-future under climate change scenarios: a case study of the Kan River, Iran," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 120(9), pages 8483-8504, July.
    18. Tihana Škrinjarić, 2023. "Credit-to-GDP Gap Estimates in Real Time: A Stable Indicator for Macroprudential Policy Making in Croatia," Comparative Economic Studies, Palgrave Macmillan;Association for Comparative Economic Studies, vol. 65(3), pages 582-614, September.
    19. Minerva Mukhopadhyay & Sourabh Bhattacharya, 2022. "Bayes factor asymptotics for variable selection in the Gaussian process framework," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 581-613, June.
    20. Wei Zhou & Eoghan O'Neill & Alice Moncaster & David M Reiner & Peter Guthrie, 2019. "Applying Bayesian Model Averaging to Characterise Urban Residential Stock Turnover Dynamics," Working Papers EPRG1933, Energy Policy Research Group, Cambridge Judge Business School, University of Cambridge.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:jedbes:v:48:y:2023:i:3:p:349-383. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.