IDEAS home Printed from https://ideas.repec.org/p/col/000089/012061.html
   My bibliography  Save this paper

A derivation of the optimal answer-copying index and some applications

Author

Listed:
  • Mauricio Romero
  • Álvaro Riascos
  • Diego Jara

Abstract

Multiple choice exams are frequently used as an efficient and objective instrument to evaluate knowledge. Nevertheless, they are more vulnerable to answer-copying than tests based on open questions. Several statistical tests (known as indices) have been proposed to detect cheating but to the best of our knowledge they all lack a mathematical support that guarantees optimality in any sense. This work aims at filling this void by deriving the uniform most powerful (UMP) test assuming the response distribution is known. In practice we must estimate a behavioral model that yields a response distribution for each question. We calculate the empirical type-I and type-II error rates for several indices, that assume different behavioral models, using simulations based on real data from twelve nation wide multiple choice exams taken by 5th and 9th graders in Colombia. We find that the index with the highest power among those studied, subject to the restriction of preserving the type-I error, is the one that uses a nominal response model for item answering, conditions on the answers of the individual suspected of being the source of copy and calculates critical values via a normal approximation. This index was first studied by Wollack (1997) and later by W. Van der Linden and Sotaridona (2006) and is superior to the indices studied and developed by Wesolowsky (2000) and Frary, Tideman, and Watts (1977). Furthermore, we compare the performance of the indices on examination rooms with different levels of proctoring and find that increasing the level of proctoring can reduce copying by as much as 50% and that simple strategies such as having different students answer different portions of the test at different times canal so reduce cheating by over 50%. Finally, a Bonferroni type false discovery rate procedure is used to detect massive cheating. The application is straightforward and we believe it could be use to make entire examination rooms retake an exam under stricter surveillance conditions.

Suggested Citation

  • Mauricio Romero & Álvaro Riascos & Diego Jara, 2014. "A derivation of the optimal answer-copying index and some applications," Documentos CEDE 12061, Universidad de los Andes, Facultad de Economía, CEDE.
  • Handle: RePEc:col:000089:012061
    as

    Download full text from publisher

    File URL: https://repositorio.uniandes.edu.co/bitstream/handle/1992/8507/dcede2014-32.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. George Wesolowsky, 2000. "Detecting excessive similarity in answers on multiple choice exams," Journal of Applied Statistics, Taylor & Francis Journals, vol. 27(7), pages 909-921.
    2. Diego Jara & Álvaro Riascos & Mauricio Romero, 2010. "Detección de copia en pruebas del Estado," Documentos CEDE 7093, Universidad de los Andes, Facultad de Economía, CEDE.
    3. Alexander Shapiro & Jos Berge, 2002. "Statistical inference of minimum rank factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 67(1), pages 79-94, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Li, Tao & Zhou, Yisu, 2017. "Do Pay-for-Grades Programs Encourage Student Cheating? Evidence from a randomized experiment," SocArXiv ck9z6, Center for Open Science.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mauricio Romero & Ã lvaro Riascos & Diego Jara, 2015. "On the Optimality of Answer-Copying Indices," Journal of Educational and Behavioral Statistics, , vol. 40(5), pages 435-453, October.
    2. Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m-dependent random variables," Statistics & Probability Letters, Elsevier, vol. 129(C), pages 171-181.
    3. Denter, Philipp & Sisak, Dana, 2015. "Do polls create momentum in political competition?," Journal of Public Economics, Elsevier, vol. 130(C), pages 1-14.
    4. Salgado Alfredo, 2018. "Incomplete Information and Costly Signaling in College Admissions," Working Papers 2018-23, Banco de México.
    5. Albrecht, James & Anderson, Axel & Vroman, Susan, 2010. "Search by committee," Journal of Economic Theory, Elsevier, vol. 145(4), pages 1386-1407, July.
    6. Blier-Wong, Christopher & Cossette, Hélène & Marceau, Etienne, 2023. "Risk aggregation with FGM copulas," Insurance: Mathematics and Economics, Elsevier, vol. 111(C), pages 102-120.
    7. Simon Bruhn & Thomas Grebel & Lionel Nesta, 2023. "The fallacy in productivity decomposition," Journal of Evolutionary Economics, Springer, vol. 33(3), pages 797-835, July.
    8. Wim J. van der Linden, 2019. "Lord’s Equity Theorem Revisited," Journal of Educational and Behavioral Statistics, , vol. 44(4), pages 415-430, August.
    9. Simar, Léopold & Wilson, Paul, 2022. "Modern Tools for Evaluating the Performance of Health-Care Providers," LIDAM Discussion Papers ISBA 2022006, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    10. Baey, Charlotte & Didier, Anne & Lemaire, Sébastien & Maupas, Fabienne & Cournède, Paul-Henry, 2013. "Modelling the interindividual variability of organogenesis in sugar beet populations using a hierarchical segmented model," Ecological Modelling, Elsevier, vol. 263(C), pages 56-63.
    11. Tasche, Dirk, 2013. "Bayesian estimation of probabilities of default for low default portfolios," Journal of Risk Management in Financial Institutions, Henry Stewart Publications, vol. 6(3), pages 302-326, July.
    12. Diers, Dorothea & Linde, Marc & Hahn, Lukas, 2016. "Addendum to ‘The multi-year non-life insurance risk in the additive reserving model’ [Insurance Math. Econom. 52(3) (2013) 590–598]: Quantification of multi-year non-life insurance risk in chain ladde," Insurance: Mathematics and Economics, Elsevier, vol. 67(C), pages 187-199.
    13. Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m -dependent random variables," LSE Research Online Documents on Economics 83635, London School of Economics and Political Science, LSE Library.
    14. Hirschberg, Joe & Lye, Jenny, 2017. "Inverting the indirect—The ellipse and the boomerang: Visualizing the confidence intervals of the structural coefficient from two-stage least squares," Journal of Econometrics, Elsevier, vol. 199(2), pages 173-183.
    15. Serguei Kaniovski & Alexander Zaigraev, 2018. "The probability of majority inversion in a two-stage voting system with three states," Theory and Decision, Springer, vol. 84(4), pages 525-546, June.
    16. Packham, Natalie & Woebbeking, Fabian, 2021. "Correlation scenarios and correlation stress testing," IRTG 1792 Discussion Papers 2021-012, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    17. Guillermo Martínez-Flórez & Roger Tovar-Falón & Carlos Barrera-Causil, 2022. "Inflated Unit-Birnbaum-Saunders Distribution," Mathematics, MDPI, vol. 10(4), pages 1-14, February.
    18. Xyngis, Georgios, 2017. "Business-cycle variation in macroeconomic uncertainty and the cross-section of expected returns: Evidence for scale-dependent risks," Journal of Empirical Finance, Elsevier, vol. 44(C), pages 43-65.
    19. Nair, Gopalan M. & Turlach, Berwin A., 2012. "The stochastic h-index," Journal of Informetrics, Elsevier, vol. 6(1), pages 80-87.
    20. Riko Kelter, 2021. "Analysis of type I and II error rates of Bayesian and frequentist parametric and nonparametric two-sample hypothesis tests under preliminary assessment of normality," Computational Statistics, Springer, vol. 36(2), pages 1263-1288, June.

    More about this item

    Keywords

    : Index; Answer Copying; False Discovery Rate; Neyman-Pearson Lemma;
    All these keywords.

    JEL classification:

    • C19 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Other
    • I20 - Health, Education, and Welfare - - Education - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:col:000089:012061. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Universidad De Los Andes-Cede (email available below). General contact details of provider: https://edirc.repec.org/data/ceandco.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.