IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v8y2023i11p165-d1271499.html
   My bibliography  Save this article

Can We Mathematically Spot the Possible Manipulation of Results in Research Manuscripts Using Benford’s Law?

Author

Listed:
  • Teddy Lazebnik

    (Department of Cancer Biology, Cancer Institute, University College London, London WC1E 6BT, UK)

  • Dan Gorlitsky

    (Department of Economics, Reichman University, Herzliya 4610101, Israel)

Abstract

The reproducibility of academic research has long been a persistent issue, contradicting one of the fundamental principles of science. Recently, there has been an increasing number of false claims found in academic manuscripts, casting doubt on the validity of reported results. In this paper, we utilize an adapted version of Benford’s law, a statistical phenomenon that describes the distribution of leading digits in naturally occurring datasets, to identify the potential manipulation of results in research manuscripts, solely using the aggregated data presented in those manuscripts rather than the commonly unavailable raw datasets. Our methodology applies the principles of Benford’s law to commonly employed analyses in academic manuscripts, thus reducing the need for the raw data itself. To validate our approach, we employed 100 open-source datasets and successfully predicted 79 % of them accurately using our rules. Moreover, we tested the proposed method on known retracted manuscripts, showing that around half (48.6%) can be detected using the proposed method. Additionally, we analyzed 100 manuscripts published in the last two years across ten prominent economic journals, with 10 manuscripts randomly sampled from each journal. Our analysis predicted a 3 % occurrence of results manipulation with a 96 % confidence level. Our findings show that Benford’s law adapted for aggregated data, can be an initial tool for identifying data manipulation; however, it is not a silver bullet, requiring further investigation for each flagged manuscript due to the relatively low prediction accuracy.

Suggested Citation

  • Teddy Lazebnik & Dan Gorlitsky, 2023. "Can We Mathematically Spot the Possible Manipulation of Results in Research Manuscripts Using Benford’s Law?," Data, MDPI, vol. 8(11), pages 1-11, October.
  • Handle: RePEc:gam:jdataj:v:8:y:2023:i:11:p:165-:d:1271499
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/8/11/165/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/8/11/165/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Bauer Johannes & Groß Jochen, 2011. "Difficulties Detecting Fraud? The Use of Benford’s Law on Regression Tables," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(5-6), pages 733-748, October.
    2. List, John A, et al, 2001. "Academic Economists Behaving Badly? A Survey on Three Areas of Unethical Behavior," Economic Inquiry, Western Economic Association International, vol. 39(1), pages 162-170, January.
    3. Druică, Elena & Oancea, Bogdan & Vâlsan, Călin, 2018. "Benford's law and the limits of digit analysis," International Journal of Accounting Information Systems, Elsevier, vol. 31(C), pages 75-82.
    4. Nicholas Walker & Kristy Holtfreter, 2015. "Applying criminological theory to academic fraud," Journal of Financial Crime, Emerald Group Publishing Limited, vol. 22(1), pages 48-62, January.
    5. Andreas Diekmann, 2007. "Not the First Digit! Using Benford's Law to Detect Fraudulent Scientif ic Data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 34(3), pages 321-329.
    6. Teddy Lazebnik & Tzach Fleischer & Amit Yaniv-Rosenfeld, 2023. "Benchmarking Biologically-Inspired Automatic Machine Learning for Economic Tasks," Sustainability, MDPI, vol. 15(14), pages 1-9, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Horton, Joanne & Krishna Kumar, Dhanya & Wood, Anthony, 2020. "Detecting academic fraud using Benford law: The case of Professor James Hunton," Research Policy, Elsevier, vol. 49(8).
    2. Philip E Hulme & Danish A Ahmed & Phillip J Haubrock & Brooks A Kaiser & Melina Kourantidou & Boris Leroy & Shana M Mcdermott, 2024. "Widespread imprecision in estimates of the economic costs of invasive alien species worldwide," Post-Print hal-04633043, HAL.
    3. Călin Vâlsan & Andreea-Ionela Puiu & Elena Druică, 2024. "From Whence Commeth Data Misreporting? A Survey of Benford’s Law and Digit Analysis in the Time of the COVID-19 Pandemic," Mathematics, MDPI, vol. 12(16), pages 1-20, August.
    4. Jalan, Akanksha & Matkovskyy, Roman & Yarovaya, Larisa, 2021. "“Shiny” crypto assets: A systemic look at gold-backed cryptocurrencies during the COVID-19 pandemic," International Review of Financial Analysis, Elsevier, vol. 78(C).
    5. Bruno S. Frey, 2010. "Withering Academia?," CESifo Working Paper Series 3209, CESifo.
    6. de Araújo Silva, Archibald & Aparecida Gouvêa, Maria, 2023. "Study on the effect of sample size on type I error, in the first, second and first-two digits excessmad tests," International Journal of Accounting Information Systems, Elsevier, vol. 48(C).
    7. Walter R. Schumm & Duane W. Crawford & Lorenza Lockett & Asma bin Ateeq & Abdullah AlRashed, 2023. "Can Retracted Social Science Articles Be Distinguished from Non-Retracted Articles by Some of the Same Authors, Using Benford’s Law or Other Statistical Methods?," Publications, MDPI, vol. 11(1), pages 1-13, March.
    8. Roy Cerqueti & Claudio Lupi, 2023. "Severe testing of Benford’s law," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(2), pages 677-694, June.
    9. John A. List, 2024. "Optimally generate policy-based evidence before scaling," Nature, Nature, vol. 626(7999), pages 491-499, February.
    10. Gary Charness & David Masclet & Marie Claire Villeval, 2014. "The Dark Side of Competition for Status," Management Science, INFORMS, vol. 60(1), pages 38-55, January.
    11. Brian Fabo & Martina Jancokova & Elisabeth Kempf & Lubos Pastor, 2020. "Fifty Shades of QE: Conflicts of Interest in Economic Research," Working Papers 2020-128, Becker Friedman Institute for Research In Economics.
    12. Denis Fougère & Nicolas Jacquemet, 2020. "Policy Evaluation Using Causal Inference Methods," SciencePo Working papers Main hal-03455978, HAL.
    13. Sitsofe Tsagbey & Miguel de Carvalho & Garritt L. Page, 2017. "All Data are Wrong, but Some are Useful? Advocating the Need for Data Auditing," The American Statistician, Taylor & Francis Journals, vol. 71(3), pages 231-235, July.
    14. Dato, Simon & Nieken, Petra, 2014. "Gender differences in competition and sabotage," Journal of Economic Behavior & Organization, Elsevier, vol. 100(C), pages 64-80.
    15. Hazra, Ummaha & Priyo, Asad Karim Khan, 2022. "Unethical practices in online classes during COVID-19 pandemic: an analysis of affordances using routine activity theory," MPRA Paper 117853, University Library of Munich, Germany.
    16. Necker, Sarah, 2014. "Scientific misbehavior in economics," Research Policy, Elsevier, vol. 43(10), pages 1747-1759.
    17. Schwieren, Christiane & Weichselbaumer, Doris, 2010. "Does competition enhance performance or cheating? A laboratory experiment," Journal of Economic Psychology, Elsevier, vol. 31(3), pages 241-253, June.
    18. Pei, Duo & Vasarhelyi, Miklos A., 2020. "Big data and algorithmic trading against periodic and tangible asset reporting: The need for U-XBRL," International Journal of Accounting Information Systems, Elsevier, vol. 37(C).
    19. Lattarulo, Patrizia & Masucci, Valentino & Pazienza, Maria Grazia, 2019. "Resistance to change: Car use and routines," Transport Policy, Elsevier, vol. 74(C), pages 63-72.
    20. Banerjee, Ritwik & Mitra, Arnab, 2018. "On monetary and non-monetary interventions to combat corruption," Journal of Economic Behavior & Organization, Elsevier, vol. 149(C), pages 332-355.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:8:y:2023:i:11:p:165-:d:1271499. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.