IDEAS home Printed from https://ideas.repec.org/a/gam/jstats/v5y2022i3p49-855d903399.html
   My bibliography  Save this article

A New Benford Test for Clustered Data with Applications to American Elections

Author

Listed:
  • Katherine M. Anderson

    (Department of Economics, Brigham Young University-Idaho, 298 S 1st East SMI 214, Rexburg, ID 83460, USA)

  • Kevin Dayaratna

    (Center for Data Analysis, The Heritage Foundation, 214 Massachusetts Ave. NE, Washington, DC 20002, USA)

  • Drew Gonshorowski

    (Center for Data Analysis, The Heritage Foundation, 214 Massachusetts Ave. NE, Washington, DC 20002, USA)

  • Steven J. Miller

    (Department of Mathematics and Statistics, Williams College, 880 Main St., Williamstown, MA 01267, USA)

Abstract

A frequent problem with classic first digit applications of Benford’s law is the law’s inapplicability to clustered data, which becomes especially problematic for analyzing election data. This study offers a novel adaptation of Benford’s law by performing a first digit analysis after converting vote counts from election data to base 3 (referred to throughout the paper as 1-BL 3), spreading out the data and thus rendering the law significantly more useful. We test the efficacy of our approach on synthetic election data using discrete Weibull modeling, finding in many cases that election data often conforms to 1-BL 3. Lastly, we apply 1-BL 3 analysis to selected states from the 2004 US Presidential election to detect potential statistical anomalies.

Suggested Citation

  • Katherine M. Anderson & Kevin Dayaratna & Drew Gonshorowski & Steven J. Miller, 2022. "A New Benford Test for Clustered Data with Applications to American Elections," Stats, MDPI, vol. 5(3), pages 1-15, August.
  • Handle: RePEc:gam:jstats:v:5:y:2022:i:3:p:49-855:d:903399
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/5/3/49/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/5/3/49/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Deckert, Joseph & Myagkov, Mikhail & Ordeshook, Peter C., 2011. "Benford's Law and the Detection of Election Fraud," Political Analysis, Cambridge University Press, vol. 19(3), pages 245-268, July.
    2. Villas-Boas, Sofia B. & Fu, Qiuzi & Judge, George, 2017. "Benford’s law and the FSD distribution of economic behavioral micro data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 486(C), pages 711-719.
    3. Alina Peluso & Veronica Vinciotti & Keming Yu, 2019. "Discrete Weibull generalized additive model: an application to count fertility data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 68(3), pages 565-583, April.
    4. Engel, Hans-Andreas & Leuenberger, Christoph, 2003. "Benford's law for exponential random variables," Statistics & Probability Letters, Elsevier, vol. 63(4), pages 361-365, July.
    5. Theoharry Grammatikos & Nikolaos I. Papanikolaou, 2021. "Applying Benford’s Law to Detect Accounting Data Manipulation in the Banking Industry," Journal of Financial Services Research, Springer;Western Finance Association, vol. 59(1), pages 115-142, April.
    6. James D. Englehardt & Ruochen Li, 2011. "The Discrete Weibull Distribution: An Alternative for Correlated Counts with Confirmation for Microbial Counts in Water," Risk Analysis, John Wiley & Sons, vol. 31(3), pages 370-381, March.
    7. Steven J. Miller, 2015. "Benford's Law: Theory and Applications," Economics Books, Princeton University Press, edition 1, number 10527.
    8. Riccardo Patriarca & Tianya Hu & Francesco Costantino & Giulio Di Gravio & Massimo Tronci, 2019. "A System-Approach for Recoverable Spare Parts Management Using the Discrete Weibull Distribution," Sustainability, MDPI, vol. 11(19), pages 1-15, September.
    9. Steven J. Miller & Mark J. Nigrini, 2008. "Order Statistics and Benford's Law," International Journal of Mathematics and Mathematical Sciences, Hindawi, vol. 2008, pages 1-19, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Don Lemons & Nathan Lemons & William Peter, 2021. "First Digit Oscillations," Stats, MDPI, vol. 4(3), pages 1-7, July.
    2. Gueron, Eduardo & Pellegrini, Jerônimo, 2022. "Application of Benford–Newcomb law with base change to electoral fraud detection," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 607(C).
    3. Huang, Yasheng & Niu, Zhiyong & Yang, Clair, 2020. "Testing firm-level data quality in China against Benford’s Law," Economics Letters, Elsevier, vol. 192(C).
    4. Dang, Canh Thien & Owens, Trudy, 2020. "Does transparency come at the cost of charitable services? Evidence from investigating British charities," Journal of Economic Behavior & Organization, Elsevier, vol. 172(C), pages 314-343.
    5. Ananyev, Maxim & Poyker, Michael, 2022. "Do dictators signal strength with electoral fraud?," European Journal of Political Economy, Elsevier, vol. 71(C).
    6. Pourya Pourhejazy, 2020. "Destruction Decisions for Managing Excess Inventory in E-Commerce Logistics," Sustainability, MDPI, vol. 12(20), pages 1-12, October.
    7. Christoph Koenig, 2024. "With a Little Help From the Crowd: Estimating Election Fraud with Forensic Methods," CEIS Research Paper 584, Tor Vergata University, CEIS, revised 28 Oct 2024.
    8. Montag, Josef, 2017. "Identifying odometer fraud in used car market data," Transport Policy, Elsevier, vol. 60(C), pages 10-23.
    9. Ausloos, Marcel & Ficcadenti, Valerio & Dhesi, Gurjeet & Shakeel, Muhammad, 2021. "Benford’s laws tests on S&P500 daily closing values and the corresponding daily log-returns both point to huge non-conformity," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 574(C).
    10. Arno Berger & Theodore P. Hill, 2021. "The mathematics of Benford’s law: a primer," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(3), pages 779-795, September.
    11. Hao, Zhuang & Zhang, Xudong & Wang, Yuze, 2024. "Assessing the accuracy of self-reported health expenditure data: Evidence from two public surveys in China," Social Science & Medicine, Elsevier, vol. 356(C).
    12. Schräpler Jörg-Peter, 2011. "Benford’s Law as an Instrument for Fraud Detection in Surveys Using the Data of the Socio-Economic Panel (SOEP)," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(5-6), pages 685-718, October.
    13. Arezzo, Maria Felice & Cerqueti, Roy, 2023. "A Benford’s Law view of inspections’ reasonability," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 632(P1).
    14. Lucio Barabesi & Andrea Cerioli & Domenico Perrotta, 2021. "Forum on Benford’s law and statistical methods for the detection of frauds," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(3), pages 767-778, September.
    15. Barbiero, A., 2019. "A bivariate count model with discrete Weibull margins," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 156(C), pages 91-109.
    16. Dlugosz, Stephan & Müller-Funk, Ulrich, 2012. "Ziffernanalyse zur Betrugserkennung in Finanzverwaltungen: Prüfung von Kassenbelegen," Arbeitsberichte des Instituts für Wirtschaftsinformatik 133, University of Münster, Department of Information Systems.
    17. Cerqueti, Roy & Maggi, Mario, 2021. "Data validity and statistical conformity with Benford’s Law," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).
    18. Madeleine Farris & Noah Luntzlara & Steven J. Miller & Lily Shao & Mengxi Wang, 2021. "Recurrence relations and Benford’s law," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(3), pages 797-817, September.
    19. de Araújo Silva, Archibald & Aparecida Gouvêa, Maria, 2023. "Study on the effect of sample size on type I error, in the first, second and first-two digits excessmad tests," International Journal of Accounting Information Systems, Elsevier, vol. 48(C).
    20. Vegard Nilsen & John Wyller, 2016. "QMRA for Drinking Water: 2. The Effect of Pathogen Clustering in Single‐Hit Dose‐Response Models," Risk Analysis, John Wiley & Sons, vol. 36(1), pages 163-181, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:5:y:2022:i:3:p:49-855:d:903399. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.