IDEAS home Printed from https://ideas.repec.org/a/bpj/mcmeap/v30y2024i4p331-363n1001.html
   My bibliography  Save this article

Investigating the ecological fallacy through sampling distributions constructed from finite populations

Author

Listed:
  • Torres David J.

    (Department of Mathematics and Physical Science, Northern New Mexico College, Española, NM 87532, USA)

  • Rouson Damain

    (Computer Languages and Systems Software Group, Lawrence Berkeley National Laboratory, Berkeley, California, USA)

Abstract

Correlation coefficients and linear regression values computed from group averages can differ from correlation coefficients and linear regression values computed using individual scores. This observation known as the ecological fallacy often assumes that all the individual scores are available from a population. In many situations, one must use a sample from the larger population. In such cases, the computed correlation coefficient and linear regression values will depend on the sample that is chosen and the underlying sampling distribution. The sampling distribution of correlation coefficients and linear regression values for group averages will be identical to the sampling distribution for individuals for normally distributed variables for random samples drawn from infinitely large continuous distributions. However, data that is acquired in practice is often acquired when sampling without replacement from a finite population. Our objective is to demonstrate through Monte Carlo simulations that the sampling distributions for correlation and linear regression will also be similar for individuals and group averages when sampling without replacement from normally distributed variables. These simulations suggest that when a random sample from a population is selected, the correlation coefficients and linear regression values computed from individual scores will not be more accurate in estimating the entire population values compared to samples when group averages are used as long as the sample size is the same.

Suggested Citation

  • Torres David J. & Rouson Damain, 2024. "Investigating the ecological fallacy through sampling distributions constructed from finite populations," Monte Carlo Methods and Applications, De Gruyter, vol. 30(4), pages 331-363.
  • Handle: RePEc:bpj:mcmeap:v:30:y:2024:i:4:p:331-363:n:1001
    DOI: 10.1515/mcma-2024-2013
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/mcma-2024-2013
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/mcma-2024-2013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. N. Cleave & P. J. Brown & C. D. Payne, 1995. "Evaluation of Methods for Ecological Inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 158(1), pages 55-72, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    2. Carolina Plescia & Lorenzo De Sio, 2018. "An evaluation of the performance and suitability of R × C methods for ecological inference with known true values," Quality & Quantity: International Journal of Methodology, Springer, vol. 52(2), pages 669-683, March.
    3. Roberto Colombi & Antonio Forcina, 2016. "Latent class models for ecological inference on voters transitions," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 25(4), pages 501-517, November.
    4. Robin Lovelace & Mark Birkin & Dimitris Ballas & Eveline van Leeuwen, 2015. "Evaluating the Performance of Iterative Proportional Fitting for Spatial Microsimulation: New Tests for an Established Technique," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 18(2), pages 1-21.
    5. Gillian A. Lancaster & Mick Green & Steven Lane, 2006. "Reducing bias in ecological studies: an evaluation of different methodologies," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 169(4), pages 681-700, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:mcmeap:v:30:y:2024:i:4:p:331-363:n:1001. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.