Printed from https://ideas.repec.org/a/bla/jorssa/v170y2007i2p483-501.html

Information integration for constructing social statistics: history, theory and ideas towards a research programme

Author

Listed:
  • D. H. Judson

Abstract

Summary. More precise policy making at all levels of government has fuelled tremendous demand for small area data—smaller than ever before. At the same time, there has been an unprecedented accumulation of data in geographic information systems, administrative records databases and more sophisticated survey sampling schemes. Researchers and practitioners have been trying to combine these diverse sources of data. But how should these diverse sources of data be combined in a way that is policy relevant and statistically principled? The paper illustrates these questions with several example applications at the state, county and local level: emerging geographic information systems databases, the need for estimates of small area income, poverty, demographic and uninsurance data by health authorities, and how administrative records databases (such as licensed day care facilities, traffic counts and unemployment insurance records) are being harvested for their information content. Finally, the paper proposes approaches for integrating these diverse sources of data with different error, uncertainty and quality profiles, and surveys persistent challenges in this area.

Suggested Citation

  • D. H. Judson, 2007. "Information integration for constructing social statistics: history, theory and ideas towards a research programme," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(2), pages 483-501, March.
  • Handle: RePEc:bla:jorssa:v:170:y:2007:i:2:p:483-501
    DOI: 10.1111/j.1467-985X.2007.00472.x

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-985X.2007.00472.x
    Download Restriction: no

    Citations

    Cited by:

    1. Bernard Baffour & Thomas King & Paolo Valente, 2013. "The Modern Census: Evolution, Examples and Evaluation," International Statistical Review, International Statistical Institute, vol. 81(3), pages 407-425, December.
    2. Nikos Tzavidis & Li‐Chun Zhang & Angela Luna & Timo Schmid & Natalia Rojas‐Perilla, 2018. "From start to finish: a framework for the production of small area official statistics," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 927-979, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Afshin Fallah & Mohsen Mohammadzadeh, 2010. "Bayesian regression analysis with linked data using mixture normal distributions," Statistical Papers, Springer, vol. 51(2), pages 421-430, June.
    2. Sabyasachi Bera & Snigdhansu Chatterjee, 2020. "High dimensional, robust, unsupervised record linkage," Statistics in Transition New Series, Polish Statistical Association, vol. 21(4), pages 123-143, August.
    3. Irene L. Hudson & Linda Moore & Eric J. Beh & David G. Steel, 2010. "Ecological inference techniques: an empirical evaluation using data describing gender and voter turnout at New Zealand elections, 1893–1919," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 173(1), pages 185-213, January.
    4. Noriah M. Al-Kandari & Partha Lahiri, 2016. "Prediction of a Function of Misclassified Binary Data," Statistics in Transition New Series, Polish Statistical Association, vol. 17(3), pages 429-447, September.
    5. Michael P. Cameron & William Cochrane, 2015. "Using Land-Use Modelling to Statistically Downscale Population Projections to Small Areas," Working Papers in Economics 15/12, University of Waikato.
    6. Josef Schürle, 2005. "A method for consideration of conditional dependencies in the Fellegi and Sunter model of record linkage," Statistical Papers, Springer, vol. 46(3), pages 433-449, July.
    7. Loughrey, Jason & O’Donoghue, Cathal & Meredith, David & Murphy, Ger & Shanahan, Ultan & Miller, Corina, 2018. "The Local Impact of Cattle Farming," 166th Seminar, August 30-31, 2018, Galway, West of Ireland 276231, European Association of Agricultural Economists.
    8. Abel Dasylva, 2018. "Design-Based Estimation with Record-Linked Administrative Files and a Clerical Review Sample," Journal of Official Statistics, Sciendo, vol. 34(1), pages 41-54, March.
    9. Fullerton, Thomas M., Jr. & Walke, Adam G. & Villavicencio, Diana, 2015. "An Econometric Approach for Modeling Population Change in Doña Ana County, New Mexico," MPRA Paper 71141, University Library of Munich, Germany, revised 28 Jan 2015.
    10. Ben Powell & Paul A. Smith, 2020. "Computing expectations and marginal likelihoods for permutations," Computational Statistics, Springer, vol. 35(2), pages 871-891, June.
    11. Ying Han, 2020. "Discussion of “Small area estimation: its evolution in five decades”, by Malay Ghosh," Statistics in Transition New Series, Polish Statistical Association, vol. 21(4), pages 30-34, August.
    12. Durrant, Gabriele B. & D'Arrigo, Julia & Steele, Fiona, 2011. "Using field process data to predict best times of contact conditioning on household and interviewer influences," LSE Research Online Documents on Economics 52201, London School of Economics and Political Science, LSE Library.
    13. Thomas Stringham, 2022. "Fast Bayesian Record Linkage With Record-Specific Disagreement Parameters," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1509-1522, October.
    14. Robert Haining & Jane Law, 2007. "Combining police perceptions with police records of serious crime areas: a modelling approach," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(4), pages 1019-1034, October.
    15. Kim, Gunky & Chambers, Raymond, 2012. "Regression analysis under incomplete linkage," Computational Statistics & Data Analysis, Elsevier, vol. 56(9), pages 2756-2770.
    16. Lovelace, Robin & Ballas, Dimitris & Watson, Matt, 2014. "A spatial microsimulation approach for the analysis of commuter patterns: from individual to regional levels," Journal of Transport Geography, Elsevier, vol. 34(C), pages 282-296.
    17. Tatiana Komarova & Denis Nekipelov & Evgeny Yakovlev, 2018. "Identification, data combination, and the risk of disclosure," Quantitative Economics, Econometric Society, vol. 9(1), pages 395-440, March.
    18. Vo, Thanh Huan & Chauvet, Guillaume & Happe, André & Oger, Emmanuel & Paquelet, Stéphane & Garès, Valérie, 2023. "Extending the Fellegi-Sunter record linkage model for mixed-type data with application to the French national health data system," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    19. Mary Layne & Deborah Wagner & Cynthia Rothhaas, 2014. "Estimating Record Linkage False Match Rate for the Person Identification Validation System," CARRA Working Papers 2014-02, Center for Economic Studies, U.S. Census Bureau.
