IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v3y2018i3p30-d162841.html
   My bibliography  Save this article

Linking Synthetic Populations to Household Geolocations: A Demonstration in Namibia

Author

Listed:
  • Dana R. Thomson

    (Flowminder Foundation, SE-11355 Stockholm, Sweden
    WorldPop, Department of Geography and Environment, University of Southampton, Southampton SO17 1BJ, UK
    Department of Social Statistics, University of Southampton, Southampton SO17 1BJ, UK
    These authors contributed equally to this work.)

  • Lieke Kools

    (Department of Economics, Leiden University, 2311 EZ Leiden, The Netherlands
    These authors contributed equally to this work.)

  • Warren C. Jochem

    (Flowminder Foundation, SE-11355 Stockholm, Sweden
    WorldPop, Department of Geography and Environment, University of Southampton, Southampton SO17 1BJ, UK)

Abstract

Whether evaluating gridded population dataset estimates (e.g., WorldPop, LandScan) or household survey sample designs, a population census linked to residential locations are needed. Geolocated census microdata data, however, are almost never available and are thus best simulated. In this paper, we simulate a close-to-reality population of individuals nested in households geolocated to realistic building locations. Using the R simPop package and ArcGIS, multiple realizations of a geolocated synthetic population are derived from the Namibia 2011 census 20% microdata sample, Namibia census enumeration area boundaries, Namibia 2013 Demographic and Health Survey (DHS), and dozens of spatial covariates derived from publicly available datasets. Realistic household latitude-longitude coordinates are manually generated based on public satellite imagery. Simulated households are linked to latitude-longitude coordinates by identifying distinct household types with multivariate k-means analysis and modelling a probability surface for each household type using Random Forest machine learning methods. We simulate five realizations of a synthetic population in Namibia’s Oshikoto region, including demographic, socioeconomic, and outcome characteristics at the level of household, woman, and child. Comparison of variables in the synthetic population were made with 2011 census 20% sample and 2013 DHS data by primary sampling unit/enumeration area. We found that synthetic population variable distributions matched observed observations and followed expected spatial patterns. We outline a novel process to simulate a close-to-reality microdata census geolocated to realistic building locations in a low- or middle-income country setting to support spatial demographic research and survey methodological development while avoiding disclosure risk of individuals.

Suggested Citation

  • Dana R. Thomson & Lieke Kools & Warren C. Jochem, 2018. "Linking Synthetic Populations to Household Geolocations: A Demonstration in Namibia," Data, MDPI, vol. 3(3), pages 1-19, August.
  • Handle: RePEc:gam:jdataj:v:3:y:2018:i:3:p:30-:d:162841
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/3/3/30/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/3/3/30/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Günther Fink & Isabel Günther & Kenneth Hill, 2014. "Slum Residence and Child Health in Developing Countries," Demography, Springer;Population Association of America (PAA), vol. 51(4), pages 1175-1197, August.
    2. Robert Tanton, 2014. "A Review of Spatial Microsimulation Methods," International Journal of Microsimulation, International Microsimulation Association, vol. 7(1), pages 4-25.
    3. Andreas Alfons & Stefan Kraft & Matthias Templ & Peter Filzmoser, 2011. "Simulation of close-to-reality population data for household surveys with application to EU-SILC," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 20(3), pages 383-407, August.
    4. Templ, Matthias & Meindl, Bernhard & Kowarik, Alexander & Dupriez, Olivier, 2017. "Simulation of Synthetic Complex Data: The R Package simPop," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 79(i10).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Thomson, Dana R. & Stevens, Forrest R. & Chen, Robert & Yetman, Gregory & Sorichetta, Alessandro & Gaughan, Andrea E., 2022. "Improving the accuracy of gridded population estimates in cities and slums to monitor SDG 11: Evidence from a simulation study in Namibia," Land Use Policy, Elsevier, vol. 123(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Trond Husby & Olga Ivanova & Mark Thissen, 2018. "Simulating the Joint Distribution of Individuals, Households and Dwellings in Small Areas," International Journal of Microsimulation, International Microsimulation Association, vol. 11(2), pages 169-190.
    2. Sugata Bag & Suman Seth, 2016. "Understanding Standard of Living and Correlates in Slums - An Analysis Using Monetary Versus Multidimensional Approaches in Three Indian Cities," Working papers 263, Centre for Development Economics, Delhi School of Economics.
    3. Kevin Credit & Zander Arnao, 2023. "A method to derive small area estimates of linked commuting trips by mode from open source LODES and ACS data," Environment and Planning B, , vol. 50(3), pages 709-722, March.
    4. Laura B. Nolan, 2015. "Slum Definitions in Urban India: Implications for the Measurement of Health Inequalities," Population and Development Review, The Population Council, Inc., vol. 41(1), pages 59-84, March.
    5. Tom Wilson & Irina Grossman & Monica Alexander & Phil Rees & Jeromey Temple, 2022. "Methods for Small Area Population Forecasts: State-of-the-Art and Research Needs," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(3), pages 865-898, June.
    6. Shon, Huijoo, 2024. "Urbanicity and child health in 26 sub-Saharan African countries: Settlement type and its association with mortality and morbidity," Social Science & Medicine, Elsevier, vol. 340(C).
    7. Templ, Matthias & Meindl, Bernhard & Kowarik, Alexander & Dupriez, Olivier, 2017. "Simulation of Synthetic Complex Data: The R Package simPop," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 79(i10).
    8. MacTavish, Robert & Bixby, Honor & Cavanaugh, Alicia & Agyei-Mensah, Samuel & Bawah, Ayaga & Owusu, George & Ezzati, Majid & Arku, Raphael & Robinson, Brian & Schmidt, Alexandra M. & Baumgartner, Jill, 2023. "Identifying deprived “slum” neighbourhoods in the Greater Accra Metropolitan Area of Ghana using census and remote sensing data," World Development, Elsevier, vol. 167(C).
    9. Roszka Wojciech, 2019. "Spatial Microsimulation Of Personal Income In Poland At The Level Of Subregions," Statistics in Transition New Series, Statistics Poland, vol. 20(3), pages 133-153, September.
    10. Alberto Vitalini & Simona Ballabio & Flavio Verrecchia, 2024. "Rebuilding a pseudo population register for estimating physical vulnerability at the local level: a case study of spatial microsimulation in Sondrio," RIEDS - Rivista Italiana di Economia, Demografia e Statistica - The Italian Journal of Economic, Demographic and Statistical Studies, SIEDS Societa' Italiana di Economia Demografia e Statistica, vol. 78(1), pages 55-64, January-M.
    11. Kesztenbaum, Lionel & Rosenthal, Jean-Laurent, 2017. "Sewers’ diffusion and the decline of mortality: The case of Paris, 1880–1914," Journal of Urban Economics, Elsevier, vol. 98(C), pages 174-186.
    12. Sebastian Dräger & Johannes Kopp & Ralf Münnich & Simon Schmaus, 2022. "Die zukünftige Entwicklung der Grundschulversorgung im Kontext ausgewählter Wanderungsszenarien [The future development of primary school demand in the context of selected migration scenarios]," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 16(1), pages 51-77, March.
    13. Payal Hathi & Sabrina Haque & Lovey Pant & Diane Coffey & Dean Spears, 2017. "Place and Child Health: The Interaction of Population Density and Sanitation in Developing Countries," Demography, Springer;Population Association of America (PAA), vol. 54(1), pages 337-360, February.
    14. Templ Matthias, 2015. "Quality Indicators for Statistical Disclosure Methods: A Case Study on the Structure of Earnings Survey," Journal of Official Statistics, Sciendo, vol. 31(4), pages 737-761, December.
    15. Ashira Menashe-Oren, 2020. "Migrant-based youth bulges and social conflict in urban sub-Saharan Africa," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 42(3), pages 57-98.
    16. Amanda M. Y. Chu & Benson S. Y. Lam & Agnes Tiwari & Mike K. P. So, 2019. "An Empirical Study of Applying Statistical Disclosure Control Methods to Public Health Research," IJERPH, MDPI, vol. 16(22), pages 1-17, November.
    17. Robert Tanton, 2018. "Spatial Microsimulation: Developments and Potential Future Directions," International Journal of Microsimulation, International Microsimulation Association, vol. 11(1), pages 143-161.
    18. Olufunke Fayehun & Motunrayo Ajisola & Olalekan Uthman & Oyinlola Oyebode & Abiola Oladejo & Eme Owoaje & Olalekan Taiwo & Oladoyin Odubanjo & Bronwyn Harris & Richard Lilford & Akinyinka Omigbodun & , 2022. "A contextual exploration of healthcare service use in urban slums in Nigeria," PLOS ONE, Public Library of Science, vol. 17(2), pages 1-14, February.
    19. Claus C. Pörtner & Yu-hsuan Su, 2018. "Differences in Child Health Across Rural, Urban, and Slum Areas: Evidence From India," Demography, Springer;Population Association of America (PAA), vol. 55(1), pages 223-247, February.
    20. Stefania Lovo & Samantha Rawlings, 2021. "Garbage in, garbage out: the impact of e-waste dumping sites on early child health," Economics Discussion Papers em-dp2021-07, Department of Economics, University of Reading.

    More about this item

    Keywords

    simulation; census; simPop; LMIC;
    All these keywords.

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:3:y:2018:i:3:p:30-:d:162841. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.