IDEAS home Printed from https://ideas.repec.org/a/spr/astaws/v16y2022i1d10.1007_s11943-021-00298-9.html
   My bibliography  Save this article

Kernel density smoothing of composite spatial data on administrative area level
[Die Glättung räumlicher Datensätze auf administrativen Flächen]

Author

Listed:
  • Kerstin Erfurth

    (Amt für Statistik Berlin-Brandenburg)

  • Marcus Groß

    (INWT Statistics GmbH)

  • Ulrich Rendtel

    (Freie Universität Berlin)

  • Timo Schmid

    (Universität Bamberg)

Abstract

Composite spatial data on administrative area level are often presented by maps. The aim is to detect regional differences in the concentration of subpopulations, like elderly persons, ethnic minorities, low-educated persons, voters of a political party or persons with a certain disease. Thematic collections of such maps are presented in different atlases. The standard presentation is by Choropleth maps where each administrative unit is represented by a single value. These maps can be criticized under three aspects: the implicit assumption of a uniform distribution within the area, the instability of the resulting map with respect to a change of the reference area and the discontinuities of the maps at the borderlines of the reference areas which inhibit the detection of regional clusters. In order to address these problems we use a density approach in the construction of maps. This approach does not enforce a local uniform distribution. It does not depend on a specific choice of area reference system and there are no discontinuities in the displayed maps. A standard estimation procedure of densities are Kernel density estimates. However, these estimates need the geo-coordinates of the single units which are not at disposal as we have only access to the aggregates of some area system. To overcome this hurdle, we use a statistical simulation concept. This can be interpreted as a Simulated Expectation Maximisation (SEM) algorithm of Celeux et al (1996). We simulate observations from the current density estimates which are consistent with the aggregation information (S-step). Then we apply the Kernel density estimator to the simulated sample which gives the next density estimate (E-Step). This concept has been first applied for grid data with rectangular areas, see Groß et al (2017), for the display of ethnic minorities. In a second application we demonstrated the use of this approach for the so-called “change of support” (Bradley et al 2016) problem. Here Groß et al (2020) used the SEM algorithm to recalculate case numbers between non-hierarchical administrative area systems. Recently Rendtel et al (2021) applied the SEM algorithm to display spatial-temporal clusters of Corona infections in Germany. Here we present three modifications of the basic SEM algorithm: 1) We introduce a boundary correction which removes the underestimation of kernel density estimates at the borders of the population area. 2) We recognize unsettled areas, like lakes, parks and industrial areas, in the computation of the kernel density. 3) We adapt the SEM algorithm for the computation of local percentages which are important especially in voting analysis. We evaluate our approach against several standard maps by means of the local voting register with known addresses. In the empirical part we apply our approach for the display of voting results for the 2016 election of the Berlin parliament. We contrast our results against Choropleth maps and show new possibilities for reporting spatial voting results.

Suggested Citation

  • Kerstin Erfurth & Marcus Groß & Ulrich Rendtel & Timo Schmid, 2022. "Kernel density smoothing of composite spatial data on administrative area level [Die Glättung räumlicher Datensätze auf administrativen Flächen]," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 16(1), pages 25-49, March.
  • Handle: RePEc:spr:astaws:v:16:y:2022:i:1:d:10.1007_s11943-021-00298-9
    DOI: 10.1007/s11943-021-00298-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11943-021-00298-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11943-021-00298-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jonathan R. Bradley & Christopher K. Wikle & Scott H. Holan, 2016. "Bayesian Spatial Change of Support for Count-Valued Survey Data With Application to the American Community Survey," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 472-487, April.
    2. Marcus Groß & Ulrich Rendtel & Timo Schmid & Sebastian Schmon & Nikos Tzavidis, 2017. "Estimating the density of ethnic minorities and aged people in Berlin: multivariate kernel density estimation applied to sensitive georeferenced administrative data protected via measurement error," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(1), pages 161-183, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Groß Marcus & Kreutzmann Ann-Kristin & Rendtel Ulrich & Schmid Timo & Tzavidis Nikos, 2020. "Switching Between Different Non-Hierachical Administrative Areas via Simulated Geo-Coordinates: A Case Study for Student Residents in Berlin," Journal of Official Statistics, Sciendo, vol. 36(2), pages 297-314, June.
    2. Groß Marcus & Kreutzmann Ann-Kristin & Rendtel Ulrich & Schmid Timo & Tzavidis Nikos, 2020. "Switching Between Different Non-Hierachical Administrative Areas via Simulated Geo-Coordinates: A Case Study for Student Residents in Berlin," Journal of Official Statistics, Sciendo, vol. 36(2), pages 297-314, June.
    3. K. Shuvo Bakar & Nicholas Biddle & Philip Kokic & Huidong Jin, 2020. "A Bayesian spatial categorical model for prediction to overlapping geographical areas in sample surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 535-563, February.
    4. Paul Makdissi & Walid Marrouch & Myra Yazbeck, 2022. "Monitoring Poverty in a Data Deprived Environment: The Case of Lebanon," Working Papers 2022-014, Human Capital and Economic Opportunity Working Group.
    5. Marco Gramatica & Peter Congdon & Silvia Liverani, 2021. "Bayesian modelling for spatially misaligned health areal data: A multiple membership approach," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 645-666, June.
    6. Groß, Marcus & Rendtel, Ulrich & Schmid, Timo & Bömermann, Hartmut & Erfurth, Kerstin, 2018. "Simulated geo-coordinates as a tool for map-based regional analysis," Discussion Papers 2018/3, Free University Berlin, School of Business & Economics.
    7. Ulrich Rendtel & Milo Ruhanen, 2018. "Die Konstruktion von Dienstleistungskarten mit Open Data am Beispiel des lokalen Bedarfs an Kinderbetreuung in Berlin [The construction of service maps with open data: the case of local need for ch," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 12(3), pages 271-284, December.
    8. Daniel H. Weinberg & John M. Abowd & Robert F. Belli & Noel Cressie & David C. Folch & Scott H. Holan & Margaret C. Levenstein & Kristen M. Olson & Jerome P. Reiter & Matthew D. Shapiro & Jolene Smyth, 2017. "Effects of a Government-Academic Partnership: Has the NSF-Census Bureau Research Network Helped Improve the U.S. Statistical System?," Working Papers 17-59r, Center for Economic Studies, U.S. Census Bureau.
    9. Paul Walter & Marcus Groß & Timo Schmid & Nikos Tzavidis, 2021. "Domain prediction with grouped income data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1501-1523, October.
    10. Walter, Paul & Weimer, Katja, 2018. "Estimating poverty and inequality indicators using interval censored income data from the German microcensus," Discussion Papers 2018/10, Free University Berlin, School of Business & Economics.
    11. Duncan Lee & Craig Anderson, 2023. "Delivering spatially comparable inference on the risks of multiple severities of respiratory disease from spatially misaligned disease count data," Biometrics, The International Biometric Society, vol. 79(3), pages 2691-2704, September.
    12. Nelson B. Walker & Trevor J. Hefley & Daniel P. Walsh, 2020. "Bias correction of bounded location error in binary data," Biometrics, The International Biometric Society, vol. 76(2), pages 530-539, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:astaws:v:16:y:2022:i:1:d:10.1007_s11943-021-00298-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.