IDEAS home Printed from https://ideas.repec.org/a/eee/juecon/v125y2021ics0094119019301068.html
   My bibliography  Save this article

Identifying urban areas by combining human judgment and machine learning: An application to India

Author

Listed:
  • Galdo, Virgilio
  • Li, Yue
  • Rama, Martin

Abstract

We propose a methodology for identifying urban areas that combines subjective assessments with machine learning, and we apply it to India, a country where several studies see the official urbanization rate as an under-estimate. For a representative sample of cities, towns and villages, as administratively defined, we rely on human judgment of Google images to determine whether they are urban or rural in practice. We collect judgments across four groups of assessors, differing in their familiarity with India and with urban issues, following two different protocols. We then combine the judgment-based classification with data from the population census and from satellite imagery to predict the urban status of the sample. The Logit model, and LASSO and random forests methods, are applied. These approaches are then used to decide whether each of the out-of-sample administrative units in India is urban or rural in practice. We do not find that India is substantially more urban than officially claimed. However, there are important differences at more disaggregated levels, with “other towns” and “census towns” being more rural, and some southern states more urban, than is officially claimed. The consistency of human judgment across assessors and protocols, the easy availability of crowd-sourcing, and the stability of predictions across approaches, suggest that the proposed methodology is a promising avenue for studying urban issues.

Suggested Citation

  • Galdo, Virgilio & Li, Yue & Rama, Martin, 2021. "Identifying urban areas by combining human judgment and machine learning: An application to India," Journal of Urban Economics, Elsevier, vol. 125(C).
  • Handle: RePEc:eee:juecon:v:125:y:2021:i:c:s0094119019301068
    DOI: 10.1016/j.jue.2019.103229
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0094119019301068
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jue.2019.103229?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Edward L. Glaeser & Andrew Hillis & Scott Duke Kominers & Michael Luca, 2016. "Crowdsourcing City Government: Using Tournaments to Improve Inspection Accuracy," American Economic Review, American Economic Association, vol. 106(5), pages 114-118, May.
    2. Naik, Nikhil & Kominers, Scott Duke & Raskar, Ramesh & Glaeser, Edward L. & Hidalgo, Cesar A., 2015. "Do People Shape Cities, or Do Cities Shape People? THe Co-evolution of Physical, Social and Economic Change in Five Major U.S. Cities," Working Paper Series 15-061, Harvard University, John F. Kennedy School of Government.
    3. Gabriel M. Ahlfeldt & Stephen J. Redding & Daniel M. Sturm & Nikolaus Wolf, 2015. "The Economics of Density: Evidence From the Berlin Wall," Econometrica, Econometric Society, vol. 83, pages 2127-2189, November.
    4. Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2018. "Big Data And Big Cities: The Promises And Limitations Of Improved Measures Of Urban Life," Economic Inquiry, Western Economic Association International, vol. 56(1), pages 114-137, January.
    5. Duranton, Gilles & Puga, Diego, 2015. "Urban Land Use," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 467-560, Elsevier.
    6. Jan Eeckhout, 2004. "Gibrat's Law for (All) Cities," American Economic Review, American Economic Association, vol. 94(5), pages 1429-1451, December.
    7. Ministry of Finance, Government of India,, 2017. "Economic Survey 2016-17," OUP Catalogue, Oxford University Press, edition 2, number 9780199477661.
    8. Masahisa Fujita & Paul Krugman & Anthony J. Venables, 2001. "The Spatial Economy: Cities, Regions, and International Trade," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262561476, December.
    9. Hernán D. Rozenfeld & Diego Rybski & Xavier Gabaix & Hernán A. Makse, 2011. "The Area and Population of Cities: New Insights from a Different Perspective on Cities," American Economic Review, American Economic Association, vol. 101(5), pages 2205-2225, August.
    10. Pierre-Philippe Combes & Gilles Duranton & Laurent Gobillon & Sébastien Roux, 2010. "Estimating Agglomeration Economies with History, Geology, and Worker Effects," NBER Chapters, in: Agglomeration Economics, pages 15-66, National Bureau of Economic Research, Inc.
    11. Philip Salesses & Katja Schechtner & César A Hidalgo, 2013. "The Collaborative Image of The City: Mapping the Inequality of Urban Perception," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-12, July.
    12. Duranton, Gilles & Puga, Diego, 2004. "Micro-foundations of urban agglomeration economies," Handbook of Regional and Urban Economics, in: J. V. Henderson & J. F. Thisse (ed.), Handbook of Regional and Urban Economics, edition 1, volume 4, chapter 48, pages 2063-2117, Elsevier.
    13. Scotchmer, Suzanne, 2002. "Local public goods and clubs," Handbook of Public Economics, in: A. J. Auerbach & M. Feldstein (ed.), Handbook of Public Economics, edition 1, volume 4, chapter 29, pages 1997-2042, Elsevier.
    14. Briant, A. & Combes, P.-P. & Lafourcade, M., 2010. "Dots to boxes: Do the size and shape of spatial units jeopardize economic geography estimations?," Journal of Urban Economics, Elsevier, vol. 67(3), pages 287-302, May.
    15. Nikhil Naik & Ramesh Raskar & César A. Hidalgo, 2016. "Cities Are Physical Too: Using Computer Vision to Measure the Quality and Impact of Urban Appearance," American Economic Review, American Economic Association, vol. 106(5), pages 128-132, May.
    16. Bruno S. Frey & Alois Stutzer, 2002. "What Can Economists Learn from Happiness Research?," Journal of Economic Literature, American Economic Association, vol. 40(2), pages 402-435, June.
    17. Marcy Burchfield & Henry G. Overman & Diego Puga & Matthew A. Turner, 2006. "Causes of Sprawl: A Portrait from Space," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 121(2), pages 587-633.
    18. Dingel, Jonathan I. & Miscio, Antonio & Davis, Donald R., 2021. "Cities, lights, and skills in developing economies," Journal of Urban Economics, Elsevier, vol. 125(C).
    19. Krugman, Paul, 1991. "Increasing Returns and Economic Geography," Journal of Political Economy, University of Chicago Press, vol. 99(3), pages 483-499, June.
    20. Galdo,Virgilio & Li,Yue & Rama,Martin G., 2018. "Identifying Urban Areas by Combining Data from the Ground and from Outer Space : An Application to India," Policy Research Working Paper Series 8628, The World Bank.
    21. Hamermesh, Daniel S & Biddle, Jeff E, 1994. "Beauty and the Labor Market," American Economic Review, American Economic Association, vol. 84(5), pages 1174-1194, December.
    22. Moshe Levy, 2009. "Gibrat's Law for (All) Cities: Comment," American Economic Review, American Economic Association, vol. 99(4), pages 1672-1675, September.
    23. Gilles Duranton & J. V. Henderson & William C. Strange (ed.), 2015. "Handbook of Regional and Urban Economics," Handbook of Regional and Urban Economics, Elsevier, edition 1, volume 5, number 5.
    24. Bosker, Maarten & Park, Jane & Roberts, Mark, 2021. "Definition matters. Metropolitan areas and agglomeration economies in a large-developing country," Journal of Urban Economics, Elsevier, vol. 125(C).
    25. Susan Athey, 2018. "The Impact of Machine Learning on Economics," NBER Chapters, in: The Economics of Artificial Intelligence: An Agenda, pages 507-547, National Bureau of Economic Research, Inc.
    26. Dave Donaldson & Adam Storeygard, 2016. "The View from Above: Applications of Satellite Data in Economics," Journal of Economic Perspectives, American Economic Association, vol. 30(4), pages 171-198, Fall.
    27. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    28. Gilles Duranton, 2015. "A Proposal to Delineate Metropolitan Areas in Colombia," Revista Desarrollo y Sociedad, Universidad de los Andes,Facultad de Economía, CEDE, August.
    29. Guy Michaels & Ferdinand Rauch & Stephen J. Redding, 2012. "Urbanization and Structural Transformation," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 127(2), pages 535-586.
    30. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    31. Brueckner, Jan K., 1987. "The structure of urban equilibria: A unified treatment of the muth-mills model," Handbook of Regional and Urban Economics, in: E. S. Mills (ed.), Handbook of Regional and Urban Economics, edition 1, volume 2, chapter 20, pages 821-845, Elsevier.
    32. Christopher D. Elvidge & Daniel Ziskin & Kimberly E. Baugh & Benjamin T. Tuttle & Tilottama Ghosh & Dee W. Pack & Edward H. Erwin & Mikhail Zhizhin, 2009. "A Fifteen Year Record of Global Natural Gas Flaring Derived from Satellite Data," Energies, MDPI, vol. 2(3), pages 1-28, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Puente-Ajovín, Miguel & Ramos, Arturo & Sanz-Gracia, Fernando & Arribas-Bel, Daniel, 2020. "How sensitive is city size distribution to the definition of city? The case of Spain," Economics Letters, Elsevier, vol. 197(C).
    2. Bosker, Maarten & Park, Jane & Roberts, Mark, 2021. "Definition matters. Metropolitan areas and agglomeration economies in a large-developing country," Journal of Urban Economics, Elsevier, vol. 125(C).
    3. de Bellefon, Marie-Pierre & Combes, Pierre-Philippe & Duranton, Gilles & Gobillon, Laurent & Gorin, Clément, 2021. "Delineating urban areas using building density," Journal of Urban Economics, Elsevier, vol. 125(C).
    4. Beyer, Robert C.M. & Franco-Bedoya, Sebastian & Galdo, Virgilio, 2021. "Examining the economic impact of COVID-19 in India through daily electricity consumption and nighttime light intensity," World Development, Elsevier, vol. 140(C).
    5. World Bank, 2020. "India Development Update, July 2020," World Bank Publications - Reports 34367, The World Bank Group.
    6. Imryoung Jeong & Hyunjoo Yang, 2021. "Using maps to predict economic activity," Papers 2112.13850, arXiv.org, revised Apr 2022.
    7. Wei Zou & Fei Yang, 2024. "Does City Shape Affect China's Economic Development?," China & World Economy, Institute of World Economics and Politics, Chinese Academy of Social Sciences, vol. 32(1), pages 21-56, January.
    8. García-Suaza, Andres & Varela, Daniela, 2024. "Nightlight, landcover and buildings: understanding intracity socioeconomic differences," Documentos de Trabajo 21025, Universidad del Rosario.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Baragwanath, Kathryn & Goldblatt, Ran & Hanson, Gordon & Khandelwal, Amit K., 2021. "Detecting urban markets with satellite imagery: An application to India," Journal of Urban Economics, Elsevier, vol. 125(C).
    2. Arribas-Bel, Daniel & Garcia-López, M.-À. & Viladecans-Marsal, Elisabet, 2021. "Building(s and) cities: Delineating urban areas with a machine learning algorithm," Journal of Urban Economics, Elsevier, vol. 125(C).
    3. Beltrán Tapia, Francisco J. & Díez-Minguela, Alfonso & Martinez-Galarraga, Julio, 2018. "Tracing the Evolution of Agglomeration Economies: Spain, 1860–1991," The Journal of Economic History, Cambridge University Press, vol. 78(1), pages 81-117, March.
    4. Bosker, Maarten & Park, Jane & Roberts, Mark, 2021. "Definition matters. Metropolitan areas and agglomeration economies in a large-developing country," Journal of Urban Economics, Elsevier, vol. 125(C).
    5. Stephen J. Redding, 2013. "Economic Geography: A Review of the Theoretical and Empirical Literature," Palgrave Macmillan Books, in: Daniel Bernhofen & Rod Falvey & David Greenaway & Udo Kreickemeier (ed.), Palgrave Handbook of International Trade, chapter 16, pages 497-531, Palgrave Macmillan.
    6. Bluhm, Richard & Krause, Melanie, 2022. "Top lights: Bright cities and their contribution to economic development," Journal of Development Economics, Elsevier, vol. 157(C).
    7. Licia Ferranna & Margherita Gerolimetto & Stefano Magrini, 2016. "Urban Governance Structure and Wage Disparities across US Metropolitan Areas," Working Papers 2016:26, Department of Economics, University of Venice "Ca' Foscari".
    8. Francisco J. Beltrán Tapia & Alfonso Díez-Minguela & Julio Martinez-Galarraga, 2021. "The shadow of cities: size, location and the spatial distribution of population," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 66(3), pages 729-753, June.
    9. Stephan Heblich & Stephen J Redding & Daniel M Sturm, 2020. "The Making of the Modern Metropolis: Evidence from London," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 135(4), pages 2059-2133.
    10. Stephen J. Redding, 2010. "The Empirics Of New Economic Geography," Journal of Regional Science, Wiley Blackwell, vol. 50(1), pages 297-311, February.
    11. Stephen J. Redding & Esteban Rossi-Hansberg, 2017. "Quantitative Spatial Economics," Annual Review of Economics, Annual Reviews, vol. 9(1), pages 21-58, September.
    12. Dingel, Jonathan I. & Miscio, Antonio & Davis, Donald R., 2021. "Cities, lights, and skills in developing economies," Journal of Urban Economics, Elsevier, vol. 125(C).
    13. Duranton, Gilles & Puga, Diego, 2014. "The Growth of Cities," Handbook of Economic Growth, in: Philippe Aghion & Steven Durlauf (ed.), Handbook of Economic Growth, edition 1, volume 2, chapter 5, pages 781-853, Elsevier.
    14. Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2018. "Big Data And Big Cities: The Promises And Limitations Of Improved Measures Of Urban Life," Economic Inquiry, Western Economic Association International, vol. 56(1), pages 114-137, January.
    15. Behrens, Kristian & Mion, Giordano & Murata, Yasusada & Suedekum, Jens, 2017. "Spatial frictions," Journal of Urban Economics, Elsevier, vol. 97(C), pages 40-70.
    16. Fabien Candau & Elisa Dienesch, 2015. "Spatial distribution of skills and regional trade integration," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 54(2), pages 451-488, March.
    17. Desmet, Klaus & Henderson, J. Vernon, 2015. "The Geography of Development Within Countries," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 1457-1517, Elsevier.
    18. Combes, Pierre-Philippe & Gobillon, Laurent, 2015. "The Empirics of Agglomeration Economies," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 247-348, Elsevier.
    19. Gilles Duranton & Diego Puga, 2023. "Urban Growth and Its Aggregate Implications," Econometrica, Econometric Society, vol. 91(6), pages 2219-2259, November.
    20. Achten, Sandra & Lessmann, Christian, 2020. "Spatial inequality, geography and economic activity," World Development, Elsevier, vol. 136(C).

    More about this item

    Keywords

    Urban area; Urbanization rate; Human judgment; Google images; Crowd sourcing; Population census; Satellite imagery; Machine learning;
    All these keywords.

    JEL classification:

    • O1 - Economic Development, Innovation, Technological Change, and Growth - - Economic Development
    • O18 - Economic Development, Innovation, Technological Change, and Growth - - Economic Development - - - Urban, Rural, Regional, and Transportation Analysis; Housing; Infrastructure
    • R1 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General Regional Economics

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:juecon:v:125:y:2021:i:c:s0094119019301068. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/622905 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.