IDEAS home Printed from https://ideas.repec.org/a/nas/journl/v116y2019p15447-15452.html
   My bibliography  Save this article

Predicting neighborhoods’ socioeconomic attributes using restaurant data

Author

Listed:
  • Lei Dong

    (Senseable City Lab, Department of Urban Studies and Planning, Massachusetts Institute of Technology, Cambridge, MA 02139; China Future City Lab and Center for Real Estate, Department of Urban Studies and Planning, Massachusetts Institute of Technology, Cambridge, MA 02139)

  • Carlo Ratti

    (Senseable City Lab, Department of Urban Studies and Planning, Massachusetts Institute of Technology, Cambridge, MA 02139)

  • Siqi Zheng

    (China Future City Lab and Center for Real Estate, Department of Urban Studies and Planning, Massachusetts Institute of Technology, Cambridge, MA 02139)

Abstract

Accessing high-resolution, timely socioeconomic data such as data on population, employment, and enterprise activity at the neighborhood level is critical for social scientists and policy makers to design and implement location-based policies. However, in many developing countries or cities, reliable local-scale socioeconomic data remain scarce. Here, we show an easily accessible and timely updated location attribute—restaurant—can be used to accurately predict a range of socioeconomic attributes of urban neighborhoods. We merge restaurant data from an online platform with 3 microdatasets for 9 Chinese cities. Using features extracted from restaurants, we train machine-learning models to estimate daytime and nighttime population, number of firms, and consumption level at various spatial resolutions. The trained model can explain 90 to 95% of the variation of those attributes across neighborhoods in the test dataset. We analyze the tradeoff between accuracy, spatial resolution, and number of training samples, as well as the heterogeneity of the predicted results across different spatial locations, demographics, and firm industries. Finally, we demonstrate the cross-city generality of this method by training the model in one city and then applying it directly to other cities. The transferability of this restaurant model can help bridge data gaps between cities, allowing all cities to enjoy big data and algorithm dividends.

Suggested Citation

  • Lei Dong & Carlo Ratti & Siqi Zheng, 2019. "Predicting neighborhoods’ socioeconomic attributes using restaurant data," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 116(31), pages 15447-15452, July.
  • Handle: RePEc:nas:journl:v:116:y:2019:p:15447-15452
    as

    Download full text from publisher

    File URL: http://www.pnas.org/content/116/31/15447.full
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Li Tian & Xiaoyan Shen, 2024. "Spatial patterns and their influencing factors for China’s catering industry," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-12, December.
    2. Tobias Cagala & Ulrich Glogowsky & Johannes Rincke & Anthony Strittmatter, 2021. "Optimal Targeting in Fundraising: A Machine-Learning Approach," Economics working papers 2021-08, Department of Economics, Johannes Kepler University Linz, Austria.
    3. Anqi Zhang & Weifeng Li & Jiayu Wu & Jian Lin & Jianqun Chu & Chang Xia, 2021. "How can the urban landscape affect urban vitality at the street block level? A case study of 15 metropolises in China," Environment and Planning B, , vol. 48(5), pages 1245-1262, June.
    4. Tobias Cagala & Ulrich Glogowsky & Johannes Rincke & Anthony Strittmatter, 2021. "Optimal Targeting in Fundraising: A Causal Machine-Learning Approach," Papers 2103.10251, arXiv.org, revised Sep 2021.
    5. Xiaofan Liang & Clio Andris, 2022. "Measuring McCities: Landscapes of chain and independent restaurants in the United States," Environment and Planning B, , vol. 49(2), pages 585-602, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nas:journl:v:116:y:2019:p:15447-15452. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Eric Cain (email available below). General contact details of provider: http://www.pnas.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.