
Using machine learning to identify spatial market segments. A reproducible study of major Spanish markets

Author

Listed:
  • David Rey-Blanco
  • Pelayo Arbués
  • Fernando A. López
  • Antonio Páez

Abstract

Identifying market segments can improve the fit and performance of hedonic price models. In this paper, we present a novel approach to market segmentation based on machine learning techniques. Concretely, we propose a two-stage process. In the first stage, classification trees with interactive basis functions are used to identify non-orthogonal and non-linear submarket boundaries. The resulting market segments are then introduced into a spatial econometric model to obtain hedonic estimates of the implicit prices of interest. The proposed approach is illustrated with a reproducible example of three major Spanish real estate markets. We conclude that identifying market sub-segments with the proposed approach is relatively simple, and we demonstrate the potential of the modelling strategy to produce better models and more accurate predictions.
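
The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes synthetic data on a 10 x 10 grid, a hypothetical basis of u, v, u + v, u - v and u * v, and a small hand-written regression tree. For brevity, the second stage is an ordinary log-log regression with segment dummies (estimated by within-segment demeaning) rather than a spatial econometric model. All function and variable names are illustrative.

```python
import math

def basis(u, v):
    # Assumed interactive basis: a threshold on u + v or u - v is an
    # oblique (non-orthogonal) boundary in the original (u, v) plane.
    return [u, v, u + v, u - v, u * v]

def sse(values):
    # Sum of squared deviations from the mean (0 for an empty list).
    if not values:
        return 0.0
    m = sum(values) / len(values)
    return sum((x - m) ** 2 for x in values)

def best_split(X, y, idx):
    # Search every feature and midpoint threshold for the largest SSE reduction.
    parent = sse([y[i] for i in idx])
    best = None
    for f in range(len(X[0])):
        levels = sorted({X[i][f] for i in idx})
        for lo, hi in zip(levels, levels[1:]):
            t = (lo + hi) / 2
            left = [y[i] for i in idx if X[i][f] <= t]
            right = [y[i] for i in idx if X[i][f] > t]
            gain = parent - sse(left) - sse(right)
            if gain > 1e-9 and (best is None or gain > best[0]):
                best = (gain, f, t)
    return best

def grow(X, y, idx, depth, labels, counter):
    # Recursively split; each leaf becomes one market segment label.
    split = best_split(X, y, idx) if depth > 0 else None
    if split is None:
        for i in idx:
            labels[i] = counter[0]
        counter[0] += 1
        return
    _, f, t = split
    grow(X, y, [i for i in idx if X[i][f] <= t], depth - 1, labels, counter)
    grow(X, y, [i for i in idx if X[i][f] > t], depth - 1, labels, counter)

def hedonic_with_segments(x, y, labels):
    # OLS of y on x with one intercept per segment (within transformation).
    groups = {}
    for i, g in enumerate(labels):
        groups.setdefault(g, []).append(i)
    num = den = 0.0
    means = {}
    for g, members in groups.items():
        mx = sum(x[i] for i in members) / len(members)
        my = sum(y[i] for i in members) / len(members)
        means[g] = (mx, my)
        num += sum((x[i] - mx) * (y[i] - my) for i in members)
        den += sum((x[i] - mx) ** 2 for i in members)
    slope = num / den
    return slope, {g: my - slope * mx for g, (mx, my) in means.items()}

# Synthetic market: unit price is 2500 beyond a diagonal boundary, else 2000.
coords, area, price = [], [], []
for i in range(10):
    for j in range(10):
        a = 50 + 10 * ((2 * i + 3 * j) % 5)        # floor area in m^2
        unit = 2500.0 if i + j >= 10 else 2000.0   # price per m^2
        coords.append((float(i), float(j)))
        area.append(a)
        price.append(a * unit)

# Stage 1: tree on location features only, targeting price per m^2.
X = [basis(u, v) for u, v in coords]
unit_price = [p / a for p, a in zip(price, area)]
labels = [None] * len(X)
grow(X, unit_price, list(range(len(X))), 3, labels, [0])

# Stage 2: log-log hedonic regression with the segments as dummies.
slope, intercepts = hedonic_with_segments(
    [math.log(a) for a in area], [math.log(p) for p in price], labels)
print(len(set(labels)), slope, intercepts)
```

In this toy setting the tree can only recover the diagonal boundary through the u + v feature, which is the role the interactive basis plays; a tree restricted to u and v alone would need many axis-aligned splits to approximate the same line.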

Suggested Citation

  • David Rey-Blanco & Pelayo Arbués & Fernando A. López & Antonio Páez, 2024. "Using machine learning to identify spatial market segments. A reproducible study of major Spanish markets," Environment and Planning B, vol. 51(1), pages 89-108, January.
  • Handle: RePEc:sae:envirb:v:51:y:2024:i:1:p:89-108
    DOI: 10.1177/23998083231166952

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/23998083231166952
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Juergen Deppner & Marcelo Cajias, 2024. "Accounting for Spatial Autocorrelation in Algorithm-Driven Hedonic Models: A Spatial Cross-Validation Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 68(2), pages 235-273, February.
    2. Kuethe, Todd H. & Foster, Kenneth A. & Florax, Raymond J.G.M., 2008. "A Spatial Hedonic Model with Time-Varying Parameters: A New Method Using Flexible Least Squares," 2008 Annual Meeting, July 27-29, 2008, Orlando, Florida 6306, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    3. Füss, Roland & Koller, Jan A., 2016. "The role of spatial and temporal structure for residential rent predictions," International Journal of Forecasting, Elsevier, vol. 32(4), pages 1352-1368.
    4. Delores Conway & Christina Li & Jennifer Wolch & Christopher Kahle & Michael Jerrett, 2010. "A Spatial Autocorrelation Approach for Examining the Effects of Urban Greenspace on Residential Property Values," The Journal of Real Estate Finance and Economics, Springer, vol. 41(2), pages 150-169, August.
    5. S. Wong & C. Yiu & K. Chau, 2013. "Trading Volume-Induced Spatial Autocorrelation in Real Estate Prices," The Journal of Real Estate Finance and Economics, Springer, vol. 46(4), pages 596-608, May.
    6. Xiaolong Liu, 2013. "Spatial and Temporal Dependence in House Price Prediction," The Journal of Real Estate Finance and Economics, Springer, vol. 47(2), pages 341-369, August.
    7. Wieser, Robert, 2009. "Parameterstabilität in hedonischen Bodenpreismodellen [Stability of Parameters in Hedonic Urban Land Price Models]," MPRA Paper 65859, University Library of Munich, Germany.
    8. Takafumi Kato, 2013. "Usefulness of the Information Contained in the Prediction Sample for the Spatial Error Model," The Journal of Real Estate Finance and Economics, Springer, vol. 47(1), pages 169-195, July.
    9. Rocco Curto & Elena Fregonara, 2019. "Monitoring and Analysis of the Real Estate Market in a Social Perspective: Results from the Turin’s (Italy) Experience," Sustainability, MDPI, vol. 11(11), pages 1-22, June.
    10. Bing Zhu & Roland Füss & Nico Rottke, 2011. "The Predictive Power of Anisotropic Spatial Correlation Modeling in Housing Prices," The Journal of Real Estate Finance and Economics, Springer, vol. 42(4), pages 542-565, May.
    11. Ingrid Nappi‐Choulet Pr. & Tristan‐Pierre Maury, 2009. "A Spatiotemporal Autoregressive Price Index for the Paris Office Property Market," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 37(2), pages 305-340, June.
    12. Liv Osland & John Östh & Viggo Nordvik, 2022. "House price valuation of environmental amenities: An application of GIS‐derived data," Regional Science Policy & Practice, Wiley Blackwell, vol. 14(4), pages 939-959, August.
    13. Silke Hüttel & Simon Jetzinger & Martin Odening, 2014. "Forced Sales and Farmland Prices," Land Economics, University of Wisconsin Press, vol. 90(3), pages 395-410.
    14. Kiefer, Hua, 2011. "The house price determination process: Rational expectations with a spatial context," Journal of Housing Economics, Elsevier, vol. 20(4), pages 249-266.
    15. Steven C. Bourassa & Eva Cantoni & Martin Hoesli, 2005. "Spatial Dependence, Housing Submarkets, and House Prices," FAME Research Paper Series rp151, International Center for Financial Asset Management and Engineering.
    16. Dieudonné Tchuente & Serge Nyawa, 2022. "Real estate price estimation in French cities using geocoding and machine learning," Annals of Operations Research, Springer, vol. 308(1), pages 571-608, January.
    17. Jamie Bologna Pavlik & Yang Zhou, 2023. "Are historic districts a backdoor for segregation? Yes and no," Contemporary Economic Policy, Western Economic Association International, vol. 41(3), pages 415-434, July.
    18. David Maddison, 2009. "A Spatio‐temporal Model of Farmland Values," Journal of Agricultural Economics, Wiley Blackwell, vol. 60(1), pages 171-189, February.
    19. Steven Bourassa & Eva Cantoni & Martin Hoesli, 2007. "Spatial Dependence, Housing Submarkets, and House Price Prediction," The Journal of Real Estate Finance and Economics, Springer, vol. 35(2), pages 143-160, August.
    20. Julie Le Gallo & Fernando A. López & Coro Chasco, 2020. "Testing for spatial group-wise heteroskedasticity in spatial autocorrelation regression models: Lagrange multiplier scan tests," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 64(2), pages 287-312, April.
