IDEAS home Printed from https://ideas.repec.org/a/sae/envirb/v51y2024i1p89-108.html
   My bibliography  Save this article

Using machine learning to identify spatial market segments. A reproducible study of major Spanish markets

Author

Listed:
  • David Rey-Blanco
  • Pelayo Arbués
  • Fernando A. López
  • Antonio Páez

Abstract

Identifying market segments can improve the fit and performance of hedonic price models. In this paper, we present a novel approach to market segmentation based on the use of machine learning techniques. Concretely, we propose a two-stage process. In the first stage, classification trees with interactive basis functions are used to identify non-orthogonal and non-linear submarket boundaries. The market segments that result are then introduced in a spatial econometric model to obtain hedonic estimates of the implicit prices of interest. The proposed approach is illustrated with a reproducible example of three major Spanish real estate markets. We conclude that identifying market sub-segments using the approach proposed is a relatively simple and demonstrate the potential of the proposed modelling strategy to produce better models and more accurate predictions.

Suggested Citation

  • David Rey-Blanco & Pelayo Arbués & Fernando A. López & Antonio Páez, 2024. "Using machine learning to identify spatial market segments. A reproducible study of major Spanish markets," Environment and Planning B, , vol. 51(1), pages 89-108, January.
  • Handle: RePEc:sae:envirb:v:51:y:2024:i:1:p:89-108
    DOI: 10.1177/23998083231166952
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/23998083231166952
    Download Restriction: no

    File URL: https://libkey.io/10.1177/23998083231166952?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Stefan Wager & Susan Athey, 2018. "Estimation and Inference of Heterogeneous Treatment Effects using Random Forests," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
    2. Michael C Knaus & Michael Lechner & Anthony Strittmatter, 2021. "Machine learning estimation of heterogeneous causal effects: Empirical Monte Carlo evidence," The Econometrics Journal, Royal Economic Society, vol. 24(1), pages 134-161.
    3. Pace, R Kelley & Barry, Ronald & Clapp, John M. & Rodriquez, Mauricio, 1998. "Spatiotemporal Autoregressive Models of Neighborhood Effects," The Journal of Real Estate Finance and Economics, Springer, vol. 17(1), pages 15-33, July.
    4. Pace, R Kelley & Gilley, Otis W, 1997. "Using the Spatial Configuration of the Data to Improve Estimation," The Journal of Real Estate Finance and Economics, Springer, vol. 14(3), pages 333-340, May.
    5. Nowak, Adam & Sayago-Gomez, Juan, 2018. "Homeowner preferences after September 11th, a microdata approach," Regional Science and Urban Economics, Elsevier, vol. 70(C), pages 330-351.
    6. José-María Montero-Lorenzo & Beatriz Larraz-Iribas & Antonio Páez, 2009. "Estimating commercial property prices: an application of cokriging with housing prices as ancillary information," Journal of Geographical Systems, Springer, vol. 11(4), pages 407-425, December.
    7. Antonio Páez, 2009. "Recent research in spatial real estate hedonic analysis," Journal of Geographical Systems, Springer, vol. 11(4), pages 311-316, December.
    8. David C. Wheeler & Antonio Páez & Jamie Spinney & Lance A. Waller, 2014. "A Bayesian approach to hedonic price analysis," Papers in Regional Science, Wiley Blackwell, vol. 93(3), pages 663-683, August.
    9. Dani Arribas-Bel & Mark Green & Francisco Rowe & Alex Singleton, 2021. "Open data products-A framework for creating valuable analysis ready data," Journal of Geographical Systems, Springer, vol. 23(4), pages 497-514, October.
    10. Steven Bourassa & Eva Cantoni & Martin Hoesli, 2007. "Spatial Dependence, Housing Submarkets, and House Price Prediction," The Journal of Real Estate Finance and Economics, Springer, vol. 35(2), pages 143-160, August.
    11. Usman Hamza & Lizam Mohd & Adekunle Muhammad Usman, 2020. "Property Price Modelling, Market Segmentation and Submarket Classifications: A Review," Real Estate Management and Valuation, Sciendo, vol. 28(3), pages 24-35, September.
    12. Thomas G. Thibodeau, 2003. "Marking Single-Family Property Values to Market," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 31(1), pages 1-22, March.
    13. Fernando A. López & Román Mínguez & Jesús Mur, 2020. "ML versus IV estimates of spatial SUR models: evidence from the case of Airbnb in Madrid urban area," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 64(2), pages 313-347, April.
    14. Hu, Lirong & He, Shenjing & Han, Zixuan & Xiao, He & Su, Shiliang & Weng, Min & Cai, Zhongliang, 2019. "Monitoring housing rental prices based on social media:An integrated approach of machine-learning algorithms and hedonic modeling to inform equitable housing policies," Land Use Policy, Elsevier, vol. 82(C), pages 657-673.
    15. Füss, Roland & Koller, Jan A., 2016. "The role of spatial and temporal structure for residential rent predictions," International Journal of Forecasting, Elsevier, vol. 32(4), pages 1352-1368.
    16. von Graevenitz, Kathrine, 2018. "The amenity cost of road noise," Journal of Environmental Economics and Management, Elsevier, vol. 90(C), pages 1-22.
    17. Fernando A. López & Coro Chasco & Julie Le Gallo, 2015. "Exploring scan methods to test spatial structure with an application to housing prices in Madrid," Papers in Regional Science, Wiley Blackwell, vol. 94(2), pages 317-346, June.
    18. Brad R. Humphreys & Adam Nowak & Yang Zhou, 2019. "Superstition and real estate prices: transaction-level evidence from the US housing market," Applied Economics, Taylor & Francis Journals, vol. 51(26), pages 2818-2841, June.
    19. Chasco, Coro & Le Gallo, Julie & López, Fernando A., 2018. "A scan test for spatial groupwise heteroscedasticity in cross-sectional models with an application on houses prices in Madrid," Regional Science and Urban Economics, Elsevier, vol. 68(C), pages 226-238.
    20. Antonio Páez, 2021. "Open spatial sciences: an introduction," Journal of Geographical Systems, Springer, vol. 23(4), pages 467-476, October.
    21. Robert W. Paterson & Kevin J. Boyle, 2002. "Out of Sight, Out of Mind? Using GIS to Incorporate Visibility in Hedonic Property Value Models," Land Economics, University of Wisconsin Press, vol. 78(3), pages 417-425.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Juergen Deppner & Marcelo Cajias, 2024. "Accounting for Spatial Autocorrelation in Algorithm-Driven Hedonic Models: A Spatial Cross-Validation Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 68(2), pages 235-273, February.
    2. Kuethe, Todd H. & Foster, Kenneth A. & Florax, Raymond J.G.M., 2008. "A Spatial Hedonic Model with Time-Varying Parameters: A New Method Using Flexible Least Squares," 2008 Annual Meeting, July 27-29, 2008, Orlando, Florida 6306, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    3. Füss, Roland & Koller, Jan A., 2016. "The role of spatial and temporal structure for residential rent predictions," International Journal of Forecasting, Elsevier, vol. 32(4), pages 1352-1368.
    4. Delores Conway & Christina Li & Jennifer Wolch & Christopher Kahle & Michael Jerrett, 2010. "A Spatial Autocorrelation Approach for Examining the Effects of Urban Greenspace on Residential Property Values," The Journal of Real Estate Finance and Economics, Springer, vol. 41(2), pages 150-169, August.
    5. S. Wong & C. Yiu & K. Chau, 2013. "Trading Volume-Induced Spatial Autocorrelation in Real Estate Prices," The Journal of Real Estate Finance and Economics, Springer, vol. 46(4), pages 596-608, May.
    6. Davide Viviano & Jelena Bradic, 2019. "Synthetic learner: model-free inference on treatments over time," Papers 1904.01490, arXiv.org, revised Aug 2022.
    7. Daniel Goller, 2023. "Analysing a built-in advantage in asymmetric darts contests using causal machine learning," Annals of Operations Research, Springer, vol. 325(1), pages 649-679, June.
    8. Michael C Knaus, 2022. "Double machine learning-based programme evaluation under unconfoundedness [Econometric methods for program evaluation]," The Econometrics Journal, Royal Economic Society, vol. 25(3), pages 602-627.
    9. Denis Fougère & Nicolas Jacquemet, 2020. "Policy Evaluation Using Causal Inference Methods," Working Papers hal-03455978, HAL.
    10. Cockx, Bart & Lechner, Michael & Bollens, Joost, 2023. "Priority to unemployed immigrants? A causal machine learning evaluation of training in Belgium," Labour Economics, Elsevier, vol. 80(C).
    11. Daniel Jacob, 2021. "CATE meets ML -- The Conditional Average Treatment Effect and Machine Learning," Papers 2104.09935, arXiv.org, revised Apr 2021.
    12. Daniel Boller & Michael Lechner & Gabriel Okasa, 2021. "The Effect of Sport in Online Dating: Evidence from Causal Machine Learning," Papers 2104.04601, arXiv.org.
    13. Phillip Heiler & Michael C. Knaus, 2021. "Effect or Treatment Heterogeneity? Policy Evaluation with Aggregated and Disaggregated Treatments," Papers 2110.01427, arXiv.org, revised Aug 2023.
    14. Goller, Daniel & Lechner, Michael & Moczall, Andreas & Wolff, Joachim, 2020. "Does the estimation of the propensity score by machine learning improve matching estimation? The case of Germany's programmes for long term unemployed," Labour Economics, Elsevier, vol. 65(C).
    15. Verbeke, Wouter & Olaya, Diego & Guerry, Marie-Anne & Van Belle, Jente, 2023. "To do or not to do? Cost-sensitive causal classification with individual treatment effect estimates," European Journal of Operational Research, Elsevier, vol. 305(2), pages 838-852.
    16. Xiaolong Liu, 2013. "Spatial and Temporal Dependence in House Price Prediction," The Journal of Real Estate Finance and Economics, Springer, vol. 47(2), pages 341-369, August.
    17. Daniel Goller & Tamara Harrer & Michael Lechner & Joachim Wolff, 2021. "Active labour market policies for the long-term unemployed: New evidence from causal machine learning," Papers 2106.10141, arXiv.org, revised May 2023.
    18. Wieser, Robert, 2009. "Parameterstabilität in hedonischen Bodenpreismodellen [Stability of Parameters in Hedonic Urban Land Price Models]," MPRA Paper 65859, University Library of Munich, Germany.
    19. Takafumi Kato, 2013. "Usefulness of the Information Contained in the Prediction Sample for the Spatial Error Model," The Journal of Real Estate Finance and Economics, Springer, vol. 47(1), pages 169-195, July.
    20. Rocco Curto & Elena Fregonara, 2019. "Monitoring and Analysis of the Real Estate Market in a Social Perspective: Results from the Turin’s (Italy) Experience," Sustainability, MDPI, vol. 11(11), pages 1-22, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:envirb:v:51:y:2024:i:1:p:89-108. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.