IDEAS home Printed from https://ideas.repec.org/a/spr/ijsaem/v11y2020i2d10.1007_s13198-020-00946-3.html
   My bibliography  Save this article

Big data analytics predicting real estate prices

Author

Listed:
  • Archana Singh

    (Amity University)

  • Apoorva Sharma

    (Amity University)

  • Gaurav Dubey

    (ABES Engineering College)

Abstract

The enormous data generated on daily basis amounts to big data technologies. This large amounts of data have knowledge and hidden patterns. Real estate turning out to be another biggest application in big data. The emphasis of this paper is to map the process involved in taking large amounts of data to predict the price of a house in real estate. The real estate sounds to be a long-term investment. In this paper, the housing Sale Data from Ames, Iowa is considered for the timeframe 2006–2010 with a view to construct relevant models to estimate the final sale price of a house. Due to high number of explanatory variables several models such as linear regression, random forest and gradient boosting models have been used as tools for feature selection to determine the statistically significant characteristics that influence the final sale price of a house. It has been observed that out of all the models, the gradient boosting model returned the efficient results.

Suggested Citation

  • Archana Singh & Apoorva Sharma & Gaurav Dubey, 2020. "Big data analytics predicting real estate prices," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 11(2), pages 208-219, July.
  • Handle: RePEc:spr:ijsaem:v:11:y:2020:i:2:d:10.1007_s13198-020-00946-3
    DOI: 10.1007/s13198-020-00946-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13198-020-00946-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13198-020-00946-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sonka, Steve, 2014. "Big Data and the Ag Sector: More than Lots of Numbers," International Food and Agribusiness Management Review, International Food and Agribusiness Management Association, vol. 17(1), pages 1-20, February.
    2. Peng Wang & Myounggu Kang, 2014. "An empirical analysis on the housing prices in the Pearl River Delta Economic Region of China," International Journal of Urban Sciences, Taylor & Francis Journals, vol. 18(1), pages 103-114, March.
    3. David Wheeler & Michael Tiefelsdorf, 2005. "Multicollinearity and correlation among local regression coefficients in geographically weighted regression," Journal of Geographical Systems, Springer, vol. 7(2), pages 161-187, June.
    4. Archana Singh & Ajay Rana & Jayanthi Ranjan, 2015. "Proposed analytical customer centric model for an automobile industry," International Journal of Data Mining, Modelling and Management, Inderscience Enterprises Ltd, vol. 7(4), pages 314-330.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jungsun Kim & Jaewoong Won & Hyeongsoon Kim & Joonghyeok Heo, 2021. "Machine-Learning-Based Prediction of Land Prices in Seoul, South Korea," Sustainability, MDPI, vol. 13(23), pages 1-14, November.
    2. Maral Taşcılar & Kerem Yavuz Arslanlı, 2022. "Forecasting commercial real estate indicators under COVID-19 by adopting human activity using social big data," Asia-Pacific Journal of Regional Science, Springer, vol. 6(3), pages 1111-1132, October.
    3. Cankun Wei & Meichen Fu & Li Wang & Hanbing Yang & Feng Tang & Yuqing Xiong, 2022. "The Research Development of Hedonic Price Model-Based Real Estate Appraisal in the Era of Big Data," Land, MDPI, vol. 11(3), pages 1-30, February.
    4. Marco Locurcio & Pierluigi Morano & Francesco Tajani & Felicia Di Liddo, 2020. "An Innovative GIS-Based Territorial Information Tool for the Evaluation of Corporate Properties: An Application to the Italian Context," Sustainability, MDPI, vol. 12(14), pages 1-29, July.
    5. Silva, Diego S. & Yamashita, Gabrielli Harumi & Cortimiglia, Marcelo Nogueira & Brust-Renck, Priscila G. & ten Caten, Carla Schwengber, 2022. "Are we ready to assess digital readiness? Exploring digital implications for social progress from the Network Readiness Index," Technology in Society, Elsevier, vol. 68(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Diana Gutiérrez Posada & Fernando Rubiera Morollón & Ana Viñuela, 2018. "Ageing Places in an Ageing Country: The Local Dynamics of the Elderly Population in Spain," Tijdschrift voor Economische en Sociale Geografie, Royal Dutch Geographical Society KNAG, vol. 109(3), pages 332-349, July.
    2. Marco Helbich & Wolfgang Brunauer & Eric Vaz & Peter Nijkamp, 2014. "Spatial Heterogeneity in Hedonic House Price Models: The Case of Austria," Urban Studies, Urban Studies Journal Limited, vol. 51(2), pages 390-411, February.
    3. Yu, Haitao & Peng, Zhong-Ren, 2019. "Exploring the spatial variation of ridesourcing demand and its relationship to built environment and socioeconomic factors with the geographically weighted Poisson regression," Journal of Transport Geography, Elsevier, vol. 75(C), pages 147-163.
    4. Kristoffer B. Birkeland & Allan D. D'Silva & Roland Füss & Are Oust, 2021. "The Predictability of House Prices: "Human Against Machine"," International Real Estate Review, Global Social Science Institute, vol. 24(2), pages 139-183.
    5. Hoehun Ha & Wei Tu, 2018. "An Ecological Study on the Spatially Varying Relationship between County-Level Suicide Rates and Altitude in the United States," IJERPH, MDPI, vol. 15(4), pages 1-16, April.
    6. Alexis Comber & Paul Harris, 2018. "Geographically weighted elastic net logistic regression," Journal of Geographical Systems, Springer, vol. 20(4), pages 317-341, October.
    7. Oshan, Taylor M., 2022. "Navigating the methodological landscape in spatial analysis: a comment on ‘A Route Map for Successful Applications of Geographically-Weighted Regression’," OSF Preprints rckzj, Center for Open Science.
    8. Carla Shoff & Tse-Chuan Yang, 2012. "Spatially varying predictors of teenage birth rates among counties in the United States," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 27(14), pages 377-418.
    9. Stephen Matthews & Daniel M. Parker, 2013. "Progress in Spatial Demography," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 28(10), pages 271-312.
    10. Chuanhua Wei & Chao Liu & Fengyun Gui, 2017. "Geographically weight seemingly unrelated regression (GWSUR): a method for exploring spatio-temporal heterogeneity," Applied Economics, Taylor & Francis Journals, vol. 49(42), pages 4189-4195, September.
    11. A. Stewart Fotheringham & Taylor M. Oshan, 2016. "Geographically weighted regression and multicollinearity: dispelling the myth," Journal of Geographical Systems, Springer, vol. 18(4), pages 303-329, October.
    12. Li, Hengyun & Chen, Jason Li & Li, Gang & Goh, Carey, 2016. "Tourism and regional income inequality: Evidence from China," Annals of Tourism Research, Elsevier, vol. 58(C), pages 81-99.
    13. Olaru, Doina & Mulley, Corinne & Smith, Brett & Ma, Liang, 2017. "Policy-led selection of the most appropriate empirical model to estimate hedonic prices in the residential market," Journal of Transport Geography, Elsevier, vol. 62(C), pages 213-228.
    14. Löchl, Michael & Axhausen, Kay W., 2010. "Modelling hedonic residential rents for land use and transport simulation while considering spatial effects," The Journal of Transport and Land Use, Center for Transportation Studies, University of Minnesota, vol. 3(2), pages 39-63.
    15. Dean Hanink & Robert Cromley & Avraham Ebenstein, 2012. "Spatial Variation in the Determinants of House Prices and Apartment Rents in China," The Journal of Real Estate Finance and Economics, Springer, vol. 45(2), pages 347-363, August.
    16. Stamatis Kalogirou, 2012. "Testing local versions of correlation coefficients," Review of Regional Research: Jahrbuch für Regionalwissenschaft, Springer;Gesellschaft für Regionalforschung (GfR), vol. 32(1), pages 45-61, March.
    17. repec:rre:publsh:v:51:y:2021:i:2 is not listed on IDEAS
    18. Gollini, Isabella & Lu, Binbin & Charlton, Martin & Brunsdon, Christopher & Harris, Paul, 2015. "GWmodel: An R Package for Exploring Spatial Heterogeneity Using Geographically Weighted Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i17).
    19. Mendieta, Rodrigo & Ontaneda, Diego & Pontarollo, Nicola, 2019. "Canton growth in Ecuador and the role of spatial heterogeneity," Revista CEPAL, Naciones Unidas Comisión Económica para América Latina y el Caribe (CEPAL), December.
    20. Paul Harris & Bruno Lanfranco & Binbin Lu & Alexis Comber, 2020. "Influence of Geographical Effects in Hedonic Pricing Models for Grass-Fed Cattle in Uruguay," Agriculture, MDPI, vol. 10(7), pages 1-17, July.
    21. Feuillet, T. & Commenges, H. & Menai, M. & Salze, P. & Perchoux, C. & Reuillon, R. & Kesse-Guyot, E. & Enaux, C. & Nazare, J.-A. & Hercberg, S. & Simon, C. & Charreire, H. & Oppert, J.M., 2018. "A massive geographically weighted regression model of walking-environment relationships," Journal of Transport Geography, Elsevier, vol. 68(C), pages 118-129.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:ijsaem:v:11:y:2020:i:2:d:10.1007_s13198-020-00946-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.