IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v268y2020ics0306261920304773.html
   My bibliography  Save this article

Data-driven estimation of building energy consumption with multi-source heterogeneous data

Author

Listed:
  • Pan, Yue
  • Zhang, Limao

Abstract

For better energy evaluation and management, a categorical boosting (CatBoost)-based predictive method is presented to accurately estimate building energy consumption by learning large volumes of multi-source heterogeneous data collected from buildings. To be specific, the newly-developed CatBoost model belonging to the ensemble learning has superiority in handling categorical variables and producing reliable results. As a case study, our proposed method is validated in a multi-dimensional dataset about Seattle's building energy performance provided by the city’s government, aiming to estimate the weather normalized site energy use intensity of buildings and characterize its non-linear relationship with other 12 possible influential features. Results from the 5-fold cross-validation demonstrate that the model exhibits a strong ability in predicting the exact value of energy intensity precisely, which can even outperform popular machine learning algorithms including random forest and gradient boosting decision tree under R2 of 0.897. Based on a defined threshold, these predicted values can be classified as the normal or abnormal energy consumption reaching an accuracy of 99.32% for outlier detection, which is helpful in alarming potential risks at an early stage and developing strategies to enhance the energy efficiency. Moreover, results from the established model can be interpreted objectively, suggesting that features concerning the physical and energy characteristics contribute more to energy estimation than environmental features. Since such results understand the building energy consumption and efficiency in a data-driven manner, they can eventually serve as guidance for building owners and designers in designing and renovating buildings to achieve better energy-conserving performance.

Suggested Citation

  • Pan, Yue & Zhang, Limao, 2020. "Data-driven estimation of building energy consumption with multi-source heterogeneous data," Applied Energy, Elsevier, vol. 268(C).
  • Handle: RePEc:eee:appene:v:268:y:2020:i:c:s0306261920304773
    DOI: 10.1016/j.apenergy.2020.114965
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261920304773
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2020.114965?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jain, Rishee K. & Smith, Kevin M. & Culligan, Patricia J. & Taylor, John E., 2014. "Forecasting energy consumption of multi-family residential buildings using support vector regression: Investigating the impact of temporal and spatial monitoring granularity on performance accuracy," Applied Energy, Elsevier, vol. 123(C), pages 168-178.
    2. Guo, Yabin & Wang, Jiangyu & Chen, Huanxin & Li, Guannan & Liu, Jiangyan & Xu, Chengliang & Huang, Ronggeng & Huang, Yao, 2018. "Machine learning-based thermal response time ahead energy demand prediction for building heating systems," Applied Energy, Elsevier, vol. 221(C), pages 16-27.
    3. Nan Zhou & Nina Khanna & Wei Feng & Jing Ke & Mark Levine, 2018. "Scenarios of energy efficiency and CO2 emissions reduction potential in the buildings sector in China to year 2050," Nature Energy, Nature, vol. 3(11), pages 978-984, November.
    4. Ma, Jun & Cheng, Jack C.P., 2016. "Identifying the influential features on the regional energy use intensity of residential buildings based on Random Forests," Applied Energy, Elsevier, vol. 183(C), pages 193-201.
    5. Bartusch, Cajsa & Odlare, Monica & Wallin, Fredrik & Wester, Lars, 2012. "Exploring variance in residential electricity consumption: Household features and building properties," Applied Energy, Elsevier, vol. 92(C), pages 637-643.
    6. Biswas, M.A. Rafe & Robinson, Melvin D. & Fumo, Nelson, 2016. "Prediction of residential building energy consumption: A neural network approach," Energy, Elsevier, vol. 117(P1), pages 84-92.
    7. Li, Hong Xian & Li, Yan & Jiang, Boya & Zhang, Limao & Wu, Xianguo & Lin, Jingyi, 2020. "Energy performance optimisation of building envelope retrofit through integrated orthogonal arrays with data envelopment analysis," Renewable Energy, Elsevier, vol. 149(C), pages 1414-1423.
    8. Rahman, Aowabin & Srikumar, Vivek & Smith, Amanda D., 2018. "Predicting electricity consumption for commercial and residential buildings using deep recurrent neural networks," Applied Energy, Elsevier, vol. 212(C), pages 372-385.
    9. Abu Bakar, Nur Najihah & Hassan, Mohammad Yusri & Abdullah, Hayati & Rahman, Hasimah Abdul & Abdullah, Md Pauzi & Hussin, Faridah & Bandi, Masilah, 2015. "Energy efficiency index as an indicator for measuring building energy performance: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 44(C), pages 1-11.
    10. Zhao, Yang & Li, Tingting & Zhang, Xuejun & Zhang, Chaobo, 2019. "Artificial intelligence-based fault detection and diagnosis methods for building energy systems: Advantages, challenges and the future," Renewable and Sustainable Energy Reviews, Elsevier, vol. 109(C), pages 85-101.
    11. Omar Isaac Asensio & Magali A. Delmas, 2017. "The effectiveness of US energy efficiency building labels," Nature Energy, Nature, vol. 2(4), pages 1-9, April.
    12. Chengdong Li & Zixiang Ding & Dongbin Zhao & Jianqiang Yi & Guiqing Zhang, 2017. "Building Energy Consumption Prediction: An Extreme Deep Learning Approach," Energies, MDPI, vol. 10(10), pages 1-20, October.
    13. Hong, Jingke & Shen, Qiping & Xue, Fan, 2016. "A multi-regional structural path analysis of the energy supply chain in China's construction industry," Energy Policy, Elsevier, vol. 92(C), pages 56-68.
    14. Annunziata, Eleonora & Frey, Marco & Rizzi, Francesco, 2013. "Towards nearly zero-energy buildings: The state-of-art of national regulations in Europe," Energy, Elsevier, vol. 57(C), pages 125-133.
    15. Amasyali, Kadir & El-Gohary, Nora M., 2018. "A review of data-driven building energy consumption prediction studies," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1192-1205.
    16. Margaret Walls, 2017. "Energy efficiency: Building labels lead to savings," Nature Energy, Nature, vol. 2(4), pages 1-2, April.
    17. Wang, Zeyu & Srinivasan, Ravi S., 2017. "A review of artificial intelligence based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models," Renewable and Sustainable Energy Reviews, Elsevier, vol. 75(C), pages 796-808.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Venkatraj, V. & Dixit, M.K., 2022. "Challenges in implementing data-driven approaches for building life cycle energy assessment: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 160(C).
    2. Tran, Duc-Hoc & Luong, Duc-Long & Chou, Jui-Sheng, 2020. "Nature-inspired metaheuristic ensemble model for forecasting energy consumption in residential buildings," Energy, Elsevier, vol. 191(C).
    3. Zhong, Hai & Wang, Jiajun & Jia, Hongjie & Mu, Yunfei & Lv, Shilei, 2019. "Vector field-based support vector regression for building energy consumption prediction," Applied Energy, Elsevier, vol. 242(C), pages 403-414.
    4. Jason Runge & Radu Zmeureanu, 2021. "A Review of Deep Learning Techniques for Forecasting Energy Use in Buildings," Energies, MDPI, vol. 14(3), pages 1-26, January.
    5. Li, Xinyi & Yao, Runming, 2020. "A machine-learning-based approach to predict residential annual space heating and cooling loads considering occupant behaviour," Energy, Elsevier, vol. 212(C).
    6. Ding, Zhikun & Chen, Weilin & Hu, Ting & Xu, Xiaoxiao, 2021. "Evolutionary double attention-based long short-term memory model for building energy prediction: Case study of a green building," Applied Energy, Elsevier, vol. 288(C).
    7. Peplinski, McKenna & Dilkina, Bistra & Chen, Mo & Silva, Sam J. & Ban-Weiss, George A. & Sanders, Kelly T., 2024. "A machine learning framework to estimate residential electricity demand based on smart meter electricity, climate, building characteristics, and socioeconomic datasets," Applied Energy, Elsevier, vol. 357(C).
    8. Jason Runge & Radu Zmeureanu, 2019. "Forecasting Energy Use in Buildings Using Artificial Neural Networks: A Review," Energies, MDPI, vol. 12(17), pages 1-27, August.
    9. Ahmed Gassar, Abdo Abdullah & Yun, Geun Young & Kim, Sumin, 2019. "Data-driven approach to prediction of residential energy consumption at urban scales in London," Energy, Elsevier, vol. 187(C).
    10. Fan, Cheng & Xiao, Fu & Yan, Chengchu & Liu, Chengliang & Li, Zhengdao & Wang, Jiayuan, 2019. "A novel methodology to explain and evaluate data-driven building energy performance models based on interpretable machine learning," Applied Energy, Elsevier, vol. 235(C), pages 1551-1560.
    11. Fathi, Soheil & Srinivasan, Ravi & Fenner, Andriel & Fathi, Sahand, 2020. "Machine learning applications in urban building energy performance forecasting: A systematic review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 133(C).
    12. Wang, Zeyu & Liu, Jian & Zhang, Yuanxin & Yuan, Hongping & Zhang, Ruixue & Srinivasan, Ravi S., 2021. "Practical issues in implementing machine-learning models for building energy efficiency: Moving beyond obstacles," Renewable and Sustainable Energy Reviews, Elsevier, vol. 143(C).
    13. Fan, Cheng & Xiao, Fu & Song, Mengjie & Wang, Jiayuan, 2019. "A graph mining-based methodology for discovering and visualizing high-level knowledge for building energy management," Applied Energy, Elsevier, vol. 251(C), pages 1-1.
    14. Deb, Chirag & Dai, Zhonghao & Schlueter, Arno, 2021. "A machine learning-based framework for cost-optimal building retrofit," Applied Energy, Elsevier, vol. 294(C).
    15. Chou, Jui-Sheng & Tran, Duc-Son, 2018. "Forecasting energy consumption time series using machine learning techniques based on usage patterns of residential householders," Energy, Elsevier, vol. 165(PB), pages 709-726.
    16. Gautham Krishnadas & Aristides Kiprakis, 2020. "A Machine Learning Pipeline for Demand Response Capacity Scheduling," Energies, MDPI, vol. 13(7), pages 1-25, April.
    17. Wang, Ran & Lu, Shilei & Feng, Wei, 2020. "A novel improved model for building energy consumption prediction based on model integration," Applied Energy, Elsevier, vol. 262(C).
    18. Thomas Wu & Bo Wang & Dongdong Zhang & Ziwei Zhao & Hongyu Zhu, 2023. "Benchmarking Evaluation of Building Energy Consumption Based on Data Mining," Sustainability, MDPI, vol. 15(6), pages 1-16, March.
    19. R. Rueda & M. P. Cuéllar & M. Molina-Solana & Y. Guo & M. C. Pegalajar, 2019. "Generalised Regression Hypothesis Induction for Energy Consumption Forecasting," Energies, MDPI, vol. 12(6), pages 1-22, March.
    20. Zhou, Xinlei & Lin, Wenye & Kumar, Ritunesh & Cui, Ping & Ma, Zhenjun, 2022. "A data-driven strategy using long short term memory models and reinforcement learning to predict building electricity consumption," Applied Energy, Elsevier, vol. 306(PB).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:268:y:2020:i:c:s0306261920304773. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.