Semi-supervised learning based framework for urban level building electricity consumption prediction

Authors

  • Jin, Xiaoyu
  • Xiao, Fu
  • Zhang, Chong
  • Chen, Zhijie

Abstract

The spatial characteristics of building energy consumption in a city are essential for urban level energy planning and policy making. With the increasing availability of urban level building energy benchmarking datasets, machine learning has shown a powerful capability for making data-driven predictions of urban level building energy consumption. However, building energy benchmarking datasets usually cover only large buildings, which do not sufficiently represent all buildings in a city. Many other urban level open datasets are also valuable for building energy prediction, but they do not contain building energy use data; in other words, they are unlabeled. This study proposes a novel framework based on semi-supervised learning that makes effective use of the unlabeled datasets to develop more generic urban level data-driven building energy prediction models and to produce energy maps with higher spatial resolution. The framework consists of three steps: preliminary labeling, selection of pseudo-labeled samples, and predictive modelling. Several machine learning algorithms are proposed and compared for generating pseudo labels of building electricity consumption for unlabeled datasets of small and medium-sized buildings. A selection process consisting of convergence testing and screening is designed to select pseudo-labeled samples with high credibility, which are then used to enlarge the labeled dataset. A novel two-level performance evaluation method is proposed to evaluate the framework at both the urban level and the district level, so as to enhance the spatial resolution of the predictions. The framework is implemented to model and map the individual electricity consumption of all buildings in the districts of New York City over two years using multiple open datasets. The results show significant improvement in prediction accuracy at both levels. In addition, the applicability of the model to various buildings in the city is remarkably enhanced.
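
The abstract names the steps of the framework but not their details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a single feature (floor area), two simple stand-in predictors (a least-squares line and a k-nearest-neighbour average), and a hypothetical screening rule that keeps a pseudo label only when the two predictors agree within a relative tolerance. The two-level evaluation is likewise an assumed form: the relative error of the city total, and the mean relative error of the district totals. All function names and numbers are illustrative.

```python
from statistics import mean

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x on one feature."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    a = my - b * mx
    return lambda x: a + b * x

def fit_knn(xs, ys, k=2):
    """k-nearest-neighbour regressor on one feature."""
    pairs = list(zip(xs, ys))
    def predict(x):
        nearest = sorted(pairs, key=lambda p: abs(p[0] - x))[:k]
        return mean(y for _, y in nearest)
    return predict

def select_pseudo_labels(models, unlabeled_xs, tol=0.10):
    """Assumed screening rule: keep a sample only if all models agree
    within a relative tolerance; its pseudo label is the mean prediction."""
    kept = []
    for x in unlabeled_xs:
        preds = [m(x) for m in models]
        centre = mean(preds)
        if centre > 0 and (max(preds) - min(preds)) / centre <= tol:
            kept.append((x, centre))
    return kept

def two_level_errors(actual, predicted, districts):
    """Return (relative error of the city total,
    mean relative error of the district totals)."""
    city = abs(sum(predicted) - sum(actual)) / sum(actual)
    totals = {}
    for a, p, d in zip(actual, predicted, districts):
        ta, tp = totals.get(d, (0.0, 0.0))
        totals[d] = (ta + a, tp + p)
    district = mean(abs(tp - ta) / ta for ta, tp in totals.values())
    return city, district

# Hypothetical labeled data for large buildings: floor area -> consumption.
areas = [10, 20, 30, 40]
use = [105, 195, 310, 390]
models = [fit_line(areas, use), fit_knn(areas, use)]

# Hypothetical unlabeled buildings; only credible pseudo labels are kept.
pseudo = select_pseudo_labels(models, [2, 12, 25, 35])
print([x for x, _ in pseudo])

# Errors can cancel at city level yet remain visible per district.
print(two_level_errors([100, 200, 300, 400], [120, 230, 270, 380],
                       ["A", "A", "B", "B"]))
```

In this toy data the two predictors disagree for the smallest buildings, which lie outside or at the edge of the labeled range, so those pseudo labels are screened out. The second call illustrates why a district-level check adds information: the over- and under-predictions cancel in the city total but not within each district.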

Suggested Citation

  • Jin, Xiaoyu & Xiao, Fu & Zhang, Chong & Chen, Zhijie, 2022. "Semi-supervised learning based framework for urban level building electricity consumption prediction," Applied Energy, Elsevier, vol. 328(C).
  • Handle: RePEc:eee:appene:v:328:y:2022:i:c:s0306261922014672
    DOI: 10.1016/j.apenergy.2022.120210

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261922014672
    Download Restriction: Full text for ScienceDirect subscribers only
