IDEAS home Printed from https://ideas.repec.org/a/eee/energy/v239y2022ipds0360544221025214.html
   My bibliography  Save this article

Trade-off between accuracy and fairness of data-driven building and indoor environment models: A comparative study of pre-processing methods

Author

Listed:
  • Sun, Ying
  • Haghighat, Fariborz
  • Fung, Benjamin C.M.

Abstract

Data-driven models have drawn extensive attention in the building domain in recent years, and their predictive accuracy depends on features or data distribution. Accuracy variation among users or periods creates a certain unfairness to some users. This paper addresses a new research problem called fairness-aware prediction of data-driven building and indoor environment models. First, three types of fairness definitions are introduced in building engineering. Next, Type I and Type II fairness are investigated. To achieve fairness Type I, we study the effect of suppressing the protected attribute (i.e., attribute whose value cannot be disclosed or be discriminated against) from inputs. To improve fairness Type II while preserving the predictive accuracy of data-driven building and indoor environment models, we propose three pre-processing methods for training dataset—sequential sampling, reversed preferential sampling, and sequential preferential sampling. The proposed methods are compared to two existing pre-processing methods in a case study for lighting status prediction in an apartment building. Overall, 576 study cases were used to study the effect of these pre-processing methods on the accuracy and fairness of 12 series of lighting status prediction based on 2 types of feature combinations and 4 types of classifiers. Predictive results show that suppressing the protected attribute slightly influences overall predictive accuracy, while all pre-processing methods decrease it. However, in general, sequential sampling would be a good option for improving fairness Type II with an acceptable accuracy decrease. Fairness improvement performance of other pre-processing methods varies depending on applied features and classifiers.

Suggested Citation

  • Sun, Ying & Haghighat, Fariborz & Fung, Benjamin C.M., 2022. "Trade-off between accuracy and fairness of data-driven building and indoor environment models: A comparative study of pre-processing methods," Energy, Elsevier, vol. 239(PD).
  • Handle: RePEc:eee:energy:v:239:y:2022:i:pd:s0360544221025214
    DOI: 10.1016/j.energy.2021.122273
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0360544221025214
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.energy.2021.122273?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Chaobo & Li, Junyang & Zhao, Yang & Li, Tingting & Chen, Qi & Zhang, Xuejun & Qiu, Weikang, 2021. "Problem of data imbalance in building energy load prediction: Concept, influence, and solution," Applied Energy, Elsevier, vol. 297(C).
    2. Reynolds, Jonathan & Rezgui, Yacine & Kwan, Alan & Piriou, Solène, 2018. "A zone-level, building energy optimisation combining an artificial neural network, a genetic algorithm, and model predictive control," Energy, Elsevier, vol. 151(C), pages 729-739.
    3. Ascione, Fabrizio & Bianco, Nicola & De Stasio, Claudio & Mauro, Gerardo Maria & Vanoli, Giuseppe Peter, 2017. "Artificial neural networks to predict energy performance and retrofit scenarios for any member of a building category: A novel approach," Energy, Elsevier, vol. 118(C), pages 999-1017.
    4. Peng, Yuzhen & Rysanek, Adam & Nagy, Zoltán & Schlüter, Arno, 2018. "Using machine learning techniques for occupancy-prediction-based cooling control in office buildings," Applied Energy, Elsevier, vol. 211(C), pages 1343-1358.
    5. Amasyali, Kadir & El-Gohary, Nora M., 2018. "A review of data-driven building energy consumption prediction studies," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1192-1205.
    6. Tabares-Velasco, Paulo Cesar & Speake, Andrew & Harris, Maxwell & Newman, Alexandra & Vincent, Tyrone & Lanahan, Michael, 2019. "A modeling framework for optimization-based control of a residential building thermostat for time-of-use pricing," Applied Energy, Elsevier, vol. 242(C), pages 1346-1357.
    7. Naji, Sareh & Keivani, Afram & Shamshirband, Shahaboddin & Alengaram, U. Johnson & Jumaat, Mohd Zamin & Mansor, Zulkefli & Lee, Malrey, 2016. "Estimating building energy consumption using extreme learning machine method," Energy, Elsevier, vol. 97(C), pages 506-516.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Manfren, Massimiliano & Nastasi, Benedetto, 2023. "Interpretable data-driven building load profiles modelling for Measurement and Verification 2.0," Energy, Elsevier, vol. 283(C).
    2. Liu, Zhengguang & Guo, Zhiling & Chen, Qi & Song, Chenchen & Shang, Wenlong & Yuan, Meng & Zhang, Haoran, 2023. "A review of data-driven smart building-integrated photovoltaic systems: Challenges and objectives," Energy, Elsevier, vol. 263(PE).
    3. Li, Guannan & Li, Fan & Ahmad, Tanveer & Liu, Jiangyan & Li, Tao & Fang, Xi & Wu, Yubei, 2022. "Performance evaluation of sequence-to-sequence-Attention model for short-term multi-step ahead building energy predictions," Energy, Elsevier, vol. 259(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Zeyu & Liu, Jian & Zhang, Yuanxin & Yuan, Hongping & Zhang, Ruixue & Srinivasan, Ravi S., 2021. "Practical issues in implementing machine-learning models for building energy efficiency: Moving beyond obstacles," Renewable and Sustainable Energy Reviews, Elsevier, vol. 143(C).
    2. Zhang, Liang & Wen, Jin & Li, Yanfei & Chen, Jianli & Ye, Yunyang & Fu, Yangyang & Livingood, William, 2021. "A review of machine learning in building load prediction," Applied Energy, Elsevier, vol. 285(C).
    3. Kathirgamanathan, Anjukan & De Rosa, Mattia & Mangina, Eleni & Finn, Donal P., 2021. "Data-driven predictive control for unlocking building energy flexibility: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 135(C).
    4. Abhinandana Boodi & Karim Beddiar & Malek Benamour & Yassine Amirat & Mohamed Benbouzid, 2018. "Intelligent Systems for Building Energy and Occupant Comfort Optimization: A State of the Art Review and Recommendations," Energies, MDPI, vol. 11(10), pages 1-26, September.
    5. Gautham Krishnadas & Aristides Kiprakis, 2020. "A Machine Learning Pipeline for Demand Response Capacity Scheduling," Energies, MDPI, vol. 13(7), pages 1-25, April.
    6. James Ogundiran & Ehsan Asadi & Manuel Gameiro da Silva, 2024. "A Systematic Review on the Use of AI for Energy Efficiency and Indoor Environmental Quality in Buildings," Sustainability, MDPI, vol. 16(9), pages 1-30, April.
    7. Amasyali, Kadir & El-Gohary, Nora M., 2021. "Real data-driven occupant-behavior optimization for reduced energy consumption and improved comfort," Applied Energy, Elsevier, vol. 302(C).
    8. Thomas Wu & Bo Wang & Dongdong Zhang & Ziwei Zhao & Hongyu Zhu, 2023. "Benchmarking Evaluation of Building Energy Consumption Based on Data Mining," Sustainability, MDPI, vol. 15(6), pages 1-16, March.
    9. Fateme Dinmohammadi & Yuxuan Han & Mahmood Shafiee, 2023. "Predicting Energy Consumption in Residential Buildings Using Advanced Machine Learning Algorithms," Energies, MDPI, vol. 16(9), pages 1-23, April.
    10. Zou, Rongwei & Yang, Qiliang & Xing, Jianchun & Zhou, Qizhen & Xie, Liqiang & Chen, Wenjie, 2024. "Predicting the electric power consumption of office buildings based on dynamic and static hybrid data analysis," Energy, Elsevier, vol. 290(C).
    11. Zhang, Yan & Teoh, Bak Koon & Wu, Maozhi & Chen, Jiayu & Zhang, Limao, 2023. "Data-driven estimation of building energy consumption and GHG emissions using explainable artificial intelligence," Energy, Elsevier, vol. 262(PA).
    12. Kapp, Sean & Choi, Jun-Ki & Hong, Taehoon, 2023. "Predicting industrial building energy consumption with statistical and machine-learning models informed by physical system parameters," Renewable and Sustainable Energy Reviews, Elsevier, vol. 172(C).
    13. Venkatraj, V. & Dixit, M.K., 2022. "Challenges in implementing data-driven approaches for building life cycle energy assessment: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 160(C).
    14. Niemierko, Rochus & Töppel, Jannick & Tränkler, Timm, 2019. "A D-vine copula quantile regression approach for the prediction of residential heating energy consumption based on historical data," Applied Energy, Elsevier, vol. 233, pages 691-708.
    15. Di Natale, L. & Svetozarevic, B. & Heer, P. & Jones, C.N., 2022. "Physically Consistent Neural Networks for building thermal modeling: Theory and analysis," Applied Energy, Elsevier, vol. 325(C).
    16. Yan, Biao & Yang, Wansheng & He, Fuquan & Zeng, Wenhao, 2023. "Occupant behavior impact in buildings and the artificial intelligence-based techniques and data-driven approach solutions," Renewable and Sustainable Energy Reviews, Elsevier, vol. 184(C).
    17. Jahangir Hossain & Aida. F. A. Kadir & Ainain. N. Hanafi & Hussain Shareef & Tamer Khatib & Kyairul. A. Baharin & Mohamad. F. Sulaima, 2023. "A Review on Optimal Energy Management in Commercial Buildings," Energies, MDPI, vol. 16(4), pages 1-40, February.
    18. Grillone, Benedetto & Danov, Stoyan & Sumper, Andreas & Cipriano, Jordi & Mor, Gerard, 2020. "A review of deterministic and data-driven methods to quantify energy efficiency savings and to predict retrofitting scenarios in buildings," Renewable and Sustainable Energy Reviews, Elsevier, vol. 131(C).
    19. Li, Xinyi & Yao, Runming, 2020. "A machine-learning-based approach to predict residential annual space heating and cooling loads considering occupant behaviour," Energy, Elsevier, vol. 212(C).
    20. Ramya Kuppusamy & Srete Nikolovski & Yuvaraja Teekaraman, 2023. "Review of Machine Learning Techniques for Power Quality Performance Evaluation in Grid-Connected Systems," Sustainability, MDPI, vol. 15(20), pages 1-29, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:239:y:2022:i:pd:s0360544221025214. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.