IDEAS home Printed from https://ideas.repec.org/a/gam/jeners/v15y2022i23p8894-d983389.html
   My bibliography  Save this article

Semantic-Similarity-Based Schema Matching for Management of Building Energy Data

Author

Listed:
  • Zhiyu Pan

    (Institute for Automation of Complex Power Systems, RWTH Aachen University, 52074 Aachen, Germany)

  • Guanchen Pan

    (Institute for Automation of Complex Power Systems, RWTH Aachen University, 52074 Aachen, Germany)

  • Antonello Monti

    (Institute for Automation of Complex Power Systems, RWTH Aachen University, 52074 Aachen, Germany
    Fraunhofer Institute for Applied Information Technology FIT, 53757 Sankt Augustin, Germany)

Abstract

The increase in heterogeneous data in the building energy domain creates a difficult challenge for data integration. Schema matching, which maps the raw data from the building energy domain to a generic data model, is the necessary step in data integration and provides a unique representation. Only a small amount of labeled data for schema matching exists and it is time-consuming and labor-intensive to manually label data. This paper applies semantic-similarity methods to the automatic schema-mapping process by combining knowledge from natural language processing, which reduces the manual effort in heterogeneous data integration. The active-learning method is applied to solve the lack-of-labeled-data problem in schema matching. The results of the schema matching with building-energy-domain data show the pre-trained language model provides a massive improvement in the accuracy of schema matching and the active-learning method greatly reduces the amount of labeled data required.

Suggested Citation

  • Zhiyu Pan & Guanchen Pan & Antonello Monti, 2022. "Semantic-Similarity-Based Schema Matching for Management of Building Energy Data," Energies, MDPI, vol. 15(23), pages 1-23, November.
  • Handle: RePEc:gam:jeners:v:15:y:2022:i:23:p:8894-:d:983389
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/15/23/8894/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/15/23/8894/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Marco Pau & Panagiotis Kapsalis & Zhiyu Pan & George Korbakis & Dario Pellegrino & Antonello Monti, 2022. "MATRYCS—A Big Data Architecture for Advanced Services in the Building Domain," Energies, MDPI, vol. 15(7), pages 1-22, April.
    2. Balaji, Bharathan & Bhattacharya, Arka & Fierro, Gabriel & Gao, Jingkun & Gluck, Joshua & Hong, Dezhi & Johansen, Aslak & Koh, Jason & Ploennigs, Joern & Agarwal, Yuvraj & Bergés, Mario & Culler, Davi, 2018. "Brick : Metadata schema for portable smart building applications," Applied Energy, Elsevier, vol. 226(C), pages 1273-1292.
    3. Marco Pritoni & Drew Paine & Gabriel Fierro & Cory Mosiman & Michael Poplawski & Avijit Saha & Joel Bender & Jessica Granderson, 2021. "Metadata Schemas and Ontologies for Building Energy Applications: A Critical Review and Use Case Analysis," Energies, MDPI, vol. 14(7), pages 1-37, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chen, Zhelun & O’Neill, Zheng & Wen, Jin & Pradhan, Ojas & Yang, Tao & Lu, Xing & Lin, Guanjing & Miyata, Shohei & Lee, Seungjae & Shen, Chou & Chiosa, Roberto & Piscitelli, Marco Savino & Capozzoli, , 2023. "A review of data-driven fault detection and diagnostics for building HVAC systems," Applied Energy, Elsevier, vol. 339(C).
    2. Filippos Lygerakis & Nikos Kampelis & Dionysia Kolokotsa, 2022. "Knowledge Graphs’ Ontologies and Applications for Energy Efficiency in Buildings: A Review," Energies, MDPI, vol. 15(20), pages 1-32, October.
    3. Ru-Guan Wang & Wen-Jen Ho & Kuei-Chun Chiang & Yung-Chieh Hung & Jen-Kuo Tai & Jia-Cheng Tan & Mei-Ling Chuang & Chi-Yun Ke & Yi-Fan Chien & An-Ping Jeng & Chien-Cheng Chou, 2023. "Analyzing Long-Term and High Instantaneous Power Consumption of Buildings from Smart Meter Big Data with Deep Learning and Knowledge Graph Techniques," Energies, MDPI, vol. 16(19), pages 1-24, September.
    4. Yimin Chen & Guanjing Lin & Eliot Crowe & Jessica Granderson, 2021. "Development of a Unified Taxonomy for HVAC System Faults," Energies, MDPI, vol. 14(17), pages 1-25, September.
    5. Sulzer, Matthias & Wetter, Michael & Mutschler, Robin & Sangiovanni-Vincentelli, Alberto, 2023. "Platform-based design for energy systems," Applied Energy, Elsevier, vol. 352(C).
    6. Luo, Na & Pritoni, Marco & Hong, Tianzhen, 2021. "An overview of data tools for representing and managing building information and performance data," Renewable and Sustainable Energy Reviews, Elsevier, vol. 147(C).
    7. Antonio De Nicola & Maria Luisa Villani, 2021. "Smart City Ontologies and Their Applications: A Systematic Literature Review," Sustainability, MDPI, vol. 13(10), pages 1-40, May.
    8. Cezar-Petre Simion & Cătălin-Alexandru Verdeș & Alexandra-Andreea Mironescu & Florin-Gabriel Anghel, 2023. "Digitalization in Energy Production, Distribution, and Consumption: A Systematic Literature Review," Energies, MDPI, vol. 16(4), pages 1-30, February.
    9. Cory Mosiman & Gregor Henze & Herbert Els, 2021. "Development and Application of Schema Based Occupant-Centric Building Performance Metrics," Energies, MDPI, vol. 14(12), pages 1-16, June.
    10. Khan Rahmat Ullah & Marudhappan Thirugnanasambandam & Rahman Saidur & Kazi Akikur Rahman & Md. Riaz Kayser, 2021. "Analysis of Energy Use and Energy Savings: A Case Study of a Condiment Industry in India," Energies, MDPI, vol. 14(16), pages 1-25, August.
    11. Le, Duc Nha & Le Tuan, Loc & Dang Tuan, Minh Nguyen, 2019. "Smart-building management system: An Internet-of-Things (IoT) application business model in Vietnam," Technological Forecasting and Social Change, Elsevier, vol. 141(C), pages 22-35.
    12. Angelo Massafra & Carlo Costantino & Giorgia Predari & Riccardo Gulli, 2023. "Building Information Modeling and Building Performance Simulation-Based Decision Support Systems for Improved Built Heritage Operation," Sustainability, MDPI, vol. 15(14), pages 1-31, July.
    13. Wetter, Michael & Ehrlich, Paul & Gautier, Antoine & Grahovac, Milica & Haves, Philip & Hu, Jianjun & Prakash, Anand & Robin, Dave & Zhang, Kun, 2022. "OpenBuildingControl: Digitizing the control delivery from building energy modeling to specification, implementation and formal verification," Energy, Elsevier, vol. 238(PA).
    14. Gardian, H. & Beck, J.-P. & Koch, M. & Kunze, R. & Muschner, C. & Hülk, L. & Bucksteeg, M., 2022. "Data harmonisation for energy system analysis – Example of multi-model experiments," Renewable and Sustainable Energy Reviews, Elsevier, vol. 162(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:15:y:2022:i:23:p:8894-:d:983389. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.