IDEAS home Printed from https://ideas.repec.org/p/arz/wpaper/2022_234.html
   My bibliography  Save this paper

Machine Learning in Building Documentation (ML-BAU-DOK) - Foundations for Information Extraction for Energy Efficiency and Life Cycle Analysis

Author

Listed:
  • Jonathan Rothenbusch
  • Konstantin Schütz
  • Feibai Huang
  • Björn-Martin Kurzrock

Abstract

The construction and real estate industry features long product life cycles, a wide range of stakeholders and a high information density. Information is not only available in large quantities, but also in a very heterogeneous and user-specific manner. Digital building documentation is partially available in some companies and even non-existent in others. The result is often analogue, unstructured building documentation, which makes the processing of data and information considerably more difficult and, in the worst case, leads to media disruptions between those involved.However, the benefits of a lean, complete and targeted digital building documentation can be manifold. In particular, automated information extraction and further information retrieval are seen as having great potential. Information extraction as an ultimate aim, requires a defined handling of analogue documents, transparent criteria regarding data quality and machine readability as well as a clear classification system.The research project ML-BAU-DOK (funded by The Federal Office for Building and Regional Planning BBSR, SWD-10.08.18.7-20.26) presents the necessary preparatory processes for advanced digital use of building documentation. First, a set of rules is created to digitize paper-based documentation in a targeted manner. The automated separation of mass documentation into individual documents, as well as the classification of documents into selected document classes, is mapped using machine-learning. The document classes are consolidated from the current worldwide class standards and prioritized according to their information content. The project includes the evaluation of 600,000 document pages, which are analysed class-specifically with regard to two use cases, energy efficiency and life cycle analysis. The methodology ensures transferability of the results to other use cases.The key result of the ML-BAU-DOK is an algorithm that automatically separates individual documents from a mass scan, assigns the individual documents to defined document classes, and thus reduces the amount of scanning and filing required. This leads to a classification system that enables information extraction as a subsequent goal and brings the construction and real estate industry closer to a Common Data Environment.

Suggested Citation

  • Jonathan Rothenbusch & Konstantin Schütz & Feibai Huang & Björn-Martin Kurzrock, 2022. "Machine Learning in Building Documentation (ML-BAU-DOK) - Foundations for Information Extraction for Energy Efficiency and Life Cycle Analysis," ERES 2022_234, European Real Estate Society (ERES).
  • Handle: RePEc:arz:wpaper:2022_234
    as

    Download full text from publisher

    File URL: https://eres.architexturez.net/doc/eres-id-eres2022-234
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    Document Classification; Document Separation; Heterogeneous Building Documentation; Machine Learning;
    All these keywords.

    JEL classification:

    • R3 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - Real Estate Markets, Spatial Production Analysis, and Firm Location

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arz:wpaper:2022_234. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Architexturez Imprints (email available below). General contact details of provider: https://edirc.repec.org/data/eressea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.