IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v7y2022i8p113-d887021.html
   My bibliography  Save this article

Data Warehousing Process Modeling from Classical Approaches to New Trends: Main Features and Comparisons

Author

Listed:
  • Asma Dhaouadi

    (RIADI Laboratory, University of Manouba, Mannouba 2010, Tunisia
    LISTIC Laboratory, University of Savoie Mont Blanc, France Annecy-Chambéry, 74940 Chambéry, France
    Faculty of Sciences of Tunis, University of Tunis El Manar, Tunis 1068, Tunisia)

  • Khadija Bousselmi

    (LISTIC Laboratory, University of Savoie Mont Blanc, France Annecy-Chambéry, 74940 Chambéry, France)

  • Mohamed Mohsen Gammoudi

    (RIADI Laboratory, University of Manouba, Mannouba 2010, Tunisia
    Higher Institute of Arts and Multimedia Manouba, University of Manouba, Manouba 2010, Tunisia)

  • Sébastien Monnet

    (LISTIC Laboratory, University of Savoie Mont Blanc, France Annecy-Chambéry, 74940 Chambéry, France)

  • Slimane Hammoudi

    (ERIS, ESEO-Grande Ecole d’Ingénieurs Généralistes, 49100 Angers, France)

Abstract

The extract, transform, and load (ETL) process is at the core of data warehousing architectures. As such, the success of data warehouse (DW) projects is essentially based on the proper modeling of the ETL process. As there is no standard model for the representation and design of this process, several researchers have made efforts to propose modeling methods based on different formalisms, such as unified modeling language (UML), ontology, model-driven architecture (MDA), model-driven development (MDD), and graphical flow, which includes business process model notation (BPMN), colored Petri nets (CPN), Yet Another Workflow Language (YAWL), CommonCube, entity modeling diagram (EMD), and so on. With the emergence of Big Data, despite the multitude of relevant approaches proposed for modeling the ETL process in classical environments, part of the community has been motivated to provide new data warehousing methods that support Big Data specifications. In this paper, we present a summary of relevant works related to the modeling of data warehousing approaches, from classical ETL processes to ELT design approaches. A systematic literature review is conducted and a detailed set of comparison criteria are defined in order to allow the reader to better understand the evolution of these processes. Our study paints a complete picture of ETL modeling approaches, from their advent to the era of Big Data, while comparing their main characteristics. This study allows for the identification of the main challenges and issues related to the design of Big Data warehousing systems, mainly involving the lack of a generic design model for data collection, storage, processing, querying, and analysis.

Suggested Citation

  • Asma Dhaouadi & Khadija Bousselmi & Mohamed Mohsen Gammoudi & Sébastien Monnet & Slimane Hammoudi, 2022. "Data Warehousing Process Modeling from Classical Approaches to New Trends: Main Features and Comparisons," Data, MDPI, vol. 7(8), pages 1-38, August.
  • Handle: RePEc:gam:jdataj:v:7:y:2022:i:8:p:113-:d:887021
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/7/8/113/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/7/8/113/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Dimitrios Skoutas & Alkis Simitsis, 2007. "Ontology-Based Conceptual Design of ETL Processes for Both Structured and Semi-Structured Data," International Journal on Semantic Web and Information Systems (IJSWIS), IGI Global, vol. 3(4), pages 1-24, October.
    2. Mahfoud Bala & Omar Boussaid & Zaia Alimazighi, 2016. "Extracting-Transforming-Loading Modeling Approach for Big Data Analytics," International Journal of Decision Support System Technology (IJDSST), IGI Global, vol. 8(4), pages 50-69, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jianhui Chen & Ning Zhong & Jianhua Feng, 2017. "Developing a Provenance Warehouse for the Systematic Brain Informatics Study," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(06), pages 1581-1609, November.
    2. Petar Jovanovic & Sergi Nadal & Oscar Romero & Alberto Abelló & Besim Bilalli, 0. "Quarry: A User-centered Big Data Integration Platform," Information Systems Frontiers, Springer, vol. 0, pages 1-25.
    3. Petar Jovanovic & Sergi Nadal & Oscar Romero & Alberto Abelló & Besim Bilalli, 2021. "Quarry: A User-centered Big Data Integration Platform," Information Systems Frontiers, Springer, vol. 23(1), pages 9-33, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:7:y:2022:i:8:p:113-:d:887021. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.