IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v12y2020i8p126-d390384.html
   My bibliography  Save this article

Data Lake Governance: Towards a Systemic and Natural Ecosystem Analogy

Author

Listed:
  • Marzieh Derakhshannia

    (LIRMM, Univ. Montpellier, CNRS, 34090 Montpellier, France)

  • Carmen Gervet

    (Espace Dev, Univ. Montpellier, IRD, Univ Guyane, Univ. Réunion, 34293 Montpellier, France)

  • Hicham Hajj-Hassan

    (CNRS-L, Beirut P.O. Box 11-8281, Lebanon)

  • Anne Laurent

    (LIRMM, Univ. Montpellier, CNRS, 34090 Montpellier, France)

  • Arnaud Martin

    (CEFE, Univ. Montpellier, CNRS, 34293 Montpellier, France)

Abstract

The realm of big data has brought new venues for knowledge acquisition, but also major challenges including data interoperability and effective management. The great volume of miscellaneous data renders the generation of new knowledge a complex data analysis process. Presently, big data technologies provide multiple solutions and tools towards the semantic analysis of heterogeneous data, including their accessibility and reusability. However, in addition to learning from data, we are faced with the issue of data storage and management in a cost-effective and reliable manner. This is the core topic of this paper. A data lake, inspired by the natural lake, is a centralized data repository that stores all kinds of data in any format and structure. This allows any type of data to be ingested into the data lake without any restriction or normalization. This could lead to a critical problem known as data swamp, which can contain invalid or incoherent data that adds no values for further knowledge acquisition. To deal with the potential avalanche of data, some legislation is required to turn such heterogeneous datasets into manageable data. In this article, we address this problem and propose some solutions concerning innovative methods, derived from a multidisciplinary science perspective to manage data lake. The proposed methods imitate the supply chain management and natural lake principles with an emphasis on the importance of the data life cycle, to implement responsible data governance for the data lake.

Suggested Citation

  • Marzieh Derakhshannia & Carmen Gervet & Hicham Hajj-Hassan & Anne Laurent & Arnaud Martin, 2020. "Data Lake Governance: Towards a Systemic and Natural Ecosystem Analogy," Future Internet, MDPI, vol. 12(8), pages 1-16, July.
  • Handle: RePEc:gam:jftint:v:12:y:2020:i:8:p:126-:d:390384
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/12/8/126/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/12/8/126/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sawik, Tadeusz, 2015. "On the fair optimization of cost and customer service level in a supply chain under disruption risks," Omega, Elsevier, vol. 53(C), pages 58-66.
    2. Beamon, Benita M., 1998. "Supply chain design and analysis:: Models and methods," International Journal of Production Economics, Elsevier, vol. 55(3), pages 281-294, August.
    3. Majid Al-Ruithe & Elhadj Benkhelifa & Khawar Hameed, 2018. "Data Governance Taxonomy: Cloud versus Non-Cloud," Sustainability, MDPI, vol. 10(1), pages 1-26, January.
    4. van den Broek, Tijs & van Veenstra, Anne Fleur, 2018. "Governance of big data collaborations: How to balance regulatory compliance and disruptive innovation," Technological Forecasting and Social Change, Elsevier, vol. 129(C), pages 330-338.
    5. Hagelaar, Geoffrey J.L.F. & van der Vorst, Jack G.A.J., 2001. "Environmental Supply Chain Management: Using Life Cycle Assessment To Structure Supply Chains," International Food and Agribusiness Management Review, International Food and Agribusiness Management Association, vol. 4(4), pages 1-14.
    6. Azaron, A. & Brown, K.N. & Tarim, S.A. & Modarres, M., 2008. "A multi-objective stochastic programming approach for supply chain design considering risk," International Journal of Production Economics, Elsevier, vol. 116(1), pages 129-138, November.
    7. Josip Mesaric & Dario Sebalj & Jelena Franjkovic, 2016. "Supply Chains In The Context Of Life Cycle Assessment And Sustainability," Business Logistics in Modern Management, Josip Juraj Strossmayer University of Osijek, Faculty of Economics, Croatia, vol. 16, pages 53-70.
    8. Riggins, Frederick J. & Klamm, Bonnie K., 2017. "Data governance case at KrauseMcMahon LLP in an era of self-service BI and Big Data," Journal of Accounting Education, Elsevier, vol. 38(C), pages 23-36.
    9. Delfmann, Werner & Albers, Sascha, 2000. "Supply chain management in the global context," Working Paper Series 102, University of Cologne, Department of Business Policy and Logistics.
    10. B Ritchie & C Brindley, 2007. "An emergent framework for supply chain risk management and performance measurement," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 58(11), pages 1398-1411, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Seyyed Mohammad Seyyed Alizadeh Ganji & Mohammad Hayati, 2016. "Identifying and Assessing the Risks in the Supply Chain," Modern Applied Science, Canadian Center of Science and Education, vol. 10(6), pages 1-74, June.
    2. Heckmann, Iris & Comes, Tina & Nickel, Stefan, 2015. "A critical review on supply chain risk – Definition, measure and modeling," Omega, Elsevier, vol. 52(C), pages 119-132.
    3. Anassaya Chawviang & Supaporn Kiattisin, 2022. "Sustainable Development: Smart Co-Operative Management Framework," Sustainability, MDPI, vol. 14(6), pages 1-25, March.
    4. Ivanov, Dmitry & Pavlov, Alexander & Pavlov, Dmitry & Sokolov, Boris, 2017. "Minimization of disruption-related return flows in the supply chain," International Journal of Production Economics, Elsevier, vol. 183(PB), pages 503-513.
    5. Soysal, Mehmet & Bloemhof-Ruwaard, Jacqueline.M. & Meuwissen, Miranda P.M. & van der Vorst, Jack G.A.J., 2012. "A Review on Quantitative Models for Sustainable Food Logistics Management," International Journal on Food System Dynamics, International Center for Management, Communication, and Research, vol. 3(2), pages 1-20, December.
    6. Adam Firman Rizki & Yvonne Augustine, 2022. "Green supply chain management practices: direct effects sustainability performance," Technium Social Sciences Journal, Technium Science, vol. 28(1), pages 389-407, February.
    7. Umair Waqas & Azmawani Abd Rahman & Normaz Wana Ismail & Norazlyn Kamal Basha & Sonia Umair, 2023. "Influence of supply chain risk management and its mediating role on supply chain performance: perspectives from an agri-fresh produce," Annals of Operations Research, Springer, vol. 324(1), pages 1399-1427, May.
    8. Jahani, Hamed & Abbasi, Babak & Sheu, Jiuh-Biing & Klibi, Walid, 2024. "Supply chain network design with financial considerations: A comprehensive review," European Journal of Operational Research, Elsevier, vol. 312(3), pages 799-839.
    9. Tang, Christopher S. & Davarzani, Hoda & Sarkis, Joseph, 2015. "Quantitative models for managing supply chain risks: A reviewAuthor-Name: Fahimnia, Behnam," European Journal of Operational Research, Elsevier, vol. 247(1), pages 1-15.
    10. Morteza Lalmazloumian & Kuan Yew Wong & Kannan Govindan & Devika Kannan, 2016. "A robust optimization model for agile and build-to-order supply chain planning under uncertainties," Annals of Operations Research, Springer, vol. 240(2), pages 435-470, May.
    11. Dmitry Ivanov & Alexandre Dolgui & Boris Sokolov & Marina Ivanova, 2017. "Literature review on disruption recovery in the supply chain," International Journal of Production Research, Taylor & Francis Journals, vol. 55(20), pages 6158-6174, October.
    12. Reza Ramezanian & Sadjad Khalesi, 2021. "Integration of multi-product supply chain network design and assembly line balancing," Operational Research, Springer, vol. 21(1), pages 453-483, March.
    13. Ogulin, R. & Selen, W. & Ashayeri, J., 2010. "Determinants of Informal Coordination in Networked Supply Chains," Discussion Paper 2010-133, Tilburg University, Center for Economic Research.
    14. Rich, Karl M. & Ross, R. Brent & Baker, A. Derek & Negassa, Asfaw, 2011. "Quantifying value chain analysis in the context of livestock systems in developing countries," Food Policy, Elsevier, vol. 36(2), pages 214-222, April.
    15. Qazi, Abroon & Dickson, Alex & Quigley, John & Gaudenzi, Barbara, 2018. "Supply chain risk network management: A Bayesian belief network and expected utility based approach for managing supply chain risks," International Journal of Production Economics, Elsevier, vol. 196(C), pages 24-42.
    16. repec:mth:ijmis8:v:4:y:2019:i:1:p:1-19 is not listed on IDEAS
    17. Tang, Lianhua & Li, Yantong & Bai, Danyu & Liu, Tao & Coelho, Leandro C., 2022. "Bi-objective optimization for a multi-period COVID-19 vaccination planning problem," Omega, Elsevier, vol. 110(C).
    18. García Cáceres, Rafael Guillermo & Aráoz Durand, Julián Arturo & Gómez, Fernando Palacios, 2009. "Integral analysis method - IAM," European Journal of Operational Research, Elsevier, vol. 192(3), pages 891-903, February.
    19. repec:tkp:ijsrsy:v:2:y:2012:i:2:p:73-91 is not listed on IDEAS
    20. Monideepa Tarafdar & Sufian Qrunfleh, 2017. "Agile supply chain strategy and supply chain performance: complementary roles of supply chain practices and information systems capability for agility," International Journal of Production Research, Taylor & Francis Journals, vol. 55(4), pages 925-938, February.
    21. Lin Hu & Qinghai Chen & Tingting Yang & Chuanjian Yi & Jing Chen, 2024. "Visualization and Analysis of Hotspots and Trends in Seafood Cold Chain Logistics Based on CiteSpace, VOSviewer, and RStudio Bibliometrix," Sustainability, MDPI, vol. 16(15), pages 1-22, July.
    22. Hrabec, Dušan & Hvattum, Lars Magnus & Hoff, Arild, 2022. "The value of integrated planning for production, inventory, and routing decisions: A systematic review and meta-analysis," International Journal of Production Economics, Elsevier, vol. 248(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:12:y:2020:i:8:p:126-:d:390384. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.