
A Framework for Current and New Data Quality Dimensions: An Overview

Author

Listed:
  • Russell Miller

    (Informatics, Data Science Department, National Physical Laboratory, Glasgow G1 1RD, UK)

  • Harvey Whelan

    (Informatics, Data Science Department, National Physical Laboratory, Glasgow G1 1RD, UK;
    Department of Natural Sciences, University of Bath, Bath BA2 7AX, UK)

  • Michael Chrubasik

    (Informatics, Data Science Department, National Physical Laboratory, Glasgow G1 1RD, UK)

  • David Whittaker

    (Informatics, Data Science Department, National Physical Laboratory, Glasgow G1 1RD, UK)

  • Paul Duncan

    (Informatics, Data Science Department, National Physical Laboratory, Glasgow G1 1RD, UK)

  • João Gregório

    (Informatics, Data Science Department, National Physical Laboratory, Glasgow G1 1RD, UK)

Abstract

This paper presents a comprehensive exploration of data quality terminology, revealing a significant lack of standardisation in the field. The goal of this work was to conduct a comparative analysis of data quality terminology across different domains and to structure it into a hierarchical data model. We propose a novel approach for aggregating the disparate terms used to describe the multiple facets of data quality under common umbrella terms, with a focus on the ISO 25012 standard. We introduce four additional data quality dimensions: governance, usefulness, quantity, and semantics. These dimensions enhance specificity, complement the framework established by the ISO 25012 standard, and contribute to a broad understanding of data quality aspects. Because digital systems are prevalent across a multitude of domains, the ISO 25012 standard, a general standard for managing data quality in information systems, offers a foundation for the development of our proposed Data Quality Data Model. In contrast, frameworks such as ALCOA+, which were originally developed for specific regulated industries, can be applied more broadly but may not always be generalisable. Ultimately, the model we propose aggregates and classifies data quality terminology, facilitating communication about data quality between domains when collaboration is required to tackle cross-domain projects or challenges. By establishing this hierarchical model, we aim to improve the understanding and implementation of data quality practices, thereby addressing critical issues in various domains.

Suggested Citation

  • Russell Miller & Harvey Whelan & Michael Chrubasik & David Whittaker & Paul Duncan & João Gregório, 2024. "A Framework for Current and New Data Quality Dimensions: An Overview," Data, MDPI, vol. 9(12), pages 1-26, December.
  • Handle: RePEc:gam:jdataj:v:9:y:2024:i:12:p:151-:d:1546270

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/9/12/151/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/9/12/151/
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christina-Ioanna Papadopoulou & Efstratios Loizou & Fotios Chatzitheodoridis & Anastasios Michailidis & Christos Karelakis & Yannis Fallas & Aikaterini Paltaki, 2023. "What Makes Farmers Aware in Adopting Circular Bioeconomy Practices? Evidence from a Greek Rural Region," Land, MDPI, vol. 12(4), pages 1-17, April.
    2. David Naranjo-Gil & María Jesús Sánchez-Expósito & Laura Gómez-Ruiz, 2016. "Traditional vs. Contemporary Management Control Practices for Developing Public Health Policies," IJERPH, MDPI, vol. 13(7), pages 1-13, July.
    3. Syed Mustafa Ali & Farah Naureen & Arif Noor & Maged N. Kamel Boulos & Javariya Aamir & Muhammad Ishaq & Naveed Anjum & John Ainsworth & Aamna Rashid & Arman Majidulla & Irum Fatima, 2018. "Data Quality: A Negotiator between Paper-Based and Digital Records in Pakistan’s TB Control Program," Data, MDPI, vol. 3(3), pages 1-16, July.
    4. Agatha Ravi Vidiasratri & Lisdrianto Hanindriyo & Caroline Manuela Hartanto, 2024. "Charting the Future of Oral Health: A Bibliometric Exploration of Quality-of-Life Research in Dentistry," IJERPH, MDPI, vol. 21(3), pages 1-15, February.
