IDEAS home Printed from https://ideas.repec.org/a/eee/ininma/v41y2018icp50-56.html
   My bibliography  Save this article

Analyzing data quality issues in research information systems via data profiling

Author

Listed:
  • Azeroual, Otmane
  • Saake, Gunter
  • Schallehn, Eike

Abstract

The success or failure of a RIS in a scientific institution is largely related to the quality of the data available as a basis for the RIS applications. The most beautiful Business Intelligence (BI) tools (reporting, etc.) are worthless when displaying incorrect, incomplete, or inconsistent data. An integral part of every RIS is thus the integration of data from the operative systems. Before starting the integration process (ETL) of a source system, a rich analysis of source data is required. With the support of a data quality check, causes of quality problems can usually be detected. Corresponding analyzes are performed with data profiling to provide a good picture of the state of the data. In this paper, methods of data profiling are presented in order to gain an overview of the quality of the data in the source systems before their integration into the RIS. With the help of data profiling, the scientific institutions can not only evaluate their research information and provide information about their quality, but also examine the dependencies and redundancies between data fields and better correct them within their RIS.

Suggested Citation

  • Azeroual, Otmane & Saake, Gunter & Schallehn, Eike, 2018. "Analyzing data quality issues in research information systems via data profiling," International Journal of Information Management, Elsevier, vol. 41(C), pages 50-56.
  • Handle: RePEc:eee:ininma:v:41:y:2018:i:c:p:50-56
    DOI: 10.1016/j.ijinfomgt.2018.02.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0268401218300975
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ijinfomgt.2018.02.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Otmane Azeroual, 2020. "Data Wrangling in Database Systems: Purging of Dirty Data," Data, MDPI, vol. 5(2), pages 1-9, June.
    2. Otmane Azeroual & Joachim Schöpfel & Dragan Ivanovic, 2020. "Influence of Information Quality via Implemented German RCD Standard in Research Information Systems," Data, MDPI, vol. 5(2), pages 1-10, March.
    3. Han Meng & Xiaoyu Qi & Gang Mei, 2024. "A Deep Learning Approach for Stochastic Structural Plane Generation Based on Denoising Diffusion Probabilistic Models," Mathematics, MDPI, vol. 12(13), pages 1-22, June.
    4. Otmane Azeroual & Gunter Saake & Mohammad Abuosba & Joachim Schöpfel, 2020. "Data Quality as a Critical Success Factor for User Acceptance of Research Information Systems," Data, MDPI, vol. 5(2), pages 1-13, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ininma:v:41:y:2018:i:c:p:50-56. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/international-journal-of-information-management .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.