IDEAS home Printed from https://ideas.repec.org/a/igg/jdsst0/v8y2016i4p34-49.html
   My bibliography  Save this article

De-Identification of Unstructured Textual Data using Artificial Immune System for Privacy Preserving

Author

Listed:
  • Amine Rahmani

    (Department of Computer Science, Dr. Tahar Moulay University of Saida, Saida, Algeria)

  • Abdelmalek Amine

    (Department of Computer Science, Dr. Tahar Moulay University of Saida, Saida, Algeria)

  • Reda Mohamed Hamou

    (Department of Computer Science, Dr. Tahar Moulay University of Saida, Saida, Algeria)

  • Mohamed Amine Boudia

    (Department of Computer Science, Dr. Tahar Moulay University of Saida, Saida, Algeria)

  • Hadj Ahmed Bouarara

    (GeCoDe laboratory, Department of Computer Sciences, Dr. Tahar Moulay University of Saida, Algeria)

Abstract

The development of new technologies has led the world into a tipping point. One of these technologies is the big data which made the revolution of computer sciences. Big data has come with new challenges. These challenges can be resumed in the aim of creating scalable and efficient services that can treat huge amounts of heterogeneous data in small scale of time while preserving users' privacy. Textual data occupy a wide space in internet. These data could contain information that can lead to identify users. For that, the development of such approaches that can detect and remove any identifiable information has become a critical research area known as de-identification. This paper tackle the problem of privacy in textual data. The authors' proposed approach consists of using artificial immune systems and MapReduce to detect and hide identifiable words with no matter on their variants using the personnel information of the user from his profile. After many experiments, the system shows a high efficiency in term of number of detected words, the way they are hided with, and time of execution.

Suggested Citation

  • Amine Rahmani & Abdelmalek Amine & Reda Mohamed Hamou & Mohamed Amine Boudia & Hadj Ahmed Bouarara, 2016. "De-Identification of Unstructured Textual Data using Artificial Immune System for Privacy Preserving," International Journal of Decision Support System Technology (IJDSST), IGI Global, vol. 8(4), pages 34-49, October.
  • Handle: RePEc:igg:jdsst0:v:8:y:2016:i:4:p:34-49
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJDSST.2016100103
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Henriques de Gusmão, Ana Paula & Mendonça Silva, Maisa & Poleto, Thiago & Camara e Silva, Lúcio & Cabral Seixas Costa, Ana Paula, 2018. "Cybersecurity risk analysis model using fault tree analysis and fuzzy decision theory," International Journal of Information Management, Elsevier, vol. 43(C), pages 248-260.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jdsst0:v:8:y:2016:i:4:p:34-49. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.