IDEAS home Printed from https://ideas.repec.org/a/igg/jirr00/v6y2016i4p51-73.html
   My bibliography  Save this article

A New Algorithm of Grouping Cockroaches Classifier (GCC) for Textual Plagiarism Detection

Author

Listed:
  • Hadj Ahmed Bouarara

    (Hadj Ahmed Bouarara, GeCode Laboratory, Department of Computer Science, Tahar Moulay University of Saida, Saida, Algeria)

  • Reda Mohamed Hamou

    (GeCode Laboratory, Department of Computer Science, Tahar Moulay University of Saida, Saida, Algeria)

Abstract

In the last decade with the new technology, it is important to allow users to access information freely, while at the same time, restrict them from illegal copying and distribution of information. In the age of information technologies plagiarism has become a topical subject in the digital world and turned into a serious problem. The author's work deals with the development of a new system for combating this phenomenon using a new insect behaviour algorithm called Groping cockroaches classifier GCC. Each suspicious text (cockroach) will be classified (hidden) in a class (shelter) that can be plagiarism or no-plagiarism, using a security function that is based on the attractiveness of each class (calculated using the aggregation operators (shelter darkness, congeners attraction and security quality)) and the displacement probability (calculated using the naive Bayes algorithm). The experimental results performed on the Pan 09 dataset and using the validation measures (recall, precision, f-measure, and entropy), have demonstrated that GCC has clear advantages over others plagiarism detection techniques existed in literature. Finally, a set of service was added in order to detect the different cases of plagiarism such as plagiarism with translation, plagiarism of idea, plagiarism with synonymy, and plagiarism paraphrase.

Suggested Citation

  • Hadj Ahmed Bouarara & Reda Mohamed Hamou, 2016. "A New Algorithm of Grouping Cockroaches Classifier (GCC) for Textual Plagiarism Detection," International Journal of Information Retrieval Research (IJIRR), IGI Global, vol. 6(4), pages 51-73, October.
  • Handle: RePEc:igg:jirr00:v:6:y:2016:i:4:p:51-73
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJIRR.2016100104
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jirr00:v:6:y:2016:i:4:p:51-73. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.