IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i5p1275-d1089376.html
   My bibliography  Save this article

JQPro:Join Query Processing in a Distributed System for Big RDF Data Using the Hash-Merge Join Technique

Author

Listed:
  • Nahla Mohammed Elzein

    (Faculty of Computer Science, Future University, Khartoum 10553, Sudan)

  • Mazlina Abdul Majid

    (Faculty of Computing, University Malaysia Pahang, Pekan 26600, Malaysia)

  • Ibrahim Abaker Targio Hashem

    (Department of Computer Science, College of Computing and Informatics, University of Sharjah, Sharjah 27272, United Arab Emirates)

  • Ashraf Osman Ibrahim

    (Data Science Programme, Faculty of Computing and Informatics, Universiti Malaysia Sabah, Kota Kinabalu 88400, Malaysia
    Advanced Machine Intelligence Research Group, Universiti Malaysia Sabah, Kota Kinabalu 88400, Malaysia)

  • Anas W. Abulfaraj

    (Department of Information Systems, King Abdulaziz University, P.O. Box 344, Rabigh 21911, Saudi Arabia)

  • Faisal Binzagr

    (Department of Computer Science, King Abdulaziz University, P.O. Box 344, Rabigh 21911, Saudi Arabia)

Abstract

In the last decade, the volume of semantic data has increased exponentially, with the number of Resource Description Framework (RDF) datasets exceeding trillions of triples in RDF repositories. Hence, the size of RDF datasets continues to grow. However, with the increasing number of RDF triples, complex multiple RDF queries are becoming a significant demand. Sometimes, such complex queries produce many common sub-expressions in a single query or over multiple queries running as a batch. In addition, it is also difficult to minimize the number of RDF queries and processing time for a large amount of related data in a typical distributed environment encounter. To address this complication, we introduce a join query processing model for big RDF data, called JQPro. By adopting a MapReduce framework in JQPro, we developed three new algorithms, which are hash-join, sort-merge, and enhanced MapReduce-join for join query processing of RDF data. Based on an experiment conducted, the result showed that the JQPro model outperformed the two popular algorithms, gStore and RDF-3X, with respect to the average execution time. Furthermore, the JQPro model was also tested against RDF-3X, RDFox, and PARJs using the LUBM benchmark. The result showed that the JQPro model had better performance in comparison with the other models. In conclusion, the findings showed that JQPro achieved improved performance with 87.77% in terms of execution time. Hence, in comparison with the selected models, JQPro performs better.

Suggested Citation

  • Nahla Mohammed Elzein & Mazlina Abdul Majid & Ibrahim Abaker Targio Hashem & Ashraf Osman Ibrahim & Anas W. Abulfaraj & Faisal Binzagr, 2023. "JQPro:Join Query Processing in a Distributed System for Big RDF Data Using the Hash-Merge Join Technique," Mathematics, MDPI, vol. 11(5), pages 1-20, March.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:5:p:1275-:d:1089376
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/5/1275/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/5/1275/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Felwa Abukhodair & Wafaa Alsaggaf & Amani Tariq Jamal & Sayed Abdel-Khalek & Romany F. Mansour, 2021. "An Intelligent Metaheuristic Binary Pigeon Optimization-Based Feature Selection and Big Data Classification in a MapReduce Environment," Mathematics, MDPI, vol. 9(20), pages 1-14, October.
    2. Elham Azhir & Mehdi Hosseinzadeh & Faheem Khan & Amir Mosavi, 2022. "Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark," Mathematics, MDPI, vol. 10(19), pages 1-11, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.

      Corrections

      All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:5:p:1275-:d:1089376. See general information about how to correct material in RePEc.

      If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

      If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

      If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

      For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

      Please note that corrections may take a couple of weeks to filter through the various RePEc services.

      IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.