IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0177567.html
   My bibliography  Save this article

An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems

Author

Listed:
  • Hajara Idris
  • Absalom E Ezugwu
  • Sahalu B Junaidu
  • Aderemi O Adewumi

Abstract

The Grid scheduler, schedules user jobs on the best available resource in terms of resource characteristics by optimizing job execution time. Resource failure in Grid is no longer an exception but a regular occurring event as resources are increasingly being used by the scientific community to solve computationally intensive problems which typically run for days or even months. It is therefore absolutely essential that these long-running applications are able to tolerate failures and avoid re-computations from scratch after resource failure has occurred, to satisfy the user’s Quality of Service (QoS) requirement. Job Scheduling with Fault Tolerance in Grid Computing using Ant Colony Optimization is proposed to ensure that jobs are executed successfully even when resource failure has occurred. The technique employed in this paper, is the use of resource failure rate, as well as checkpoint-based roll back recovery strategy. Check-pointing aims at reducing the amount of work that is lost upon failure of the system by immediately saving the state of the system. A comparison of the proposed approach with an existing Ant Colony Optimization (ACO) algorithm is discussed. The experimental results of the implemented Fault Tolerance scheduling algorithm show that there is an improvement in the user’s QoS requirement over the existing ACO algorithm, which has no fault tolerance integrated in it. The performance evaluation of the two algorithms was measured in terms of the three main scheduling performance metrics: makespan, throughput and average turnaround time.

Suggested Citation

  • Hajara Idris & Absalom E Ezugwu & Sahalu B Junaidu & Aderemi O Adewumi, 2017. "An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-24, May.
  • Handle: RePEc:plo:pone00:0177567
    DOI: 10.1371/journal.pone.0177567
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0177567
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0177567&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0177567?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Absalom E. Ezugwu & Seyed M. Buhari & Sahalu B. Junaidu, 2013. "Virtual Machine Allocation in Cloud Computing Environment," International Journal of Cloud Applications and Computing (IJCAC), IGI Global, vol. 3(2), pages 47-60, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wang Chen & Zhang Xiufeng & Zhao Guohua, 2020. "Research on hot rolling scheduling problem based on Two-phase Pareto algorithm," PLOS ONE, Public Library of Science, vol. 15(12), pages 1-14, December.
    2. Xiaolei Wang & Tiejun Ci & Sang-Bing Tsai & Aijun Liu & Quan Chen, 2018. "An empirical study of collaborative capacity evaluation and scheduling optimization for a CPD project," PLOS ONE, Public Library of Science, vol. 13(8), pages 1-16, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.

      More about this item

      Statistics

      Access and download statistics

      Corrections

      All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0177567. See general information about how to correct material in RePEc.

      If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

      If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

      If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

      For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

      Please note that corrections may take a couple of weeks to filter through the various RePEc services.

      IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.