Have a personal or library account? Click to login
A Hybrid Scheduler for Many Task Computing in Big Data Systems Cover

Abstract

With the rapid evolution of the distributed computing world in the last few years, the amount of data created and processed has fast increased to petabytes or even exabytes scale. Such huge data sets need data-intensive computing applications and impose performance requirements to the infrastructures that support them, such as high scalability, storage, fault tolerance but also efficient scheduling algorithms. This paper focuses on providing a hybrid scheduling algorithm for many task computing that addresses big data environments with few penalties, taking into consideration the deadlines and satisfying a data dependent task model. The hybrid solution consists of several heuristics and algorithms (min-min, min-max and earliest deadline first) combined in order to provide a scheduling algorithm that matches our problem. The experimental results are conducted by simulation and prove that the proposed hybrid algorithm behaves very well in terms of meeting deadlines.

DOI: https://doi.org/10.1515/amcs-2017-0027 | Journal eISSN: 2083-8492 | Journal ISSN: 1641-876X
Language: English
Page range: 385 - 399
Submitted on: Nov 26, 2016
|
Accepted on: Mar 20, 2017
|
Published on: Jul 8, 2017
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2017 Laura Vasiliu, Florin Pop, Catalin Negru, Mariana Mocanu, Valentin Cristea, Joanna Kolodziej, published by University of Zielona Góra
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.