Leveraging Dependency In Scheduling And Preemption For High Throughput In Data-Parallel Clusters
Keywords
Preemption; Priority; Scheduling; Task dependency
Abstract
Task scheduling and preemption are two important functions in data-parallel clusters. Though directed acyclic graph task dependencies are common in data-parallel clusters, previous task scheduling and preemption methods do not fully utilize such task dependency to increase throughput since they simply schedule precedent tasks prior to their dependent tasks or neglect the dependency. We notice that in both scheduling and preemption, choosing a task with more dependent tasks to run allows more tasks to be runnable next, which facilitates to select a task that can more increase throughput. Accordingly, in this paper, we propose a Dependency-aware Scheduling and Preemption system (DSP) to achieve high throughput. First, we build an integer linear programming model to minimize the makespan (i.e., the time when all jobs finish execution) with the consideration of task dependency and deadline, and derive the target server and start time for each task, which can minimize the makespan. Second, we utilize task dependency to determine tasks' priorities for preemption. Finally, we propose a method to reduce the number of unnecessary preemptions that cause more overhead than the throughput gain. Extensive experimental results based on a real cluster and Amazon EC2 cloud service show that DSP achieves much higher throughput compared to existing strategies.
Publication Date
10-29-2018
Publication Title
Proceedings - IEEE International Conference on Cluster Computing, ICCC
Volume
2018-September
Number of Pages
359-369
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/CLUSTER.2018.00054
Copyright Status
Unknown
Socpus ID
85057292901 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/85057292901
STARS Citation
Liu, Jinwei; Shen, Haiying; Sarker, Ankur; and Chung, Wingyan, "Leveraging Dependency In Scheduling And Preemption For High Throughput In Data-Parallel Clusters" (2018). Scopus Export 2015-2019. 10052.
https://stars.library.ucf.edu/scopus2015/10052