Scopus Export 2015-2019

Leveraging Dependency In Scheduling And Preemption For High Throughput In Data-Parallel Clusters

Keywords

Preemption; Priority; Scheduling; Task dependency

Abstract

Task scheduling and preemption are two important functions in data-parallel clusters. Though directed acyclic graph task dependencies are common in data-parallel clusters, previous task scheduling and preemption methods do not fully utilize such task dependency to increase throughput since they simply schedule precedent tasks prior to their dependent tasks or neglect the dependency. We notice that in both scheduling and preemption, choosing a task with more dependent tasks to run allows more tasks to be runnable next, which facilitates to select a task that can more increase throughput. Accordingly, in this paper, we propose a Dependency-aware Scheduling and Preemption system (DSP) to achieve high throughput. First, we build an integer linear programming model to minimize the makespan (i.e., the time when all jobs finish execution) with the consideration of task dependency and deadline, and derive the target server and start time for each task, which can minimize the makespan. Second, we utilize task dependency to determine tasks' priorities for preemption. Finally, we propose a method to reduce the number of unnecessary preemptions that cause more overhead than the throughput gain. Extensive experimental results based on a real cluster and Amazon EC2 cloud service show that DSP achieves much higher throughput compared to existing strategies.

Publication Date

10-29-2018

Publication Title

Proceedings - IEEE International Conference on Cluster Computing, ICCC

Volume

2018-September

Number of Pages

359-369

Document Type

Article; Proceedings Paper

Personal Identifier

scopus

DOI Link

https://doi.org/10.1109/CLUSTER.2018.00054

Copyright Status

Unknown

Socpus ID

85057292901 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/85057292901

STARS Citation

Liu, Jinwei; Shen, Haiying; Sarker, Ankur; and Chung, Wingyan, "Leveraging Dependency In Scheduling And Preemption For High Throughput In Data-Parallel Clusters" (2018). Scopus Export 2015-2019. 10052.
https://stars.library.ucf.edu/scopus2015/10052

This document is currently not available here.

COinS

Scopus Export 2015-2019

Leveraging Dependency In Scheduling And Preemption For High Throughput In Data-Parallel Clusters

Keywords

Abstract

Publication Date

Publication Title

Volume

Number of Pages

Document Type

Personal Identifier

DOI Link

Copyright Status

Socpus ID

Source API URL

STARS Citation

Explore

Connect

Scopus Export 2015-2019

Leveraging Dependency In Scheduling And Preemption For High Throughput In Data-Parallel Clusters

Creator

Keywords

Abstract

Publication Date

Publication Title

Volume

Number of Pages

Document Type

Personal Identifier

DOI Link

Copyright Status

Socpus ID

Source API URL

STARS Citation

Share

Explore

Connect