Scheduling Distributed Clusters of Parallel Machines: Primal-Dual and LP-based Approximation Algorithms [Full Version]

Murray, Riley; Khuller, Samir; Chao, Megan

doi:10.4230/LIPIcs.ESA.2016.68

Computer Science > Data Structures and Algorithms

arXiv:1610.09058 (cs)

[Submitted on 28 Oct 2016]

Title:Scheduling Distributed Clusters of Parallel Machines: Primal-Dual and LP-based Approximation Algorithms [Full Version]

Authors:Riley Murray, Samir Khuller, Megan Chao

View PDF

Abstract:The Map-Reduce computing framework rose to prominence with datasets of such size that dozens of machines on a single cluster were needed for individual jobs. As datasets approach the exabyte scale, a single job may need distributed processing not only on multiple machines, but on multiple clusters. We consider a scheduling problem to minimize weighted average completion time of N jobs on M distributed clusters of parallel machines. In keeping with the scale of the problems motivating this work, we assume that (1) each job is divided into M "subjobs" and (2) distinct subjobs of a given job may be processed concurrently.
When each cluster is a single machine, this is the NP-Hard concurrent open shop problem. A clear limitation of such a model is that a serial processing assumption sidesteps the issue of how different tasks of a given subjob might be processed in parallel. Our algorithms explicitly model clusters as pools of resources and effectively overcome this issue.
Under a variety of parameter settings, we develop two constant factor approximation algorithms for this problem. The first algorithm uses an LP relaxation tailored to this problem from prior work. This LP-based algorithm provides strong performance guarantees. Our second algorithm exploits a surprisingly simple mapping to the special case of one machine per cluster. This mapping-based algorithm is combinatorial and extremely fast. These are the first constant factor approximations for this problem.

Comments:	A shorter version of this paper (one that omitted several proofs) appeared in the proceedings of the 2016 European Symposium on Algorithms
Subjects:	Data Structures and Algorithms (cs.DS)
ACM classes:	F.2.2
Cite as:	arXiv:1610.09058 [cs.DS]
	(or arXiv:1610.09058v1 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.1610.09058
Journal reference:	Leibniz International Proceedings in Informatics (LIPIcs), Volume 58, 2016, pages 68:1--68:17
Related DOI:	https://doi.org/10.4230/LIPIcs.ESA.2016.68

Submission history

From: Riley Murray [view email]
[v1] Fri, 28 Oct 2016 02:14:25 UTC (685 KB)

Computer Science > Data Structures and Algorithms

Title:Scheduling Distributed Clusters of Parallel Machines: Primal-Dual and LP-based Approximation Algorithms [Full Version]

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Data Structures and Algorithms

Title:Scheduling Distributed Clusters of Parallel Machines: Primal-Dual and LP-based Approximation Algorithms [Full Version]

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators