Joint scheduling of MapReduce jobs with servers : performance bounds and experiments

Ling, X; Yuan, Y; Wang, D; Liu, J; Yang, J

doi:10.1016/j.jpdc.2016.02.002

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/61154

DC Field	Value	Language
dc.contributor	Department of Computing	-
dc.creator	Ling, X	-
dc.creator	Yuan, Y	-
dc.creator	Wang, D	-
dc.creator	Liu, J	-
dc.creator	Yang, J	-
dc.date.accessioned	2016-12-19T08:54:58Z	-
dc.date.available	2016-12-19T08:54:58Z	-
dc.identifier.issn	0743-7315	en_US
dc.identifier.uri	http://hdl.handle.net/10397/61154	-
dc.language.iso	en	en_US
dc.publisher	Academic Press	en_US
dc.rights	© 2016 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).	en_US
dc.rights	The following publication Ling, X., Yuan, Y., Wang, D., Liu, J., & Yang, J. (2016). Joint scheduling of mapreduce jobs with servers: Performance bounds and experiments. Journal of Parallel and Distributed Computing, 90, 52-66 is available at https://doi.org/10.1016/j.jpdc.2016.02.002	en_US
dc.subject	Fast heuristic	en_US
dc.subject	MapReduce	en_US
dc.subject	NP-complete	en_US
dc.subject	Scheduling	en_US
dc.subject	Server assignment	en_US
dc.title	Joint scheduling of MapReduce jobs with servers : performance bounds and experiments	en_US
dc.type	Journal/Magazine Article	en_US
dc.identifier.spage	52	en_US
dc.identifier.epage	66	en_US
dc.identifier.volume	90-91	en_US
dc.identifier.doi	10.1016/j.jpdc.2016.02.002	en_US
dcterms.abstract	MapReduce-like frameworks have achieved tremendous success for large-scale data processing in data centers. A key feature distinguishing MapReduce from previous parallel models is that it interleaves parallel and sequential computation. Past schemes, and especially their theoretical bounds, on general parallel models are therefore, unlikely to be applied to MapReduce directly. There are many recent studies on MapReduce job and task scheduling. These studies assume that the servers are assigned in advance. In current data centers, multiple MapReduce jobs of different importance levels run together. In this paper, we investigate a schedule problem for MapReduce taking server assignment into consideration as well. We formulate a MapReduce server-job organizer problem (MSJO) and show that it is NP-complete. We develop a 3-approximation algorithm and a fast heuristic design. Moreover, we further propose a novel fine-grained practical algorithm for general MapReduce-like task scheduling problem. Finally, we evaluate our algorithms through both simulations and experiments on Amazon EC2 with an implementation with Hadoop. The results confirm the superiority of our algorithms.	-
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	Journal of parallel and distributed computing, 2016, v. 90-91, p. 52-66	-
dcterms.isPartOf	Journal of parallel and distributed computing	-
dcterms.issued	2016	-
dc.identifier.isi	WOS:000374627000005	-
dc.identifier.scopus	2-s2.0-84961784554	-
dc.description.validate	201901_a bcma	en_US
dc.description.oa	Version of Record	en_US
dc.identifier.FolderNumber	OA_IR/PIRA	en_US
dc.description.pubStatus	Published	en_US
dc.description.oaCategory	CC	en_US
Appears in Collections:	Journal/Magazine Article

Files in This Item:

File	Description	Size	Format
Ling_Joint_scheduling_MapReduce.pdf		1.19 MB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show simple item record

Page views

316

Last Week
15

Last month

Citations as of Apr 12, 2026

Downloads

265

Citations as of Apr 12, 2026

SCOPUS^TM
Citations

18

Last Week
0

Last month
0

Citations as of May 8, 2026

WEB OF SCIENCE^TM
Citations

11

Last Week
0

Last month
1

Citations as of Apr 23, 2026

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Page views

Downloads

SCOPUSTM Citations

WEB OF SCIENCETM Citations

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations

Google Scholar^TM