Joint scheduling of tasks and network flows in big data clusters

Yang, L; Liu, XX; Cao, JN; Wang, ZY

doi:10.1109/ACCESS.2018.2878864

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/80445

DC Field	Value	Language
dc.contributor	Department of Computing	-
dc.creator	Yang, L	-
dc.creator	Liu, XX	-
dc.creator	Cao, JN	-
dc.creator	Wang, ZY	-
dc.date.accessioned	2019-03-26T09:17:13Z	-
dc.date.available	2019-03-26T09:17:13Z	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	http://hdl.handle.net/10397/80445	-
dc.language.iso	en	en_US
dc.publisher	Institute of Electrical and Electronics Engineers	en_US
dc.rights	© 2018 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.	en_US
dc.rights	Post with permission of the publisher.	en_US
dc.rights	The following publication Yang, L., Liu, X. X., Cao, J. N., & Wang, Z. Y. (2018). Joint scheduling of tasks and network flows in big data clusters. IEEE Access, 6, 66600-66611 is available at https://dx.doi.org/10.1109/ACCESS.2018.2878864	en_US
dc.subject	Task scheduling	en_US
dc.subject	Flow scheduling	en_US
dc.subject	Data centers	en_US
dc.subject	Software defined networks	en_US
dc.title	Joint scheduling of tasks and network flows in big data clusters	en_US
dc.type	Journal/Magazine Article	en_US
dc.identifier.spage	66600	-
dc.identifier.epage	66611	-
dc.identifier.volume	6	-
dc.identifier.doi	10.1109/ACCESS.2018.2878864	-
dcterms.abstract	As an increasing number of big data processing platforms like Hadoop, Spark, and Storm appear and normally share the resources in the data center, it has been important and challenging to schedule various jobs from these platforms onto the underlying data center resources such that the overall job completion time is minimized. To solve the problem, the existing work either focus on the task-level scheduling techniques, such as Quincy and delay scheduling, or focus on the network flow scheduling techniques, such as D3 and preemptive distributed quick. These works deal with the scheduling of tasks and network flows separately and cannot achieve optimal performance. The reason is that the task scheduling without regard of the available network bandwidths may generate the task placement that causes serious network congestions and thus leads to long data transmission time. In this paper, we propose the joint scheduling technique by coordinating the task placement and the scheduling of network flows arising from these tasks. We develop a software-defined network (SDN)-based online scheduling framework which selects the task placement based on the available bandwidth on the SDN switches and at meanwhile optimally allocates the bandwidth to each data flow. Comprehensive trace-driven simulations show that the joint scheduling technique can take full use of the network bandwidth and thus reduce the job completion time by 55% on average compared with the benchmark methods.	-
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	IEEE access, 2018, v. 6, p. 66600-66611	-
dcterms.isPartOf	IEEE access	-
dcterms.issued	2018	-
dc.identifier.isi	WOS:000452355000001	-
dc.description.validate	201903 bcrc	-
dc.description.oa	Version of Record	en_US
dc.identifier.FolderNumber	OA_IR/PIRA	en_US
dc.description.pubStatus	Published	en_US
dc.description.oaCategory	VoR allowed	en_US
Appears in Collections:	Journal/Magazine Article

Files in This Item:

File	Description	Size	Format
Yang_Network_Flows_Clusters.pdf		7.2 MB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show simple item record

Page views

188

Last Week
1

Last month

Citations as of Oct 6, 2025

Downloads

150

Citations as of Oct 6, 2025

SCOPUS^TM
Citations

6

Citations as of Jun 21, 2024

WEB OF SCIENCE^TM
Citations

5

Citations as of Oct 23, 2025

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Page views

Downloads

SCOPUSTM Citations

WEB OF SCIENCETM Citations

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations

Google Scholar^TM