Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/80445
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computing-
dc.creatorYang, L-
dc.creatorLiu, XX-
dc.creatorCao, JN-
dc.creatorWang, ZY-
dc.date.accessioned2019-03-26T09:17:13Z-
dc.date.available2019-03-26T09:17:13Z-
dc.identifier.issn2169-3536-
dc.identifier.urihttp://hdl.handle.net/10397/80445-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2018 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.en_US
dc.rightsPost with permission of the publisher.en_US
dc.rightsThe following publication Yang, L., Liu, X. X., Cao, J. N., & Wang, Z. Y. (2018). Joint scheduling of tasks and network flows in big data clusters. IEEE Access, 6, 66600-66611 is available at https://dx.doi.org/10.1109/ACCESS.2018.2878864en_US
dc.subjectTask schedulingen_US
dc.subjectFlow schedulingen_US
dc.subjectData centersen_US
dc.subjectSoftware defined networksen_US
dc.titleJoint scheduling of tasks and network flows in big data clustersen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage66600-
dc.identifier.epage66611-
dc.identifier.volume6-
dc.identifier.doi10.1109/ACCESS.2018.2878864-
dcterms.abstractAs an increasing number of big data processing platforms like Hadoop, Spark, and Storm appear and normally share the resources in the data center, it has been important and challenging to schedule various jobs from these platforms onto the underlying data center resources such that the overall job completion time is minimized. To solve the problem, the existing work either focus on the task-level scheduling techniques, such as Quincy and delay scheduling, or focus on the network flow scheduling techniques, such as D3 and preemptive distributed quick. These works deal with the scheduling of tasks and network flows separately and cannot achieve optimal performance. The reason is that the task scheduling without regard of the available network bandwidths may generate the task placement that causes serious network congestions and thus leads to long data transmission time. In this paper, we propose the joint scheduling technique by coordinating the task placement and the scheduling of network flows arising from these tasks. We develop a software-defined network (SDN)-based online scheduling framework which selects the task placement based on the available bandwidth on the SDN switches and at meanwhile optimally allocates the bandwidth to each data flow. Comprehensive trace-driven simulations show that the joint scheduling technique can take full use of the network bandwidth and thus reduce the job completion time by 55% on average compared with the benchmark methods.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE access, 2018, v. 6, p. 66600-66611-
dcterms.isPartOfIEEE access-
dcterms.issued2018-
dc.identifier.isiWOS:000452355000001-
dc.description.validate201903 bcrc-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_IR/PIRAen_US
dc.description.pubStatusPublisheden_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Yang_Network_Flows_Clusters.pdf7.2 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

120
Last Week
1
Last month
Citations as of Apr 14, 2024

Downloads

115
Citations as of Apr 14, 2024

SCOPUSTM   
Citations

6
Citations as of Apr 12, 2024

WEB OF SCIENCETM
Citations

5
Citations as of Apr 18, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.