Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/115059
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Research Institute for Smart Ageing | - |
| dc.creator | Wang, J | - |
| dc.creator | Ma, R | - |
| dc.creator | Yang, X | - |
| dc.creator | Qi, Q | - |
| dc.creator | Zhuang, Z | - |
| dc.creator | Wang, J | - |
| dc.creator | Liao, J | - |
| dc.creator | Guo, S | - |
| dc.date.accessioned | 2025-09-09T07:40:22Z | - |
| dc.date.available | 2025-09-09T07:40:22Z | - |
| dc.identifier.issn | 1544-3566 | - |
| dc.identifier.uri | http://hdl.handle.net/10397/115059 | - |
| dc.language.iso | en | en_US |
| dc.publisher | Association for Computing Machinery | en_US |
| dc.rights | This work is licensed under a Creative Commons Attribution International 4.0 License (https://creativecommons.org/licenses/by/4.0/). | en_US |
| dc.rights | ©2025 Copyright held by the owner/author(s). | en_US |
| dc.rights | The following publication Wang, J., Ma, R., Yang, X., Qi, Q., Zhuang, Z., Wang, J., Liao, J., & Guo, S. (2025). DeepZoning: Re-accelerate CNN Inference with Zoning Graph for Heterogeneous Edge Cluster. ACM Trans. Archit. Code Optim., 22(1), Article 10 is available at https://doi.org/10.1145/3701995. | en_US |
| dc.subject | Cooperative CNN inference | en_US |
| dc.subject | Edge computing | en_US |
| dc.subject | Graph partition | en_US |
| dc.subject | Model deployment | en_US |
| dc.title | DeepZoning : re-accelerate CNN inference with zoning graph for heterogeneous edge cluster | en_US |
| dc.type | Journal/Magazine Article | en_US |
| dc.identifier.volume | 22 | - |
| dc.identifier.issue | 1 | - |
| dc.identifier.doi | 10.1145/3701995 | - |
| dcterms.abstract | Parallelizing CNN inference on heterogeneous edge clusters with data parallelism has gained popularity as a way to meet real-time requirements without sacrificing model accuracy. However, existing algorithms struggle to find optimal parallel granularity for complex CNNS, the structure of which is a directed acyclic graph (DAG) rather than a chain, and the parallel dimension is inflexible. To distribute the workload of modern CNNs on heterogeneous devices is also proven as NP-hard problem. In this article, we introduce DeepZoning, a versatile and cooperative inference framework that combines both model and data parallelism to accelerate CNN inference. DeepZoning employs two algorithms at different levels: (1) a low-level Adaptive Workload Partition algorithm that uses linear programming and takes spatial and channel dimensions into optimization during the search for feature map distribution on heterogeneous devices, and (2) a high-level Model Partition algorithm that finds the optimal model granularity and organizes complex CNNs into sequential zones to balance communication and computation during execution. Our experimental evaluations show that DeepZoning is effective, achieving up to a 3.02× speed improvement on our experimental prototype compared to state-of-the-art algorithms. | - |
| dcterms.accessRights | open access | en_US |
| dcterms.bibliographicCitation | ACM transactions on architecture and code optimization, Mar. 2025, v. 22, no. 1, 10 | - |
| dcterms.isPartOf | ACM transactions on architecture and code optimization | - |
| dcterms.issued | 2025-03 | - |
| dc.identifier.scopus | 2-s2.0-105003630561 | - |
| dc.identifier.eissn | 1544-3973 | - |
| dc.identifier.artn | 10 | - |
| dc.description.validate | 202509 bcch | - |
| dc.description.oa | Version of Record | en_US |
| dc.identifier.FolderNumber | OA_Scopus/WOS | en_US |
| dc.description.fundingSource | Others | en_US |
| dc.description.fundingText | This work was supported by the National Natural Science Foundation of China under Grants (U23B2001, 62171057, 62101064, 62201072, 62001054, 62071067), the Ministry of Education and China Mobile Joint Fund (MCM20200202, MCM20180101). | en_US |
| dc.description.pubStatus | Published | en_US |
| dc.description.oaCategory | CC | en_US |
| Appears in Collections: | Journal/Magazine Article | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 3701995.pdf | 4.37 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



