Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/81700
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computing-
dc.creatorZhang, Y-
dc.creatorLing, ZW-
dc.creatorLv, MS-
dc.creatorGuan, N-
dc.date.accessioned2020-02-10T12:28:42Z-
dc.date.available2020-02-10T12:28:42Z-
dc.identifier.issn2169-3536en_US
dc.identifier.urihttp://hdl.handle.net/10397/81700-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rightsThis work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0en_US
dc.rightsThe following publication Y. Zhang, Z. Ling, M. Lv and N. Guan, "A Gaussian Set Sampling Model for Efficient Shared Cache Profiling on Multi-Cores," in IEEE Access, vol. 7, pp. 115560-115567, 2019 is available at https://dx.doi.org/10.1109/ACCESS.2019.2936439en_US
dc.subjectGaussian distributionen_US
dc.subjectMulti-coreen_US
dc.subjectShared cacheen_US
dc.subjectSet samplingen_US
dc.titleA Gaussian set sampling model for efficient shared cache profiling on multi-coresen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage115560en_US
dc.identifier.epage115567en_US
dc.identifier.volume7en_US
dc.identifier.doi10.1109/ACCESS.2019.2936439en_US
dcterms.abstractThe last level cache (LLC) has significant impact to system performance on modern multi-core processors. But as cache sizes reach several megabytes and more, the overhead of exploring performance on LLC greatly increases as well. To improve the efficiency of performance analysis, we propose a setsampling-based cache profiling model for the performance analysis on multi-core LLC. We first explore the memory access distributions on LLC by developing a low-overhead stress-application-based method. The results show that memory access distributions can be approximated by Gaussian distribution function. Based on this observation, a Gaussian-distribution-based set sampling model is proposed which can predict program performance with limited representative samples. We evaluate our model on a contemporary multicore machine and show that 1) the proposed method can precisely predict program performance on LLC under different contention intensities and 2) our method can achieve similar precision with less samples compared to widely adopted set sampling methods such as the random sampling and the continuous address sampling.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE access, 2019, v. 7, p. 115560-115567en_US
dcterms.isPartOfIEEE accessen_US
dcterms.issued2019-
dc.identifier.isiWOS:000484228300003-
dc.identifier.scopus2-s2.0-85097333552-
dc.description.validate202002 bcrc-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Scopus/WOSen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Zhang_Gaussian_Set_Multi-Cores.pdf9.46 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

132
Last Week
3
Last month
Citations as of Jan 5, 2025

Downloads

78
Citations as of Jan 5, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.