Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/100944
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Aeronautical and Aviation Engineeringen_US
dc.creatorLi, Fen_US
dc.creatorZou, Fen_US
dc.creatorRao, Jen_US
dc.date.accessioned2023-08-23T07:09:59Z-
dc.date.available2023-08-23T07:09:59Z-
dc.identifier.issn0041-624Xen_US
dc.identifier.urihttp://hdl.handle.net/10397/100944-
dc.language.isoenen_US
dc.publisherElsevieren_US
dc.rights© 2023 Elsevier B.V. All rights reserved.en_US
dc.rights© 2023. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.rightsThe following publication Li, F., Zou, F., & Rao, J. (2023). A multi-GPU and CUDA-aware MPI-based spectral element formulation for ultrasonic wave propagation in solid media. Ultrasonics, 134, 107049 is available at https://doi.org/10.1016/j.ultras.2023.107049.en_US
dc.subjectUltrasonic waveen_US
dc.subjectSpectral element formulationen_US
dc.subjectMulti-GPUen_US
dc.subjectCUDAen_US
dc.subjectCUDA-aware MPIen_US
dc.titleA multi-GPU and CUDA-aware MPI-based spectral element formulation for ultrasonic wave propagation in solid mediaen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume134en_US
dc.identifier.doi10.1016/j.ultras.2023.107049en_US
dcterms.abstractIn this paper, we introduce a new multi-GPU-based spectral element (SE) formulation for simulating ultrasonic wave propagation in solids. To maximize communication efficiency, we purposely developed, based on CUDA-aware MPI, two novel message exchange strategies which allow the common nodal forces of different subdomains to be shared between different GPUs in a direct manner, as opposed to via CPU hosts, during central difference-based time integration steps. The new multi-GPU and CUDA-aware MPI-based formulation is benchmarked against a multi-CPU core and classical MPI-based counterpart, demonstrating a remarkable acceleration in each and every stage of the computation of ultrasonic wave propagation, namely matrix assembly, time integration and message exchange. More importantly, both the computational efficiency and the degree-of-freedom limit of the new formulation are actually scalable with the number of GPUs used, potentially allowing larger structures to be computed and higher computational speeds to be realized. Finally, the new formulation was used to simulate the interaction between Lamb waves and randomly shaped thickness loss defects on plates, showing its potential to become an efficient, accurate and robust technique for addressing the propagation of ultrasonic waves in realistic engineering structures.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationUltrasonics, Sept 2023, v. 134, 107049en_US
dcterms.isPartOfUltrasonicsen_US
dcterms.issued2023-09-
dc.identifier.eissn1874-9968en_US
dc.identifier.artn107049en_US
dc.description.validate202308 bcchen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumbera2369-n01-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Li_Multi-GPU_CUDA-aware_MPI-based.pdfPre-Published version3.71 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

76
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

1
Citations as of Jun 21, 2024

WEB OF SCIENCETM
Citations

1
Citations as of Jan 2, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.