Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107701
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineeringen_US
dc.creatorLiew, SRCen_US
dc.creatorLaw, NFen_US
dc.date.accessioned2024-07-09T07:09:55Z-
dc.date.available2024-07-09T07:09:55Z-
dc.identifier.urihttp://hdl.handle.net/10397/107701-
dc.language.isoenen_US
dc.publisherSpringer Singaporeen_US
dc.rights© The Author(s) 2023. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.en_US
dc.rightsThe following publication Liew, S.R.C., Law, N.F. Use of subword tokenization for domain generation algorithm classification. Cybersecurity 6, 49 (2023) is available at https://doi.org/10.1186/s42400-023-00183-8.en_US
dc.subjectBotnet detectionen_US
dc.subjectDomain namesen_US
dc.subjectMachine learning-based botnet detectionen_US
dc.subjectNetwork securityen_US
dc.titleUse of subword tokenization for domain generation algorithm classificationen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume6en_US
dc.identifier.issue1en_US
dc.identifier.doi10.1186/s42400-023-00183-8en_US
dcterms.abstractDomain name generation algorithm (DGA) classification is an essential but challenging problem. Both feature-extracting machine learning (ML) methods and deep learning (DL) models such as convolutional neural networks and long short-term memory have been developed. However, the performance of these approaches varies with different types of DGAs. Most features in the ML methods can characterize random-looking DGAs better than word-looking DGAs. To improve the classification performance on word-looking DGAs, subword tokenization is employed for the DL models. Our experimental results proved that the subword tokenization can provide excellent classification performance on the word-looking DGAs. We then propose an integrated scheme that chooses an appropriate method for DGA classification depending on the nature of the DGAs. Results show that the integrated scheme outperformed existing ML and DL methods, and also the subword DL methods.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationCybersecurity, 2023, v. 6, no. 1, 49en_US
dcterms.isPartOfCybersecurityen_US
dcterms.issued2023-
dc.identifier.scopus2-s2.0-85169978106-
dc.identifier.eissn2523-3246en_US
dc.identifier.artn49en_US
dc.description.validate202407 bcwhen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumbera2981-
dc.identifier.SubFormID49022-
dc.description.fundingSourceSelf-fundeden_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
s42400-023-00183-8.pdf1.56 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

43
Citations as of Apr 14, 2025

Downloads

9
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

9
Citations as of Sep 12, 2025

WEB OF SCIENCETM
Citations

2
Citations as of Nov 14, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.