Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/108743
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Biomedical Engineering-
dc.contributorFaculty of Engineering-
dc.contributorFaculty of Engineering-
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.contributorSchool of Nursing-
dc.contributorResearch Institute for Smart Ageing-
dc.creatorLai, DKH-
dc.creatorCheng, ESW-
dc.creatorSo, BPH-
dc.creatorMao, YJ-
dc.creatorCheung, SMY-
dc.creatorCheung, DSK-
dc.creatorWong, DWC-
dc.creatorCheung, JCW-
dc.date.accessioned2024-08-27T04:40:22Z-
dc.date.available2024-08-27T04:40:22Z-
dc.identifier.urihttp://hdl.handle.net/10397/108743-
dc.language.isoenen_US
dc.publisherMDPI AGen_US
dc.rights© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).en_US
dc.rightsThe following publication Lai DK-H, Cheng ES-W, So BP-H, Mao Y-J, Cheung SM-Y, Cheung DSK, Wong DW-C, Cheung JC-W. Transformer Models and Convolutional Networks with Different Activation Functions for Swallow Classification Using Depth Video Data. Mathematics. 2023; 11(14):3081 is available at https://doi.org/10.3390/math11143081.en_US
dc.subjectAspiration pneumoniaen_US
dc.subjectComputer-aided screeningen_US
dc.subjectDeep learningen_US
dc.subjectDysphagiaen_US
dc.subjectGerontechnologyen_US
dc.titleTransformer models and convolutional networks with different activation functions for swallow classification using depth video dataen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume11-
dc.identifier.issue14-
dc.identifier.doi10.3390/math11143081-
dcterms.abstractDysphagia is a common geriatric syndrome that might induce serious complications and death. Standard diagnostics using the Videofluoroscopic Swallowing Study (VFSS) or Fiberoptic Evaluation of Swallowing (FEES) are expensive and expose patients to risks, while bedside screening is subjective and might lack reliability. An affordable and accessible instrumented screening is necessary. This study aimed to evaluate the classification performance of Transformer models and convolutional networks in identifying swallowing and non-swallowing tasks through depth video data. Different activation functions (ReLU, LeakyReLU, GELU, ELU, SiLU, and GLU) were then evaluated on the best-performing model. Sixty-five healthy participants (n = 65) were invited to perform swallowing (eating a cracker and drinking water) and non-swallowing tasks (a deep breath and pronouncing vowels: “/eɪ/”, “/iː/”, “/aɪ/”, “/oʊ/”, “/u:/”). Swallowing and non-swallowing were classified by Transformer models (TimeSFormer, Video Vision Transformer (ViViT)), and convolutional neural networks (SlowFast, X3D, and R(2+1)D), respectively. In general, convolutional neural networks outperformed the Transformer models. X3D was the best model with good-to-excellent performance (F1-score: 0.920; adjusted F1-score: 0.885) in classifying swallowing and non-swallowing conditions. Moreover, X3D with its default activation function (ReLU) produced the best results, although LeakyReLU performed better in deep breathing and pronouncing “/aɪ/” tasks. Future studies shall consider collecting more data for pretraining and developing a hyperparameter tuning strategy for activation functions and the high dimensionality video data for Transformer models.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationMathematics, July 2023, v. 11, no. 14, 3081-
dcterms.isPartOfMathematics-
dcterms.issued2023-07-
dc.identifier.scopus2-s2.0-85166230111-
dc.identifier.eissn2227-7390-
dc.identifier.artn3081-
dc.description.validate202408 bcch-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Scopus/WOSen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextHealth Bureau, Hong Kong; Research Institute for Smart Ageing, The Hong Kong Polytechnic Universityen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
mathematics-11-03081-v2.pdf2.91 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

72
Citations as of Nov 10, 2025

Downloads

18
Citations as of Nov 10, 2025

SCOPUSTM   
Citations

13
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

2
Citations as of Feb 6, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.