Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107917
DC Field | Value | Language
dc.contributor | Department of Chinese and Bilingual Studies | -
dc.creator | Wang, Z | -
dc.creator | Liu, M | -
dc.creator | Liu, K | -
dc.date.accessioned | 2024-07-17T07:13:12Z | -
dc.date.available | 2024-07-17T07:13:12Z | -
dc.identifier.issn | 0883-9514 | -
dc.identifier.uri | http://hdl.handle.net/10397/107917 | -
dc.language.iso | en | en_US
dc.publisher | Taylor & Francis Inc. | en_US
dc.rights | © 2024 The Author(s). Published with license by Taylor & Francis Group, LLC. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent. | en_US
dc.rights | The following publication Wang, Z., Liu, M., & Liu, K. (2024). Utilizing Machine Learning Techniques for Classifying Translated and Non-Translated Corporate Annual Reports. Applied Artificial Intelligence, 38(1) is available at https://doi.org/10.1080/08839514.2024.2340393. | en_US
dc.title | Utilizing machine learning techniques for classifying translated and non-translated corporate annual reports | en_US
dc.type | Journal/Magazine Article | en_US
dc.identifier.volume | 38 | -
dc.identifier.issue | 1 | -
dc.identifier.doi | 10.1080/08839514.2024.2340393 | -
dcterms.abstract | Globalization has led to the widespread adoption of translated corporate annual reports in international markets. Nonetheless, it remains largely unexplored whether these translated documents fulfill the same function and communicate as effectively to international investors as their non-translated counterparts. Considering their significance to stakeholders, differentiating between these two types of reports is essential, yet research in this area is insufficient. This study seeks to bridge this gap by leveraging machine learning algorithms to classify corporate annual reports based on their translation status. By constructing corpora of comparable texts and employing thirteen syntactic complexity indices as features, we analyzed the reports using eight different algorithms: Naïve Bayes, Logistic Regression, Support Vector Machine, k-Nearest Neighbors, Neural Network, Random Forest, Gradient Boosting and Deep Learning. Additionally, ensemble models were created by combining the three most effective algorithms. The best-performing model in our study achieved an Area Under the Curve (AUC) of 99.3%. This innovative approach demonstrates the effectiveness of syntactic complexity indices in machine learning for classifying translational language in corporate reporting, contributing valuable insights to text classification and translational language research. Our findings offer critical implications for stakeholders in multilingual contexts, highlighting the need for further research in this field. | -
dcterms.accessRights | open access | en_US
dcterms.bibliographicCitation | Applied artificial intelligence, 2024, v. 38, no. 1, 2340393 | -
dcterms.isPartOf | Applied artificial intelligence | -
dcterms.issued | 2024 | -
dc.identifier.scopus | 2-s2.0-85189932040 | -
dc.identifier.eissn | 1087-6545 | -
dc.identifier.artn | 2340393 | -
dc.description.validate | 202407 bcch | -
dc.description.oa | Version of Record | en_US
dc.identifier.FolderNumber | a3021a | en_US
dc.identifier.SubFormID | 49218 | en_US
dc.description.fundingSource | RGC | en_US
dc.description.pubStatus | Published | en_US
dc.description.oaCategory | CC | en_US
Appears in Collections: Journal/Magazine Article
Files in This Item:
File | Description | Size | Format
Wang_Utilizing_Machine_Learning.pdf | | 3.19 MB | Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record

Page views: 85 (as of Nov 10, 2025)
Downloads: 89 (as of Nov 10, 2025)
Scopus citations: 10 (as of Dec 19, 2025)
Web of Science citations: 8 (as of Dec 18, 2025)

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.