Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/108877
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Department of Chinese and Bilingual Studies | en_US |
| dc.contributor | Department of Computing | en_US |
| dc.contributor | Department of Applied Mathematics | en_US |
| dc.creator | Xiang, R | en_US |
| dc.creator | Chersoni, E | en_US |
| dc.creator | Li, Y | en_US |
| dc.creator | Li, J | en_US |
| dc.creator | Huang, CR | en_US |
| dc.creator | Pan, Y | en_US |
| dc.creator | Li, Y | en_US |
| dc.date.accessioned | 2024-09-04T07:42:12Z | - |
| dc.date.available | 2024-09-04T07:42:12Z | - |
| dc.identifier.issn | 1574-020X | en_US |
| dc.identifier.uri | http://hdl.handle.net/10397/108877 | - |
| dc.language.iso | en | en_US |
| dc.publisher | Springer Dordrecht | en_US |
| dc.rights | © The Author(s) 2024 | en_US |
| dc.rights | This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. | en_US |
| dc.rights | The following publication Xiang, R., Chersoni, E., Li, Y. et al. Cantonese natural language processing in the transformers era: a survey and current challenges. Lang Resources & Evaluation 59, 1747–1773 (2025) is available at https://doi.org/10.1007/s10579-024-09744-w. | en_US |
| dc.subject | Cantonese | en_US |
| dc.subject | Code-switching | en_US |
| dc.subject | Evaluation resources | en_US |
| dc.subject | Multilingualism | en_US |
| dc.subject | NLP for social media | en_US |
| dc.title | Cantonese natural language processing in the transformers era : a survey and current challenges | en_US |
| dc.type | Journal/Magazine Article | en_US |
| dc.identifier.spage | 1747 | en_US |
| dc.identifier.epage | 1773 | en_US |
| dc.identifier.volume | 59 | en_US |
| dc.identifier.issue | 2 | en_US |
| dc.identifier.doi | 10.1007/s10579-024-09744-w | en_US |
| dcterms.abstract | Despite being spoken by a large population of speakers worldwide, Cantonese is under-resourced in terms of the data scale and diversity compared to other major languages. This limitation has excluded it from the current “pre-training and fine-tuning” paradigm that is dominated by Transformer architectures. In this paper, we provide a comprehensive review on the existing resources and methodologies for Cantonese Natural Language Processing, covering the recent progress in language understanding, text generation and development of language models. We finally discuss two aspects of the Cantonese language that could make it potentially challenging even for state-of-the-art architectures: colloquialism and multilinguality | en_US |
| dcterms.accessRights | open access | en_US |
| dcterms.bibliographicCitation | Language resources and evaluation, June 2025, v. 59, no. 2, p. 1747-1773 | en_US |
| dcterms.isPartOf | Language resources and evaluation | en_US |
| dcterms.issued | 2025-06 | - |
| dc.identifier.scopus | 2-s2.0-85195564852 | - |
| dc.identifier.eissn | 1574-0218 | en_US |
| dc.description.validate | 202409 bcch | en_US |
| dc.description.oa | Version of Record | en_US |
| dc.identifier.FolderNumber | OA_TA | - |
| dc.description.fundingSource | Self-funded | en_US |
| dc.description.pubStatus | Published | en_US |
| dc.description.TA | Springer Nature (2024) | en_US |
| dc.description.oaCategory | TA | en_US |
| Appears in Collections: | Journal/Magazine Article | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| s10579-024-09744-w.pdf | 1.38 MB | Adobe PDF | View/Open |
Page views
104
Citations as of Oct 6, 2025
Downloads
36
Citations as of Oct 6, 2025
SCOPUSTM
Citations
1
Citations as of Oct 24, 2025
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



