Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/115639
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Chinese and Bilingual Studies-
dc.creatorPeng, B-
dc.creatorHsu, Y-
dc.creatorChersoni, E-
dc.creatorQiu, L-
dc.creatorHuang, CR-
dc.date.accessioned2025-10-10T00:19:45Z-
dc.date.available2025-10-10T00:19:45Z-
dc.identifier.issn1574-020X-
dc.identifier.urihttp://hdl.handle.net/10397/115639-
dc.language.isoenen_US
dc.publisherSpringer Dordrechten_US
dc.rights© The Author(s) 2025en_US
dc.rightsOpen Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.en_US
dc.rightsThe following publication Peng, B., Hsu, Yy., Chersoni, E. et al. Multilingual prediction of semantic norms with language models: a study on English and Chinese. Lang Resources & Evaluation (2025) is available at https://doi.org/10.1007/s10579-025-09866-9.en_US
dc.subjectCognitive modelingen_US
dc.subjectLarge language modelsen_US
dc.subjectMultilingualityen_US
dc.subjectPsycholinguisticsen_US
dc.subjectSemantic normsen_US
dc.subjectWord embeddingsen_US
dc.titleMultilingual prediction of semantic norms with language models : a study on English and Chineseen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.doi10.1007/s10579-025-09866-9-
dcterms.abstractLexical semantic norms characterize each lexical concept in terms of a set of semantic features for the words of a language. They provide essential resources for behavioral, computational, and neuro-cognitive studies of language and human cognition. Recent research advocate for the need for cognitively motivated feature sets, arguing that semantic representations grounded in human cognition can facilitate cross-linguistic modeling and even enable the prediction of a word’s semantic features based on its translation in another language. In this study, we present a new dataset of brain-based, Binder-style semantic norms for Chinese. Using the corresponding English dataset and the representational power of multilingual language models, we conduct systematic experiments on semantic norm prediction both within and across languages. We evaluate monolingual and English-Chinese cross-lingual norm prediction using two different methods: embedding-based regression vs. prompting with large language models. Our results show that bidirectional models from the BERT family and GPT-4 achieve a good level of accuracy, with moderate-to-high correlations with human ratings. Notably, in the cross-lingual setting, the best and the worst predicted features align with the higher and lower end of levels of human agreement when comparing norms of words between translated words. Our results support a novel computational approach for supplementing and expanding cognitive semantic norms, highlighting the potential of language models to bridge cross-linguistic semantic representations.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationLanguage resources and evaluation, Published: 06 September 2025, Latest articles, https://doi.org/10.1007/s10579-025-09866-9-
dcterms.isPartOfLanguage resources and evaluation-
dcterms.issued2025-
dc.identifier.scopus2-s2.0-105015430484-
dc.identifier.eissn1574-0218-
dc.description.validate202510 bcch-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_TAen_US
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work has been supported by the Start-up Fund for new recruits of the Hong Kong Polytechnic University (1-BE8G) and the RGC Direct Allocation Grant (A-PB1C). We would also like to thank the two reviewers for their constructive feedback.en_US
dc.description.pubStatusEarly releaseen_US
dc.description.TASpringer Nature (2025)en_US
dc.description.oaCategoryTAen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
s10579-025-09866-9.pdf3.24 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.