Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/90417
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Department of Chinese and Bilingual Studies | en_US |
| dc.contributor | Department of English | en_US |
| dc.creator | Hou, R | en_US |
| dc.creator | Huang, CR | en_US |
| dc.creator | Ahrens, K | en_US |
| dc.creator | Lee, YMS | en_US |
| dc.date.accessioned | 2021-07-06T02:41:57Z | - |
| dc.date.available | 2021-07-06T02:41:57Z | - |
| dc.identifier.issn | 2055-7671 | en_US |
| dc.identifier.uri | http://hdl.handle.net/10397/90417 | - |
| dc.language.iso | en | en_US |
| dc.publisher | Oxford University Press | en_US |
| dc.rights | © The Author(s) 2019. Published by Oxford University Press on behalf of EADH. All rights reserved. | en_US |
| dc.rights | This is a pre-copyedited, author-produced PDF of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record Renkui Hou, Chu-Ren Huang, Kathleen Ahrens, Yat-Mei Sophia Lee, Linguistic characteristics of Chinese register based on the Menzerath—Altmann law and text clustering, Digital Scholarship in the Humanities, Volume 35, Issue 1, April 2020, Pages 54–66 is available online at: https://doi.org/10.1093/llc/fqz005. | en_US |
| dc.title | Linguistic characteristics of Chinese register based on the Menzerath—Altmann law and text clustering | en_US |
| dc.type | Journal/Magazine Article | en_US |
| dc.identifier.spage | 54 | en_US |
| dc.identifier.epage | 66 | en_US |
| dc.identifier.volume | 35 | en_US |
| dc.identifier.issue | 1 | en_US |
| dc.identifier.doi | 10.1093/llc/fqz005 | en_US |
| dcterms.abstract | This article explores the linguistic features of different registers in Chinese through text clustering driven by the Menzerath–Altmann (MA) law. We propose to calculate the average word length distribution according to clause length. The MA law predicts that texts from different registers will show differences in terms of average word length distribution in texts. As predicted by the MA law, analysis result demonstrates that average word length decreases with the increase of clause length in each register and that their relationship can be fitted by the formula y = axbe−cx. We hypothesize that it is the situation type, i.e. whether the text is dialectic or monologue, that is the linguistic characteristic behind the dichotomy of word length distribution. To confirm these register-distinguishing linguistic features, texts were represented by the average word length distribution and the fitted parameters using the vector space model and clustered according to their register categories. Good clustering results show that average word length distribution in certain length clauses and their fitted parameters can be used as the distinctive characteristics of these three registers. | en_US |
| dcterms.accessRights | open access | en_US |
| dcterms.bibliographicCitation | Digital scholarship in the humanities, Apr. 2020, v. 35, no. 1, p. 54-66 | en_US |
| dcterms.isPartOf | Digital scholarship in the humanities | en_US |
| dcterms.issued | 2020-04 | - |
| dc.identifier.eissn | 2055-768X | en_US |
| dc.description.validate | 202107 bcvc | en_US |
| dc.description.oa | Accepted Manuscript | en_US |
| dc.identifier.FolderNumber | a0947-n02 | - |
| dc.description.fundingSource | RGC | en_US |
| dc.description.fundingSource | Others | en_US |
| dc.description.fundingText | National Social Science Fund in China (Grant Award Number: 16BYY110), Hong Kong GRF (Grant Number 156097-15H) and The Hong Kong Polytechnic University (Grant Number 4-ZZFE) | en_US |
| dc.description.pubStatus | Published | en_US |
| dc.description.oaCategory | Green (AAM) | en_US |
| Appears in Collections: | Journal/Magazine Article | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Hou_Linguistic_characteristics_Chinese.pdf | Pre-Published version | 567.15 kB | Adobe PDF | View/Open |
Page views
105
Last Week
1
1
Last month
Citations as of Apr 14, 2025
Downloads
99
Citations as of Apr 14, 2025
WEB OF SCIENCETM
Citations
8
Citations as of Oct 10, 2024
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



