Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/92350
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Chinese and Bilingual Studies | en_US |
dc.creator | Liesenfeld, A | en_US |
dc.creator | Parti, G | en_US |
dc.creator | Hsu, YY | en_US |
dc.creator | Huang, CR | en_US |
dc.date.accessioned | 2022-03-22T06:32:47Z | - |
dc.date.available | 2022-03-22T06:32:47Z | - |
dc.identifier.uri | http://hdl.handle.net/10397/92350 | - |
dc.description | 34th Pacific Asia Conference on Language, Information and Computation, Oct. 2020, Hanoi, Vietnam | en_US |
dc.language.iso | en | en_US |
dc.publisher | Association for Computational Linguistics | en_US |
dc.rights | Copyright of contributed papers reserved by respective authors. | en_US |
dc.rights | Posted with permission of the author. | en_US |
dc.rights | The following publication Andreas Liesenfeld, Gábor Parti, Yuyin Hsu, and Chu-Ren Huang. 2020. Predicting gender and age categories in English conversations using lexical, non-lexical, and turn-taking features. In Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation, pages 157–166, Hanoi, Vietnam. Association for Computational Linguistics is available at https://aclanthology.org/2020.paclic-1.19/. | en_US |
dc.title | Predicting gender and age categories in English conversations using lexical, non-lexical, and turn-taking features | en_US |
dc.type | Conference Paper | en_US |
dc.identifier.spage | 157 | en_US |
dc.identifier.epage | 166 | en_US |
dcterms.abstract | This paper examines gender and age salience and (stereo)typicality in British English talk with the aim to predict gender and age categories based on lexical, phrasal and turntaking features. We examine the SpokenBNC, a corpus of around 11.4 million words of British English conversations and identify behavioural differences between speakers that are labelled for gender and age categories. We explore differences in language use and turn-taking dynamics and identify a range of characteristics that set the categories apart. We find that female speakers tend to produce more and slightly longer turns, while turns by male speakers feature a higher type-token ratio and a distinct range of minimal particles such as “eh”, “uh” and “em”. Across age groups, we observe, for instance, that swear words and laughter characterize young speakers’ talk, while old speakers tend to produce more truncated words. We then use the observed characteristics to predict gender and age labels of speakers per conversation and per turn as a classification task, showing that non-lexical utterances such as minimal particles that are usually left out of dialog data can contribute to setting the categories apart. | en_US |
dcterms.accessRights | open access | en_US |
dcterms.bibliographicCitation | In ML Nguyen, MC Luong & S Song (Eds.), Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation, 24-26 October, 2020, University of Science, Vietnam National University Hanoi, Vietnam, p. 157-166. Association for Computational Linguistics, 2020 | en_US |
dcterms.issued | 2020-10 | - |
dc.relation.ispartofbook | Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation | en_US |
dc.relation.conference | Pacific Asia Conference on Language, Information and Computation [PACLIC] | en_US |
dc.description.validate | 202203 bcfc | en_US |
dc.description.oa | Version of Record | en_US |
dc.identifier.FolderNumber | a1141-n03, CBS-0051 | - |
dc.identifier.SubFormID | 43996 | - |
dc.description.fundingSource | Self-funded | en_US |
dc.description.pubStatus | Published | en_US |
dc.identifier.OPUS | 50567980 | - |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
2020.paclic-1.19.pdf | 730.6 kB | Adobe PDF | View/Open |
Page views
64
Last Week
0
0
Last month
Citations as of May 12, 2024
Downloads
14
Citations as of May 12, 2024
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.