Predicting gender and age categories in English conversations using lexical, non-lexical, and turn-taking features

Liesenfeld, A; Parti, G; Hsu, YY; Huang, CR

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/92350

DC Field	Value	Language
dc.contributor	Department of Chinese and Bilingual Studies	en_US
dc.creator	Liesenfeld, A	en_US
dc.creator	Parti, G	en_US
dc.creator	Hsu, YY	en_US
dc.creator	Huang, CR	en_US
dc.date.accessioned	2022-03-22T06:32:47Z	-
dc.date.available	2022-03-22T06:32:47Z	-
dc.identifier.uri	http://hdl.handle.net/10397/92350	-
dc.description	34th Pacific Asia Conference on Language, Information and Computation, Oct. 2020, Hanoi, Vietnam	en_US
dc.language.iso	en	en_US
dc.publisher	Association for Computational Linguistics	en_US
dc.rights	Copyright of contributed papers reserved by respective authors.	en_US
dc.rights	Posted with permission of the author.	en_US
dc.rights	The following publication Andreas Liesenfeld, Gábor Parti, Yuyin Hsu, and Chu-Ren Huang. 2020. Predicting gender and age categories in English conversations using lexical, non-lexical, and turn-taking features. In Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation, pages 157–166, Hanoi, Vietnam. Association for Computational Linguistics is available at https://aclanthology.org/2020.paclic-1.19/.	en_US
dc.title	Predicting gender and age categories in English conversations using lexical, non-lexical, and turn-taking features	en_US
dc.type	Conference Paper	en_US
dc.identifier.spage	157	en_US
dc.identifier.epage	166	en_US
dcterms.abstract	This paper examines gender and age salience and (stereo)typicality in British English talk with the aim to predict gender and age categories based on lexical, phrasal and turntaking features. We examine the SpokenBNC, a corpus of around 11.4 million words of British English conversations and identify behavioural differences between speakers that are labelled for gender and age categories. We explore differences in language use and turn-taking dynamics and identify a range of characteristics that set the categories apart. We find that female speakers tend to produce more and slightly longer turns, while turns by male speakers feature a higher type-token ratio and a distinct range of minimal particles such as “eh”, “uh” and “em”. Across age groups, we observe, for instance, that swear words and laughter characterize young speakers’ talk, while old speakers tend to produce more truncated words. We then use the observed characteristics to predict gender and age labels of speakers per conversation and per turn as a classification task, showing that non-lexical utterances such as minimal particles that are usually left out of dialog data can contribute to setting the categories apart.	en_US
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	In ML Nguyen, MC Luong & S Song (Eds.), Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation, 24-26 October, 2020, University of Science, Vietnam National University Hanoi, Vietnam, p. 157-166. Association for Computational Linguistics, 2020	en_US
dcterms.issued	2020-10	-
dc.relation.ispartofbook	Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation	en_US
dc.relation.conference	Pacific Asia Conference on Language, Information and Computation [PACLIC]	en_US
dc.description.validate	202203 bcfc	en_US
dc.description.oa	Version of Record	en_US
dc.identifier.FolderNumber	a1141-n03, CBS-0051	-
dc.identifier.SubFormID	43996	-
dc.description.fundingSource	Self-funded	en_US
dc.description.pubStatus	Published	en_US
dc.identifier.OPUS	50567980	-
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
2020.paclic-1.19.pdf		730.6 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show simple item record

Page views

64

Last Week
0

Last month

Citations as of May 12, 2024

Downloads

14

Citations as of May 12, 2024

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Page views

Downloads

Google ScholarTM

Google Scholar^TM