Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106691
DC Field: Value [Language]
dc.contributor: Department of Chinese and Bilingual Studies [en_US]
dc.creator: Kauf, C [en_US]
dc.creator: Ivanova, AA [en_US]
dc.creator: Rambelli, G [en_US]
dc.creator: Chersoni, E [en_US]
dc.creator: She, JS [en_US]
dc.creator: Chowdhury, Z [en_US]
dc.creator: Fedorenko, E [en_US]
dc.creator: Lenci, A [en_US]
dc.date.accessioned: 2024-06-03T02:11:32Z
dc.date.available: 2024-06-03T02:11:32Z
dc.identifier.uri: http://hdl.handle.net/10397/106691
dc.language.iso: en [en_US]
dc.publisher: Wiley-Blackwell Publishing, Inc. [en_US]
dc.rights: © 2023 The Authors. Cognitive Science published by Wiley Periodicals LLC on behalf of Cognitive Science Society (CSS). [en_US]
dc.rights: This is an open access article under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited. [en_US]
dc.rights: The following publication Kauf, C., Ivanova, A.A., Rambelli, G., Chersoni, E., She, J.S., Chowdhury, Z., Fedorenko, E. and Lenci, A. (2023), Event Knowledge in Large Language Models: The Gap Between the Impossible and the Unlikely. Cognitive Science, 47: e13386 is available at https://doi.org/10.1111/cogs.13386. [en_US]
dc.subject: Artificial neural networks [en_US]
dc.subject: Generalized event knowledge [en_US]
dc.subject: Language models [en_US]
dc.subject: Plausibility [en_US]
dc.subject: Semantics [en_US]
dc.subject: Syntax [en_US]
dc.subject: Typicality [en_US]
dc.subject: World knowledge [en_US]
dc.title: Event knowledge in large language models : the gap between the impossible and the unlikely [en_US]
dc.type: Journal/Magazine Article [en_US]
dc.identifier.volume: 47 [en_US]
dc.identifier.issue: 11 [en_US]
dc.identifier.doi: 10.1111/cogs.13386 [en_US]
dcterms.abstract: Word co-occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse semantic tasks requiring world knowledge. An important but understudied question about LLMs' semantic abilities is whether they acquire generalized knowledge of common events. Here, we test whether five pretrained LLMs (from 2018's BERT to 2023's MPT) assign a higher likelihood to plausible descriptions of agent-patient interactions than to minimally different implausible versions of the same event. Using three curated sets of minimal sentence pairs (total n = 1215), we found that pretrained LLMs possess substantial event knowledge, outperforming other distributional language models. In particular, they almost always assign a higher likelihood to possible versus impossible events (The teacher bought the laptop vs. The laptop bought the teacher). However, LLMs show less consistent preferences for likely versus unlikely events (The nanny tutored the boy vs. The boy tutored the nanny). In follow-up analyses, we show that (i) LLM scores are driven by both plausibility and surface-level sentence features, (ii) LLM scores generalize well across syntactic variants (active vs. passive constructions) but less well across semantic variants (synonymous sentences), (iii) some LLM errors mirror human judgment ambiguity, and (iv) sentence plausibility serves as an organizing dimension in internal LLM representations. Overall, our results show that important aspects of event knowledge naturally emerge from distributional linguistic patterns, but also highlight a gap between representations of possible/impossible and likely/unlikely events. [en_US]
dcterms.accessRights: open access [en_US]
dcterms.bibliographicCitation: Cognitive science, Nov. 2023, v. 47, e13386 [en_US]
dcterms.isPartOf: Cognitive science [en_US]
dcterms.issued: 2023-11
dc.identifier.scopus: 2-s2.0-85177808989
dc.identifier.eissn: 1551-6709 [en_US]
dc.identifier.artn: e13386 [en_US]
dc.description.validate: 202405 bcch [en_US]
dc.description.oa: Version of Record [en_US]
dc.identifier.FolderNumber: a2727a
dc.identifier.SubFormID: 48136
dc.description.fundingSource: RGC [en_US]
dc.description.pubStatus: Published [en_US]
dc.description.oaCategory: CC [en_US]
dc.relation.rdata: https://github.com/carina-kauf/lm-event-knowledge [en_US]
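
The abstract describes comparing LLM likelihoods across minimal sentence pairs (plausible vs. implausible versions of the same event). The sketch below illustrates that general setup; it is not the authors' pipeline (their code is in the lm-event-knowledge repository linked above), and the GPT-2 checkpoint is an assumed stand-in for the models the paper actually tests. It scores each sentence by its summed token log-probability under a causal language model and checks which member of the pair the model prefers.

    # Illustrative minimal-pair scoring sketch (NOT the authors' pipeline;
    # see https://github.com/carina-kauf/lm-event-knowledge for their code).
    # The "gpt2" checkpoint is an assumed stand-in model.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    def sentence_log_prob(model, tokenizer, sentence):
        # Summed log-probability of each token given its left context.
        ids = tokenizer(sentence, return_tensors="pt").input_ids
        with torch.no_grad():
            logits = model(ids).logits
        # Logits at position i predict token i+1, so shift by one.
        log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
        targets = ids[0, 1:]
        return log_probs[torch.arange(targets.size(0)), targets].sum().item()

    tok = AutoTokenizer.from_pretrained("gpt2")
    lm = AutoModelForCausalLM.from_pretrained("gpt2").eval()

    # The paper's possible/impossible contrast: the model should assign a
    # higher likelihood to the plausible member of the pair.
    plausible = "The teacher bought the laptop."
    implausible = "The laptop bought the teacher."
    print(sentence_log_prob(lm, tok, plausible) >
          sentence_log_prob(lm, tok, implausible))

Masked models such as BERT, also tested in the paper, would need a different scoring function (e.g., a pseudo-log-likelihood computed by masking one token at a time), while the pairwise comparison logic stays the same.
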
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: Kauf_Event_Knowledge_Large.pdf (1.77 MB, Adobe PDF)

Open Access Information:
Status: open access
File Version: Version of Record

Page views: 5 (as of Jun 30, 2024)
Downloads: 4 (as of Jun 30, 2024)
SCOPUS Citations: 6 (as of Jun 21, 2024)
Web of Science Citations: 5 (as of Jun 27, 2024)
