Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/111827
PIRA download icon_1.1View/Download Full Text
Title: On the influence of discourse connectives on the predictions of humans and language models
Authors: Britton, J 
Cong, Y
Hsu, YY 
Chersoni, E 
Blache, P
Issue Date: 2024
Source: Frontiers in human neuroscience, 2024, v. 18, 1363120
Abstract: Psycholinguistic literature has consistently shown that humans rely on a rich and organized understanding of event knowledge to predict the forthcoming linguistic input during online sentence comprehension. We, the authors, expect sentences to maintain coherence with the preceding context, making congruent sentence sequences easier to process than incongruent ones. It is widely known that discourse relations between sentences (e.g., temporal, contingency, comparison) are generally made explicit through specific particles, known as discourse connectives, (e.g., and, but, because, after). However, some relations that are easily accessible to the speakers, given their event knowledge, can also be left implicit. The goal of this paper is to investigate the importance of discourse connectives in the prediction of events in human language processing and pretrained language models, with a specific focus on concessives and contrastives, which signal to comprehenders that their event-related predictions have to be reversed. Inspired by previous work, we built a comprehensive set of story stimuli in Italian and Mandarin Chinese that differ in the plausibility and coherence of the situation being described and the presence or absence of a discourse connective. We collected plausibility judgments and reading times from native speakers for the stimuli. Moreover, we correlated the results of the experiments with the predictions given by computational modeling, using Surprisal scores obtained via Transformer-based language models. The human judgements were collected using a seven-point Likert scale and analyzed using cumulative link mixed modeling (CLMM), while the human reading times and language model surprisal scores were analyzed using linear mixed effects regression (LMER). We found that Chinese NLMs are sensitive to plausibility and connectives, although they struggle to reproduce expectation reversal effects due to a connective changing the plausibility of a given scenario; Italian results are even less aligned with human data, with no effects of either plausibility and connectives on Surprisal.
Keywords: Discourse connectives
Event knowledge
Language models
Natural Language Processing
Psycholinguistics
Publisher: Frontiers Research Foundation
Journal: Frontiers in human neuroscience 
EISSN: 1662-5161
DOI: 10.3389/fnhum.2024.1363120
Rights: © 2024 Britton, Cong, Hsu, Chersoni and Blache. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (http://creativecommons.org/licenses/by/4.0/). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
The following publication Britton J, Cong Y, Hsu Y-Y, Chersoni E and Blache P (2024) On the influence of discourse connectives on the predictions of humans and language models. Front. Hum. Neurosci. 18:1363120 is available at https://doi.org/10.3389/fnhum.2024.1363120.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
fnhum-18-1363120.pdf1.44 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

6
Citations as of Apr 14, 2025

Downloads

2
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

1
Citations as of Dec 19, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.