Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/101458
PIRA download icon_1.1View/Download Full Text
Title: Are frequent phrases directly retrieved like idioms ? An investigation with self-paced reading and language models
Authors: Rambelli, G
Chersoni, E 
Senaldi, M
Blache, P
Lenci, A
Issue Date: 2023
Source: Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023), Dubrovnik, Croatia, 6 May 2023, p. 87–98
Abstract: An open question in language comprehension studies is whether non-compositional multiword expressions like idioms and compositional-but-frequent word sequences are processed differently. Are the latter constructed online, or are instead directly retrieved from the lexicon, with a degree of entrenchment depending on their frequency? In this paper, we address this question with two different methodologies. First, we set up a self-paced reading experiment comparing human reading times for idioms and both highfrequency and low-frequency compositional word sequences. Then, we ran the same experiment using the Surprisal metrics computed with Neural Language Models (NLMs). Our results provide evidence that idiomatic and high-frequency compositional expressions are processed similarly by both humans and NLMs. Additional experiments were run to test the possible factors that could affect the NLMs’ performance.
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2023.mwe-1.13
Description: The 19th Workshop on Multiword Expressions (MWE 2023), May 6 2023, Dubrovnik, Croatia
Rights: © 2023 Association for Computational Linguistics
Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/)
The following publication Giulia Rambelli, Emmanuele Chersoni, Marco S. G. Senaldi, Philippe Blache, and Alessandro Lenci. 2023. Are Frequent Phrases Directly Retrieved like Idioms? An Investigation with Self-Paced Reading and Language Models. In Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023), pages 87–98, Dubrovnik, Croatia. Association for Computational Linguistics is available at https://doi.org/10.18653/v1/2023.mwe-1.13.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
Rambelli_Frequent_Phrases_Directly.pdf2.55 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

126
Last Week
7
Last month
Citations as of Nov 9, 2025

Downloads

52
Citations as of Nov 9, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.