Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106688
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Chinese and Bilingual Studiesen_US
dc.creatorTesta, Den_US
dc.creatorChersoni, Een_US
dc.creatorLenci, Aen_US
dc.date.accessioned2024-06-03T02:11:30Z-
dc.date.available2024-06-03T02:11:30Z-
dc.identifier.isbn978-1-959429-72-2en_US
dc.identifier.urihttp://hdl.handle.net/10397/106688-
dc.descriptionThe 61st Conference of the the Association for Computational Linguistics, July 9-14, 2023, Toronto, Canadaen_US
dc.language.isoenen_US
dc.publisherAssociation for Computational Linguisticsen_US
dc.rights©2023 Association for Computational Linguisticsen_US
dc.rightsACL materials are Copyright © 1963–2024 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License (https://creativecommons.org/licenses/by-nc-sa/3.0/). Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).en_US
dc.rightsThe following publication Davide Testa, Emmanuele Chersoni, and Alessandro Lenci. 2023. We Understand Elliptical Sentences, and Language Models should Too: A New Dataset for Studying Ellipsis and its Interaction with Thematic Fit. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3340–3353, Toronto, Canada. Association for Computational Linguistics is available at https://doi.org/10.18653/v1/2023.acl-long.188.en_US
dc.titleWe understand elliptical sentences, and language models should too : a new dataset for studying ellipsis and its interaction with thematic fiten_US
dc.typeConference Paperen_US
dc.identifier.spage3340en_US
dc.identifier.epage3353en_US
dc.identifier.volume1en_US
dc.identifier.doi10.18653/v1/2023.acl-long.188en_US
dcterms.abstractEllipsis is a linguistic phenomenon characterized by the omission of one or more sentence elements. Solving such a linguistic construction is not a trivial issue in natural language processing since it involves the retrieval of non-overtly expressed verbal material, which might in turn require the model to integrate human-like syntactic and semantic knowledge. In this paper, we explored the issue of how the prototypicality of event participants affects the ability of Language Models (LMs) to handle elliptical sentences and to identify the omitted arguments at different degrees of thematic fit, ranging from highly typical participants to semantically anomalous ones. With this purpose in mind, we built ELLie, the first dataset composed entirely of utterances containing different types of elliptical constructions, and structurally suited for evaluating the effect of argument thematic fit in solving ellipsis and reconstructing the missing element. Our tests demonstrated that the probability scores assigned by the models are higher for typical events than for atypical and impossible ones in different elliptical contexts, confirming the influence of prototypicality of the event participants in interpreting such linguistic structures. Finally, we conducted a retrieval task of the elided verb in the sentence in which the low performance of LMs highlighted a considerable difficulty in reconstructing the correct event.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn The 61st Conference of the the Association for Computational Linguistics : Proceedings of the Conference, Volume 1: Long Papers, July 9-14, 2023, p. 3340-3353. Stroudsburg : Association for Computational Linguistics, 2023en_US
dcterms.issued2023-
dc.relation.ispartofbookThe 61st Conference of the the Association for Computational Linguistics : Proceedings of the Conference, Volume 1: Long Papers, July 9-14, 2023en_US
dc.relation.conferenceAssociation for Computational Linguistics. Annual Meeting [ACL]en_US
dc.description.validate202405 bcchen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumbera2727a-
dc.identifier.SubFormID48133-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
dc.relation.rdatahttps://github.com/Caput97/ELLie-ellipsis_and_thematic_fit_with_LMsen_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
2023.acl-long.188.pdf250.14 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

68
Citations as of May 11, 2025

Downloads

45
Citations as of May 11, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.