Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106690
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Chinese and Bilingual Studies-
dc.creatorPrange, J-
dc.creatorChersoni, E-
dc.date.accessioned2024-06-03T02:11:31Z-
dc.date.available2024-06-03T02:11:31Z-
dc.identifier.isbn978-1-959429-76-0-
dc.identifier.urihttp://hdl.handle.net/10397/106690-
dc.descriptionThe 12th Joint Conference on Lexical and Computational Semantics, July 13-14, 2023, Toronto, Ontario, Canadaen_US
dc.language.isoenen_US
dc.publisherAssociation for Computational Linguisticsen_US
dc.rights©2023 Association for Computational Linguisticsen_US
dc.rightsACL materials are Copyright © 1963–2024 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License (https://creativecommons.org/licenses/by-nc-sa/3.0/). Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).en_US
dc.rightsThe following publication Jakob Prange and Emmanuele Chersoni. 2023. Empirical Sufficiency Lower Bounds for Language Modeling with Locally-Bootstrapped Semantic Structures. In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023), pages 456–468, Toronto, Canada. Association for Computational Linguistics is available at https://doi.org/10.18653/v1/2023.starsem-1.40.en_US
dc.titleEmpirical sufficiency lower bounds for language modeling with locally-bootstrapped semantic structuresen_US
dc.typeConference Paperen_US
dc.identifier.spage456-
dc.identifier.epage468-
dc.identifier.doi10.18653/v1/2023.starsem-1.40-
dcterms.abstractIn this work we build upon negative results from an attempt at language modeling with predicted semantic structure, in order to establish empirical lower bounds on what could have made the attempt successful. More specifically, we design a concise binary vector representation of semantic structure at the lexical level and evaluate in-depth how good an incremental tagger needs to be in order to achieve better-than-baseline performance with an end-to-end semantic-bootstrapping language model. We envision such a system as consisting of a (pretrained) sequential-neural component and a hierarchical-symbolic component working together to generate text with low surprisal and high linguistic interpretability. We find that (a) dimensionality of the semantic vector representation can be dramatically reduced without losing its main advantages and (b) lower bounds on prediction quality cannot be established via a single score alone, but need to take the distributions of signal and noise into account.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn The 12th Joint Conference on Lexical and Computational Semantics : Proceedings of the Conference (*SEM 2023), July 13-14, 2023, p. 456–468. Stroudsburg : Association for Computational Linguistics, 2023-
dcterms.issued2023-
dc.relation.ispartofbookThe 12th Joint Conference on Lexical and Computational Semantics : Proceedings of the Conference (*SEM 2023), July 13-14, 2023-
dc.relation.conferenceJoint Conference on Lexical and Computational Semantics [*SEM],-
dc.description.validate202405 bcch-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumbera2727aen_US
dc.identifier.SubFormID48135en_US
dc.description.fundingSourceSelf-fundeden_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
dc.relation.rdatahttps://github.com/jakpra/SufficiencyLowerBounds-
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
2023.starsem-1.40.pdf766.92 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

110
Last Week
8
Last month
Citations as of Apr 12, 2026

Downloads

34
Citations as of Apr 12, 2026

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.