Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/119004
DC FieldValueLanguage
dc.contributorDepartment of Computing-
dc.contributorResearch Centre for Data Science and Artificial Intelligence-
dc.creatorLi, X-
dc.creatorLi, J-
dc.date.accessioned2026-05-26T08:10:12Z-
dc.date.available2026-05-26T08:10:12Z-
dc.identifier.isbn979-8-89176-094-3-
dc.identifier.urihttp://hdl.handle.net/10397/119004-
dc.descriptionThe 62nd Annual Meeting of the Association for Computational Linguistics, Bangkok, Thailand, August 11-16, 2024en_US
dc.language.isoenen_US
dc.publisherAssociation for Computational Linguistics (ACL)en_US
dc.rights© 2024 Association for Computational Linguisticsen_US
dc.rightsACL materials are Copyright © 1963–2026 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License (https://creativecommons.org/licenses/by-nc-sa/3.0/). Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).en_US
dc.rightsThe following publication Xianming Li and Jing Li. 2024. AoE: Angle-optimized Embeddings for Semantic Textual Similarity. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1825–1839, Bangkok, Thailand. Association for Computational Linguistics is available at https://doi.org/10.18653/v1/2024.acl-long.101.en_US
dc.titleAoe : angle-optimized embeddings for semantic textual similarityen_US
dc.typeConference Paperen_US
dc.identifier.spage1825-
dc.identifier.epage1839-
dc.identifier.volume1-
dc.identifier.doi10.18653/v1/2024.acl-long.101-
dcterms.abstractText embedding is pivotal in semantic textual similarity (STS) tasks, which are crucial components in Large Language Model (LLM) applications. STS learning largely relies on the cosine function as the optimization objective to reflect semantic similarity. However, the cosine has saturation zones rendering vanishing gradients and hindering learning subtle semantic differences in text embeddings. To address this issue, we propose a novel Angle-optimized Embedding model, AoE. It optimizes angle differences in complex space to explore similarity in saturation zones better. To set up a comprehensive evaluation, we experimented with existing short-text STS, our newly collected long-text STS, and downstream task datasets. Extensive experimental results on STS and MTEB benchmarks show that AoE significantly outperforms popular text embedding models neglecting cosine saturation zones. It highlights that AoE can produce high-quality text embeddings and broadly benefit downstream tasks. The code is available at: https://github.com/SeanLee97/AnglE.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (v. 1: Long Papers), p. 1825–1839. Bangkok, Thailand: Association for Computational Linguistics, 2024-
dcterms.issued2024-
dc.identifier.scopus2-s2.0-85204492105-
dc.relation.conferenceAnnual Meeting of the Association for Computational Linguistics [ACL]-
dc.description.validate202605 bcjz-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Scopus/WOSen_US
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work is supported by the NSFC Young Scientists Fund (Project No. 62006203), a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. PolyU/25200821), the Innovation and Technology Fund (Project No. PRP/047/22FX), and PolyU Internal Fund from RC-DSAI (Project No. 1-CE1E). Here, we sincerely thank the reviewers and ACs for their valuable input, which has greatly improved our work.en_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Conference Paper
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.