Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/119010
DC FieldValueLanguage
dc.contributorDepartment of Computing-
dc.contributorResearch Centre for Data Science and Artificial Intelligence-
dc.creatorLi, X-
dc.creatorLi, Z-
dc.creatorLi, J-
dc.creatorXie, H-
dc.creatorLi, Q-
dc.date.accessioned2026-05-26T08:10:16Z-
dc.date.available2026-05-26T08:10:16Z-
dc.identifier.urihttp://hdl.handle.net/10397/119010-
dc.descriptionThe Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, Apr 24 2025en_US
dc.language.isoenen_US
dc.publisherOpenReview.neten_US
dc.rightsCC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)en_US
dc.rightsThe following publication Li, X., Li, Z., Li, J., Xie, H., & Li, Q. (2025). ESE: Espresso sentence embeddings. In The Thirteenth International Conference on Learning Representations (ICLR) is available at https://openreview.net/forum?id=plgLA2YBLH.en_US
dc.titleESE : espresso sentence embeddingsen_US
dc.typeConference Paperen_US
dcterms.abstractHigh-quality sentence embeddings are fundamental in many natural language processing (NLP) tasks, such as semantic textual similarity (STS) and retrieval-augmented generation (RAG). However, most existing methods leverage fixed-length sentence embeddings from full-layer language models, which lack the scalability to accommodate the diverse available resources across various applications. Viewing this gap, we propose a novel sentence embedding model Espresso Sentence Embeddings (ESE) with two learning processes. First, the learn-to-express process encodes more salient representations to shallow layers. Second, the learn-to-compress process compacts essential features into the initial dimensions using Principal Component Analysis (PCA). This way, ESE can scale model depth via the former process and embedding size via the latter. Extensive experiments on STS and RAG suggest that ESE can effectively produce high-quality sentence embeddings with less model depth and embedding size, enhancing inference efficiency. The code is available at https://github.com/SeanLee97/AnglE/blob/main/README_ESE.md.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationThe Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, Apr 24 2025, https://openreview.net/forum?id=plgLA2YBLH-
dcterms.issued2025-
dc.relation.conferenceInternational Conference on Learning Representations [ICLR]-
dc.description.validate202605 bcjz-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Othersen_US
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextXianming Li and Jing Li’s work has been supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. PolyU/25200821), the Innovation and Technology Fund (Project No. PRP/047/22FX), and PolyU Internal Fund from RCDSAI (Project No. 1-CE1E). Zongxi Li’s work has been supported by Faculty Research Grants (SDS24A2) of Lingnan University, Hong Kong, and the Faculty Development Scheme (Project No. UGC/FDS16/E10/23), of Hong Kong Research Grants Council; Haoran Xie’s work has been supported by the Faculty Research Grants (SDS24A8) and the Direct Grant (DR25E8) of Lingnan University, Hong Kong; Qing Li’s work has been supported by Hong Kong Research Grants Council through Research Impact Fund (project no. R1015-23).en_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Conference Paper
Open Access Information
Status open access
File Version Version of Record
Show simple item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.