ESE : espresso sentence embeddings

Li, X; Li, Z; Li, J; Xie, H; Li, Q

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/119010

DC Field	Value	Language
dc.contributor	Department of Computing	-
dc.contributor	Research Centre for Data Science and Artificial Intelligence	-
dc.creator	Li, X	-
dc.creator	Li, Z	-
dc.creator	Li, J	-
dc.creator	Xie, H	-
dc.creator	Li, Q	-
dc.date.accessioned	2026-05-26T08:10:16Z	-
dc.date.available	2026-05-26T08:10:16Z	-
dc.identifier.uri	http://hdl.handle.net/10397/119010	-
dc.description	The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, Apr 24 2025	en_US
dc.language.iso	en	en_US
dc.publisher	OpenReview.net	en_US
dc.rights	CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)	en_US
dc.rights	The following publication Li, X., Li, Z., Li, J., Xie, H., & Li, Q. (2025). ESE: Espresso sentence embeddings. In The Thirteenth International Conference on Learning Representations (ICLR) is available at https://openreview.net/forum?id=plgLA2YBLH.	en_US
dc.title	ESE : espresso sentence embeddings	en_US
dc.type	Conference Paper	en_US
dcterms.abstract	High-quality sentence embeddings are fundamental in many natural language processing (NLP) tasks, such as semantic textual similarity (STS) and retrieval-augmented generation (RAG). However, most existing methods leverage fixed-length sentence embeddings from full-layer language models, which lack the scalability to accommodate the diverse available resources across various applications. To address this gap, we propose a novel sentence embedding model, Espresso Sentence Embeddings (ESE), with two learning processes. First, the learn-to-express process encodes more salient representations into shallow layers. Second, the learn-to-compress process compacts essential features into the initial dimensions using Principal Component Analysis (PCA). This way, ESE can scale model depth via the former process and embedding size via the latter. Extensive experiments on STS and RAG suggest that ESE can effectively produce high-quality sentence embeddings with less model depth and a smaller embedding size, enhancing inference efficiency. The code is available at https://github.com/SeanLee97/AnglE/blob/main/README_ESE.md.	-
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, Apr 24 2025, https://openreview.net/forum?id=plgLA2YBLH	-
dcterms.issued	2025	-
dc.relation.conference	International Conference on Learning Representations [ICLR]	-
dc.description.validate	202605 bcjz	-
dc.description.oa	Version of Record	en_US
dc.identifier.FolderNumber	OA_Others	en_US
dc.description.fundingSource	RGC	en_US
dc.description.fundingSource	Others	en_US
dc.description.fundingText	Xianming Li and Jing Li’s work has been supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. PolyU/25200821), the Innovation and Technology Fund (Project No. PRP/047/22FX), and the PolyU Internal Fund from RCDSAI (Project No. 1-CE1E). Zongxi Li’s work has been supported by the Faculty Research Grants (SDS24A2) of Lingnan University, Hong Kong, and the Faculty Development Scheme (Project No. UGC/FDS16/E10/23) of the Hong Kong Research Grants Council. Haoran Xie’s work has been supported by the Faculty Research Grants (SDS24A8) and the Direct Grant (DR25E8) of Lingnan University, Hong Kong. Qing Li’s work has been supported by the Hong Kong Research Grants Council through the Research Impact Fund (Project No. R1015-23).	en_US
dc.description.pubStatus	Published	en_US
dc.description.oaCategory	CC	en_US
Appears in Collections:	Conference Paper

Open Access Information

Status	open access
File Version	Version of Record
