Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/119010
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Department of Computing | - |
| dc.contributor | Research Centre for Data Science and Artificial Intelligence | - |
| dc.creator | Li, X | - |
| dc.creator | Li, Z | - |
| dc.creator | Li, J | - |
| dc.creator | Xie, H | - |
| dc.creator | Li, Q | - |
| dc.date.accessioned | 2026-05-26T08:10:16Z | - |
| dc.date.available | 2026-05-26T08:10:16Z | - |
| dc.identifier.uri | http://hdl.handle.net/10397/119010 | - |
| dc.description | The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, Apr 24 2025 | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | OpenReview.net | en_US |
| dc.rights | CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/) | en_US |
| dc.rights | The following publication Li, X., Li, Z., Li, J., Xie, H., & Li, Q. (2025). ESE: Espresso sentence embeddings. In The Thirteenth International Conference on Learning Representations (ICLR) is available at https://openreview.net/forum?id=plgLA2YBLH. | en_US |
| dc.title | ESE : espresso sentence embeddings | en_US |
| dc.type | Conference Paper | en_US |
| dcterms.abstract | High-quality sentence embeddings are fundamental in many natural language processing (NLP) tasks, such as semantic textual similarity (STS) and retrieval-augmented generation (RAG). However, most existing methods leverage fixed-length sentence embeddings from full-layer language models, which lack the scalability to accommodate the diverse available resources across various applications. Viewing this gap, we propose a novel sentence embedding model Espresso Sentence Embeddings (ESE) with two learning processes. First, the learn-to-express process encodes more salient representations to shallow layers. Second, the learn-to-compress process compacts essential features into the initial dimensions using Principal Component Analysis (PCA). This way, ESE can scale model depth via the former process and embedding size via the latter. Extensive experiments on STS and RAG suggest that ESE can effectively produce high-quality sentence embeddings with less model depth and embedding size, enhancing inference efficiency. The code is available at https://github.com/SeanLee97/AnglE/blob/main/README_ESE.md. | - |
| dcterms.accessRights | open access | en_US |
| dcterms.bibliographicCitation | The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, Apr 24 2025, https://openreview.net/forum?id=plgLA2YBLH | - |
| dcterms.issued | 2025 | - |
| dc.relation.conference | International Conference on Learning Representations [ICLR] | - |
| dc.description.validate | 202605 bcjz | - |
| dc.description.oa | Version of Record | en_US |
| dc.identifier.FolderNumber | OA_Others | en_US |
| dc.description.fundingSource | RGC | en_US |
| dc.description.fundingSource | Others | en_US |
| dc.description.fundingText | Xianming Li and Jing Li’s work has been supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. PolyU/25200821), the Innovation and Technology Fund (Project No. PRP/047/22FX), and PolyU Internal Fund from RCDSAI (Project No. 1-CE1E). Zongxi Li’s work has been supported by Faculty Research Grants (SDS24A2) of Lingnan University, Hong Kong, and the Faculty Development Scheme (Project No. UGC/FDS16/E10/23), of Hong Kong Research Grants Council; Haoran Xie’s work has been supported by the Faculty Research Grants (SDS24A8) and the Direct Grant (DR25E8) of Lingnan University, Hong Kong; Qing Li’s work has been supported by Hong Kong Research Grants Council through Research Impact Fund (project no. R1015-23). | en_US |
| dc.description.pubStatus | Published | en_US |
| dc.description.oaCategory | CC | en_US |
| Appears in Collections: | Conference Paper | |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.


