Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106698
PIRA download icon_1.1View/Download Full Text
Title: Evaluating multilingual language models for cross-lingual ESG issue identification
Authors: Li, WY 
Chersoni, E 
Ngai, CSB 
Issue Date: 2024
Source: In CC Chen, Z Ma & U Hahn (Eds.). The Joint Workshop of the 7th Financial Technology and Natural Language Processing (FinNLP), the 5th Knowledge Discovery from Unstructured Data in Financial Services (KDF), and the 4th Economics and Natural Language Processing (ECONLP) Workshop (FinNLP-KDF-ECONLP 2024) : Workshop Proceedings, 20 May, 2024 Torino, Italia, p. 50-58. ELRA and ICCL, 2024
Abstract: The automation of information extraction from ESG reports has recently become a topic of increasing interest in the Natural Language Processing community. While such information is highly relevant for socially responsible investments, identifying the specific issues discussed in a corporate social responsibility report is one of the first steps in an information extraction pipeline. In this paper, we evaluate methods for tackling the Multilingual Environmental, Social and Governance (ESG) Issue Identification Task. Our experiments use existing datasets in English, French and Chinese with a unified label set. Leveraging multilingual language models, we compare two approaches that are commonly adopted for the given task: off-the-shelf and fine-tuning. We show that fine-tuning models end-to-end is more robust than off-the-shelf methods. Additionally, translating text into the same language has negligible performance benefits.
Keywords: Cross-lingual transfer
ESG reports
Multilingual NLP
Pre-trained language models
Text classification
Publisher: ELRA and ICCL
ISBN: 978-2-493814-19-7
Research Data: https://github.com/justinaL/ML-ESG-Eval
Description: 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Lingotto Conference Centre - Torino (Italia), 20-25 May, 2024
Rights: © 2024 ELRA Language Resource Association: CC BY-NC 4.0
ACL materials are Copyright © 1963–2024 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License (https://creativecommons.org/licenses/by-nc-sa/3.0/). Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
The following publication Wing Yan Li, Emmanuele Chersoni, and Cindy Sing Bik Ngai. 2024. Evaluating Multilingual Language Models for Cross-Lingual ESG Issue Identification. In Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing @ LREC-COLING 2024, pages 50–58, Torino, Italia. ELRA and ICCL. is available at https://aclanthology.org/2024.finnlp-1.6.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
2024.finnlp-1.6.pdf236.56 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

118
Last Week
6
Last month
Citations as of Dec 21, 2025

Downloads

41
Citations as of Dec 21, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.