Enhancing automated essay scoring performance via fine-tuning pre-trained language models with combination of regression and ranking

Yang, R; Cao, J; Wen, Z; Wu, Y; He, X

doi:10.18653/v1/2020.findings-emnlp.141

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105512

Title:	Enhancing automated essay scoring performance via fine-tuning pre-trained language models with combination of regression and ranking
Authors:	Yang, R Cao, J Wen, Z Wu, Y He, X
Issue Date:	2020
Source:	In Findings of the Association for Computational Linguistics: EMNLP 2020, p. 1560-1569. Stroudsburg, PA, USA: Association for Computational Linguistics (ACL), 2020
Abstract:	Automated Essay Scoring (AES) is a critical text regression task that automatically assigns scores to essays based on their writing quality. Recently, the performance of sentence prediction tasks has been largely improved by using Pre-trained Language Models via fusing representations from different layers, constructing an auxiliary sentence, using multi-task learning, etc. However, to solve the AES task, previous works utilize shallow neural networks to learn essay representations and constrain calculated scores with regression loss or ranking loss, respectively. Since shallow neural networks trained on limited samples show poor performance to capture deep semantic of texts. And without an accurate scoring function, ranking loss and regression loss measures two different aspects of the calculated scores. To improve AES’s performance, we find a new way to fine-tune pre-trained language models with multiple losses of the same task. In this paper, we propose to utilize a pre-trained language model to learn text representations first. With scores calculated from the representations, mean square error loss and the batch-wise ListNet loss with dynamic weights constrain the scores simultaneously. We utilize Quadratic Weighted Kappa to evaluate our model on the Automated Student Assessment Prize dataset. Our model outperforms not only state-of-the-art neural models near 3 percent but also the latest statistic model. Especially on the two narrative prompts, our model performs much better than all other state-of-the-art models.
Publisher:	Association for Computational Linguistics (ACL)
ISBN:	978-1-952148-90-3
DOI:	10.18653/v1/2020.findings-emnlp.141
Description:	2020 Conference on Empirical Methods in Natural Language Processing, 16th-20th November 2020, Online
Rights:	© 2020 Association for Computational Linguistics This publication is licensed on a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/) The following publication Ruosong Yang, Jiannong Cao, Zhiyuan Wen, Youzheng Wu, and Xiaodong He. 2020. Enhancing Automated Essay Scoring Performance via Fine-tuning Pre-trained Language Models with Combination of Regression and Ranking. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1560–1569, Online. Association for Computational Linguistics is available at https://doi.org/10.18653/v1/2020.findings-emnlp.141.
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
2020.findings-emnlp.141.pdf		281.15 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show full item record

Page views

261

Last Week
75

Last month

Citations as of Aug 17, 2025

Downloads

171

Citations as of Aug 17, 2025

Google Scholar^TM

Check