Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105512
Title: Enhancing automated essay scoring performance via fine-tuning pre-trained language models with combination of regression and ranking
Authors: Yang, R 
Cao, J 
Wen, Z 
Wu, Y
He, X
Issue Date: 2020
Source: In Findings of the Association for Computational Linguistics: EMNLP 2020, p. 1560-1569. Stroudsburg, PA, USA: Association for Computational Linguistics (ACL), 2020
Abstract: Automated Essay Scoring (AES) is a critical text regression task that automatically assigns scores to essays based on their writing quality. Recently, the performance of sentence prediction tasks has been largely improved by using Pre-trained Language Models via fusing representations from different layers, constructing an auxiliary sentence, using multi-task learning, etc. However, to solve the AES task, previous works utilize shallow neural networks to learn essay representations and constrain calculated scores with either regression loss or ranking loss. Shallow neural networks trained on limited samples perform poorly at capturing the deep semantics of texts, and without an accurate scoring function, ranking loss and regression loss measure two different aspects of the calculated scores. To improve AES performance, we find a new way to fine-tune pre-trained language models with multiple losses of the same task. In this paper, we propose to utilize a pre-trained language model to learn text representations first. With scores calculated from the representations, mean square error loss and the batch-wise ListNet loss with dynamic weights constrain the scores simultaneously. We utilize Quadratic Weighted Kappa to evaluate our model on the Automated Student Assessment Prize dataset. Our model outperforms not only state-of-the-art neural models by nearly 3 percent but also the latest statistical model. In particular, on the two narrative prompts, our model performs much better than all other state-of-the-art models.
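Illustrative sketch (not part of the repository record): the abstract names three ingredients, a mean square error loss, a batch-wise ListNet loss, and Quadratic Weighted Kappa for evaluation. The pure-Python code below shows one common way to compute each of them. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the top-one form of ListNet, and the convex combination controlled by a hypothetical `rank_weight` parameter are all assumptions, and the abstract does not say how the dynamic weights are scheduled, so the weight is simply passed in by the caller.

```python
import math


def log_softmax(values):
    """Numerically stable log-softmax over a list of floats."""
    m = max(values)
    log_total = math.log(sum(math.exp(v - m) for v in values))
    return [v - m - log_total for v in values]


def mse_loss(pred, gold):
    """Mean squared error between predicted and gold scores."""
    return sum((p - g) ** 2 for p, g in zip(pred, gold)) / len(pred)


def listnet_loss(pred, gold):
    """Top-one ListNet loss over one batch (assumed form): cross-entropy
    between the softmax distributions of gold and predicted scores."""
    gold_probs = [math.exp(lp) for lp in log_softmax(gold)]
    pred_log_probs = log_softmax(pred)
    return -sum(g * lp for g, lp in zip(gold_probs, pred_log_probs))


def combined_loss(pred, gold, rank_weight):
    """Convex combination of the ranking and regression losses.
    rank_weight in [0, 1] is a hypothetical parameter; how the paper
    varies it during training is not stated in the abstract."""
    ranking = listnet_loss(pred, gold)
    regression = mse_loss(pred, gold)
    return rank_weight * ranking + (1.0 - rank_weight) * regression


def quadratic_weighted_kappa(rater_a, rater_b, min_rating, max_rating):
    """Quadratic Weighted Kappa between two lists of integer ratings."""
    n = max_rating - min_rating + 1
    if n == 1:
        return 1.0  # only one possible rating, agreement is trivial
    observed = [[0] * n for _ in range(n)]
    for a, b in zip(rater_a, rater_b):
        observed[a - min_rating][b - min_rating] += 1
    hist_a = [sum(row) for row in observed]
    hist_b = [sum(observed[i][j] for i in range(n)) for j in range(n)]
    total = len(rater_a)
    numerator = 0.0
    denominator = 0.0
    for i in range(n):
        for j in range(n):
            weight = (i - j) ** 2 / (n - 1) ** 2
            expected = hist_a[i] * hist_b[j] / total
            numerator += weight * observed[i][j]
            denominator += weight * expected
    if denominator == 0.0:
        return 1.0  # both raters gave one identical constant rating
    return 1.0 - numerator / denominator
```

In this sketch, calling `combined_loss(pred, gold, rank_weight=0.0)` reduces to plain regression, while `rank_weight=1.0` uses only the ranking term; any schedule that moves between the two during fine-tuning would be supplied by the training loop.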
Publisher: Association for Computational Linguistics (ACL)
ISBN: 978-1-952148-90-3
DOI: 10.18653/v1/2020.findings-emnlp.141
Description: 2020 Conference on Empirical Methods in Natural Language Processing, 16th-20th November 2020, Online
Rights: © 2020 Association for Computational Linguistics
This publication is licensed on a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/)
The following publication Ruosong Yang, Jiannong Cao, Zhiyuan Wen, Youzheng Wu, and Xiaodong He. 2020. Enhancing Automated Essay Scoring Performance via Fine-tuning Pre-trained Language Models with Combination of Regression and Ranking. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1560–1569, Online. Association for Computational Linguistics is available at https://doi.org/10.18653/v1/2020.findings-emnlp.141.
Appears in Collections: Conference Paper

Files in This Item:
File: 2020.findings-emnlp.141.pdf
Size: 281.15 kB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record

Page views: 55 (as of Feb 23, 2025)
Downloads: 13 (as of Feb 23, 2025)


