Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118496
PIRA download icon_1.1View/Download Full Text
Title: An AI-driven framework for continuous tourist sentiment scoring using longitudinal and group-level insights with pre-trained language models (RoBERTa-CSS)
Authors: Yang, T 
Hsu, CHC 
Issue Date: 6-Feb-2026
Source: Tourism review, 6 Feb. 2026, v. 81, no. 1, p.167-187
Abstract: Purpose: Tourist sentiment is typically measured as discrete categories (e.g. positive, neutral and negative) through lexicon-based or machine-learning-based approaches in extant studies. However, neuroscience and physiology scholars have argued that sentiments are continuous in nature. Treating sentiment as a categorical state may result in an overly simplified understanding of tourists’ sentiments, ultimately hindering the tourism industry’s ability to derive precise and actionable insights. This study aims to construct an AI-driven framework for continuous tourist sentiment scoring.
Design/methodology/approach: This paper proposed a tool named RoBERTa-CSS (RoBERTa-based Continuous Sentiment Scoring) to calculate tourists’ continuous sentiment scores based on the pre-trained language model RoBERTa. The structure of RoBERTa is refined by adding a fully connected neural network layer, enabling the prediction of continuous sentiment scores. Using Chinese online reviews of a hotel group from multiple travel platforms, 3,500 sentences segmented from 1,000 randomly selected reviews were manually annotated to evaluate the proposed approach.
Findings: The comparison with the state-of-the-art open-source packages, deep learning models, pre-trained language models and generative artificial intelligence tools on multiple evaluation metrics demonstrated the superiority of the proposed RoBERTa-CSS. The method was also validated on an English dataset, showing good performance. Several empirical analyses, including individual-level sentiment flow analysis, group-level sentiment distribution and longitudinal analysis, were performed using the full dataset. The results further showcased the edge of RoBERTa-CSS, compared to extant polarity categorization-oriented sentiment analysis methods.
Originality/value: This study expanded the analytical ability beyond simple categorization to facilitate understanding of the complexity and diversity of human sentiment based on an improved pre-trained language model. The relevance of this paper for tourism practitioners, destination management organizations and online travel platforms is discussed.
Graphical abstract: [Figure not available: see fulltext.]
Keywords: Big data analysis
Continuous sentiment
Pre-trained language model
RoBERTa-CSS
Tourist sentiment
Publisher: Emerald Publishing Limited
Journal: Tourism review 
ISSN: 1660-5373
EISSN: 1759-8451
DOI: 10.1108/TR-05-2025-0550
Rights: © Emerald Publishing Limited. This AAM is provided for your own personal use only. It may not be used for resale, reprinting, systematic distribution, emailing, or for any other commercial purpose without the permission of the publisher.
The following publication Yang T, Hsu CH (2026), "An AI-driven framework for continuous tourist sentiment scoring using longitudinal and group-level insights with pre-trained language models (RoBERTa-CSS)". Tourism Review, Vol. 81 No. 1 pp. 167–187 is published by Emerald and is available at https://doi.org/10.1108/TR-05-2025-0550.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Yang_AI-driven_Framework_Continuous.pdfPre-Published version792.79 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.