Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118272
Title: Integrating infrared facial thermal imaging and tabular data for multimodal prediction of occupants' thermal sensation
Authors: Lan, H 
Hou, HC 
Wong, MS 
Issue Date: 1-May-2025
Source: Building and environment, 1 May 2025, v. 275, 112814
Abstract: Developing robust thermal comfort models is essential for occupant-centric control (OCC) to optimize the indoor thermal environment while minimizing energy consumption. Conventional single-modal machine learning models, relying solely on either tabular or image data, often suffer from limited prediction accuracy and versatility. To address these challenges, this study proposes a multimodal framework that integrates both data types. A dataset of 610 paired records, encompassing environmental data, individual attributes, thermal sensation votes (TSV), and occupants’ facial thermal images, was collected. Separate single-modal models were trained on tabular and image data to identify the best-performing model for each modality. These were subsequently integrated using a self-attention mechanism to develop a unified multimodal predictive model. Results demonstrate that the artificial neural network (ANN), utilizing only tabular data, achieved an accuracy of 69.67% without incorporating temperature variables from facial regions of interest (ROIs), increasing to 72.46% when these variables were included. Conversely, the Inception-V3 model, trained solely on facial thermal images, achieved 63.44% accuracy. By integrating these approaches, the ANN+Inception-V3 multimodal model achieved a significantly improved accuracy of 81.48%, effectively capturing interaction effects from both data types. This study presents a robust framework and methodological reference for advancing multimodal thermal comfort prediction models, enabling scalable, personalized, and energy-efficient management strategies for indoor environments.
Keywords: Indoor thermal environment management
Machine learning
Multimodal model
Occupant-centric control
Self-attention mechanism
Thermal comfort prediction
Publisher: Pergamon Press
Journal: Building and environment 
ISSN: 0360-1323
EISSN: 1873-684X
DOI: 10.1016/j.buildenv.2025.112814
Appears in Collections:Journal/Magazine Article

Open Access Information
Status embargoed access
Embargo End Date 2027-05-01
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.