Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118272
DC Field | Value | Language
dc.contributor | Department of Building Environment and Energy Engineering | en_US
dc.contributor | Department of Land Surveying and Geo-Informatics | en_US
dc.contributor | Research Institute for Sustainable Urban Development | en_US
dc.creator | Lan, H | en_US
dc.creator | Hou, HC | en_US
dc.creator | Wong, MS | en_US
dc.date.accessioned | 2026-03-30T01:59:47Z | -
dc.date.available | 2026-03-30T01:59:47Z | -
dc.identifier.issn | 0360-1323 | en_US
dc.identifier.uri | http://hdl.handle.net/10397/118272 | -
dc.language.iso | en | en_US
dc.publisher | Pergamon Press | en_US
dc.subject | Indoor thermal environment management | en_US
dc.subject | Machine learning | en_US
dc.subject | Multimodal model | en_US
dc.subject | Occupant-centric control | en_US
dc.subject | Self-attention mechanism | en_US
dc.subject | Thermal comfort prediction | en_US
dc.title | Integrating infrared facial thermal imaging and tabular data for multimodal prediction of occupants' thermal sensation | en_US
dc.type | Journal/Magazine Article | en_US
dc.identifier.volume | 275 | en_US
dc.identifier.doi | 10.1016/j.buildenv.2025.112814 | en_US
dcterms.abstract | Developing robust thermal comfort models is essential for occupant-centric control (OCC) to optimize the indoor thermal environment while minimizing energy consumption. Conventional single-modal machine learning models, relying solely on either tabular or image data, often suffer from limited prediction accuracy and versatility. To address these challenges, this study proposes a multimodal framework that integrates both data types. A dataset of 610 paired records, encompassing environmental data, individual attributes, thermal sensation votes (TSV), and occupants’ facial thermal images, was collected. Separate single-modal models were trained on tabular and image data to identify the best-performing model for each modality. These were subsequently integrated using a self-attention mechanism to develop a unified multimodal predictive model. Results demonstrate that the artificial neural network (ANN), utilizing only tabular data, achieved an accuracy of 69.67% without incorporating temperature variables from facial regions of interest (ROIs), increasing to 72.46% when these variables were included. Conversely, the Inception-V3 model, trained solely on facial thermal images, achieved 63.44% accuracy. By integrating these approaches, the ANN+Inception-V3 multimodal model achieved a significantly improved accuracy of 81.48%, effectively capturing interaction effects from both data types. This study presents a robust framework and methodological reference for advancing multimodal thermal comfort prediction models, enabling scalable, personalized, and energy-efficient management strategies for indoor environments. | en_US
dcterms.accessRights | embargoed access | en_US
dcterms.bibliographicCitation | Building and environment, 1 May 2025, v. 275, 112814 | en_US
dcterms.isPartOf | Building and environment | en_US
dcterms.issued | 2025-05-01 | -
dc.identifier.scopus | 2-s2.0-86000724700 | -
dc.identifier.eissn | 1873-684X | en_US
dc.identifier.artn | 112814 | en_US
dc.description.validate | 202603 bchy | en_US
dc.description.oa | Not applicable | en_US
dc.identifier.SubFormID | G001359/2025-12 | -
dc.description.fundingSource | RGC | en_US
dc.description.fundingSource | Others | en_US
dc.description.fundingText | Funding text: This project received ethical approval from The Hong Kong Polytechnic University under the reference number HSEARS20230906001. We sincerely acknowledge the support provided by The Hong Kong Polytechnic University in facilitating this project. We are also deeply grateful to the participants for their time, effort, and valuable contributions, which made this study possible. Cynthia Hou thanks the funding support from the Hong Kong Polytechnic University under project ID P0052446. M.S. Wong thanks the funding support from the General Research Fund (grant nos. 15603920 and 15609421) and the Collaborative Research Fund (grant no. C5062-21GF) from the Research Grants Council, Hong Kong, China, and the funding support from the Research Institute for Sustainable Urban Development, The Hong Kong Polytechnic University, Hong Kong, China (grant no. 1-BBG2). | en_US
dc.description.pubStatus | Published | en_US
dc.date.embargo | 2027-05-01 | en_US
dc.description.oaCategory | Green (AAM) | en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status: embargoed access
Embargo End Date: 2027-05-01
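
The abstract above describes fusing a feed-forward ANN over tabular features with Inception-V3 features from facial thermal images via a self-attention mechanism. Below is a minimal PyTorch sketch of that style of fusion, not the authors' implementation: the layer sizes, the feature count (12), the 7-class TSV output (assuming the 7-point ASHRAE scale), the two-token attention scheme, and a recent torchvision API are all illustrative assumptions.

```python
import torch
import torch.nn as nn
from torchvision.models import inception_v3

class MultimodalTSVClassifier(nn.Module):
    """Hypothetical ANN + Inception-V3 fusion via self-attention.

    A sketch of the architecture outlined in the abstract; all
    dimensions and design details here are assumptions, not the
    values reported in the paper.
    """

    def __init__(self, n_tabular: int = 12, n_classes: int = 7, d_model: int = 256):
        super().__init__()
        # Tabular branch: small feed-forward ANN over environmental,
        # individual, and facial-ROI temperature features (assumed count).
        self.ann = nn.Sequential(
            nn.Linear(n_tabular, 128), nn.ReLU(),
            nn.Linear(128, d_model), nn.ReLU(),
        )
        # Image branch: Inception-V3 with its classifier replaced by a
        # projection to d_model. weights=None keeps the sketch self-contained;
        # the paper presumably fine-tunes a pretrained backbone.
        self.cnn = inception_v3(weights=None, aux_logits=False)
        self.cnn.fc = nn.Linear(self.cnn.fc.in_features, d_model)
        # Fusion: treat the two modality embeddings as a 2-token sequence and
        # let multi-head self-attention capture cross-modal interactions.
        self.attn = nn.MultiheadAttention(embed_dim=d_model, num_heads=4,
                                          batch_first=True)
        self.head = nn.Linear(2 * d_model, n_classes)

    def forward(self, tabular: torch.Tensor, image: torch.Tensor) -> torch.Tensor:
        t = self.ann(tabular)                # (B, d_model) tabular embedding
        v = self.cnn(image)                  # (B, d_model) image embedding
        tokens = torch.stack([t, v], dim=1)  # (B, 2, d_model) modality tokens
        fused, _ = self.attn(tokens, tokens, tokens)
        return self.head(fused.flatten(1))   # (B, n_classes) TSV logits

model = MultimodalTSVClassifier().eval()
x_tab = torch.randn(4, 12)                   # dummy tabular batch
x_img = torch.randn(4, 3, 299, 299)          # Inception-V3 expects 299x299;
                                             # a 1-channel thermal image would
                                             # need replication to 3 channels
with torch.no_grad():
    print(model(x_tab, x_img).shape)         # torch.Size([4, 7])
```

Treating each modality's embedding as one attention token is the simplest way to let self-attention weigh the two sources against each other; finer-grained schemes (e.g., attending over spatial CNN features) are equally plausible readings of the abstract.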