Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/116644
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Department of Building and Real Estate | en_US |
| dc.contributor | Mainland Development Office | en_US |
| dc.creator | Zhang, J | en_US |
| dc.creator | Shuai, S | en_US |
| dc.creator | Weng, Y | en_US |
| dc.creator | Hu, Y | en_US |
| dc.creator | Zhang, M | en_US |
| dc.creator | Cheng, M | en_US |
| dc.creator | Zhang, G | en_US |
| dc.date.accessioned | 2026-01-09T02:32:37Z | - |
| dc.date.available | 2026-01-09T02:32:37Z | - |
| dc.identifier.issn | 0926-5805 | en_US |
| dc.identifier.uri | http://hdl.handle.net/10397/116644 | - |
| dc.language.iso | en | en_US |
| dc.publisher | Elsevier | en_US |
| dc.subject | Activity understanding | en_US |
| dc.subject | Chain-of-thought | en_US |
| dc.subject | Construction monitoring | en_US |
| dc.subject | Inertial measurement unit (IMU) | en_US |
| dc.subject | Large language model (LLM) | en_US |
| dc.title | Large language model-driven framework for inertial measurement unit-based worker activity recognition | en_US |
| dc.type | Journal/Magazine Article | en_US |
| dc.identifier.volume | 180 | en_US |
| dc.identifier.doi | 10.1016/j.autcon.2025.106594 | en_US |
| dcterms.abstract | Inertial Measurement Units (IMUs) are widely used in wearable devices to detect worker activities, but current solutions often require multiple sensors and extensive labeled training data, limiting their practicality and applicability across diverse scenarios. This paper proposes a Large Language Model (LLM)-driven framework that recognizes worker activities from a single head-mounted IMU via unsupervised reasoning. Three meta-event principles are formulated and a video-IMU joint labeling tool is developed to extract meta-event features. An Activity Feature Recognizer is developed to identify motion characteristics, while K-Medoids clustering and autocorrelation functions are employed to quantify activity intensity and periodicity. Building upon these, an LLM-driven Agent network comprising a Supervisor, an Observer, and a Summarizer, is proposed to perform reasoning and activity understanding. Experiments achieved a Hamming distance of 2.62 for meta-event activity feature recognition and a 0.931 acceptance rate for Agent-inferred activity descriptions. | en_US |
| dcterms.accessRights | embargoed access | en_US |
| dcterms.bibliographicCitation | Automation in construction, Dec. 2025, v. 180, 106594 | en_US |
| dcterms.isPartOf | Automation in construction | en_US |
| dcterms.issued | 2025-12 | - |
| dc.identifier.scopus | 2-s2.0-105018103490 | - |
| dc.identifier.eissn | 1872-7891 | en_US |
| dc.identifier.artn | 106594 | en_US |
| dc.description.validate | 202601 bchy | en_US |
| dc.description.oa | Not applicable | en_US |
| dc.identifier.SubFormID | G000669/2025-11 | - |
| dc.description.fundingSource | Others | en_US |
| dc.description.fundingText | This research was jointly funded by the National Natural Science Foundation of China [Grant No. 42302322], Science, Technology and Innovation Commission of Shenzhen Municipality [Grant No. JCYJ20240813161904006]. | en_US |
| dc.description.pubStatus | Published | en_US |
| dc.date.embargo | 2027-12-31 | en_US |
| dc.description.oaCategory | Green (AAM) | en_US |
| Appears in Collections: | Journal/Magazine Article | |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



