TGLC : visual object tracking by fusion of global-local information and channel information

Zhang, S; Zhang, D; Zou, Q

doi:10.1007/s11042-024-19002-4

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106607

DC Field	Value	Language
dc.contributor	Department of Mechanical Engineering	en_US
dc.creator	Zhang, S	en_US
dc.creator	Zhang, D	en_US
dc.creator	Zou, Q	en_US
dc.date.accessioned	2024-05-14T05:42:05Z	-
dc.date.available	2024-05-14T05:42:05Z	-
dc.identifier.issn	1380-7501	en_US
dc.identifier.uri	http://hdl.handle.net/10397/106607	-
dc.language.iso	en	en_US
dc.publisher	Springer	en_US
dc.subject	Visual object tracking	en_US
dc.subject	Global-local representation aggregation	en_US
dc.subject	Channel information	en_US
dc.subject	Transformer attention	en_US
dc.subject	Convolution	en_US
dc.title	TGLC : visual object tracking by fusion of global-local information and channel information	en_US
dc.type	Journal/Magazine Article	en_US
dc.identifier.doi	10.1007/s11042-024-19002-4	en_US
dcterms.abstract	Visual object tracking aspires to locate the target incessantly in each frame with designated initial target location, which is an imperative yet demanding task in computer vision. Recent approaches strive to fuse global information of template and search region for object tracking, which achieve promising tracking performance. However, fusion of global information devastates some local details. Local information is essential for distinguishing the target from background regions. With a focus on addressing this problem, this work presents a novel tracking algorithm TGLC integrating a channel-aware convolution block and Transformer attention for global and local representation aggregation, and for channel information modeling. This method is capable of accurately estimating the bounding box of the target. Extensive experiments are conducted on five widely recognized datasets, i.e., GOT-10k, TrackingNet, LaSOT, OTB100 and UAV123. The results depict that the proposed tracking method achieves competitive tracking performance compared with state-of-the-art trackers while still running in real-time. Visualization of the tracking results on LaSOT further demonstrates the capability of the proposed tracking method to cope with tracking challenges, e.g., illumination variation, deformation of the target and background clutter.	en_US
dcterms.accessRights	embargoed access	en_US
dcterms.bibliographicCitation	Multimedia tools and applications, Published: 28 March 2024, https://doi.org/10.1007/s11042-024-19002-4	en_US
dcterms.isPartOf	Multimedia tools and applications	en_US
dcterms.issued	2024	-
dc.description.validate	202405 bcrc	en_US
dc.description.oa	Not applicable	en_US
dc.identifier.FolderNumber	a2698	-
dc.identifier.SubFormID	48069	-
dc.description.fundingSource	Self-funded	en_US
dc.description.pubStatus	Early release	en_US
dc.date.embargo	0000-00-00 (to be updated)	en_US
dc.description.oaCategory	Green (AAM)	en_US
Appears in Collections:	Journal/Magazine Article

Open Access Information

Status	embargoed access
Embargo End Date	0000-00-00 (to be updated)

Access

View full-text via PolyU eLinks

Show simple item record

Page views

13

Citations as of Jun 30, 2024

Google Scholar^TM

Check

Open Access Information

Access

Page views

Google ScholarTM

Altmetric

Google Scholar^TM