Item

Fine-grained Feature and Template Reconstruction for TIR Object Tracking

Liao, Donghai
Shu, Xiu
Li, Zhihui
Liu, Qiao
Yuan, Di
Chang, Xiaojun
He, Zhenyu
Supervisor
Department
Computer Vision
Embargo End Date
Type
Journal article
Date
2025
License
Language
English
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Thermal infrared (TIR) object tracking is a significant subject within the field of computer vision. Currently, TIR object tracking faces challenges such as insufficient representation of object texture information and underutilization of temporal information, which severely affects the tracking accuracy of TIR tracking methods. To address these issues, we propose a TIR object tracking method (called: FFTR) based on fine-grained feature and template reconstruction. Specifically, aiming at the fine-grained information of the TIR object, we employ a frequency channel attention mechanism that transforms TIR images into the frequency domain using discrete cosine transform features. By capturing the fine-grained feature of TIR images from the frequency domain, we enhance the model’s ability to comprehend these images. To better leverage temporal information, we utilize a template region reconstruction method. This method reconstructs the template from the previous frame based on the search area of the current frame, which is then incorporated into the attention computation for the subsequent frame, thereby improving the tracking capability of TIR objects. Extensive quantitative and qualitative experiments show that our method achieves competitive tracking performance on the TIR benchmarks.
Citation
D. Liao et al., “Fine-grained Feature and Template Reconstruction for TIR Object Tracking,” IEEE Transactions on Circuits and Systems for Video Technology, pp. 1–1, 2025, doi: 10.1109/TCSVT.2025.3556529.
Source
IEEE Transactions on Circuits and Systems for Video Technology
Conference
Keywords
Transformer encoder, TIR object tracking, Temporal information, Template reconstruction
Subjects
Source
Publisher
IEEE
Full-text link