TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

Last update: Dec 19, 2022

Related tags

Deep Learning TCTrack

Overview

TCTrack: Temporal Contexts for Aerial Tracking （CVPR2022)

Ziang Cao and Ziyuan Huang and Liang Pan and Shiwei Zhang and Ziwei Liu and Changhong Fu

In CVPR, 2022.

[paper]

Abstract

Temporal contexts among consecutive frames are far from being fully utilized in existing visual trackers. In this work, we present TCTrack, a comprehensive framework to fully exploit temporal contexts for aerial tracking. The temporal contexts are incorporated at two levels: the extraction of features and the refinement of similarity maps. Specifically, for feature extraction, an online temporally adaptive convolution is proposed to enhance the spatial features using temporal information, which is achieved by dynamically calibrating the convolution weights according to the previous frames. For similarity map refinement, we propose an adaptive temporal transformer, which first effectively encodes temporal knowledge in a memory-efficient way, before the temporal knowledge is decoded for accurate adjustment of the similarity map. TCTrack is effective and efficient: evaluation on four aerial tracking benchmarks shows its impressive performance; real-world UAV tests show its high speed of over 27 FPS on NVIDIA Jetson AGX Xavier.

The implementation of our online temporally adaptive convolution is based on TadaConv (ICLR2022).

1. Environment setup

This code has been tested on Ubuntu 18.04, Python 3.8.3, Pytorch 0.7.0/1.6.0, CUDA 10.2. Please install related libraries before running this code:

pip install -r requirements.txt

2. Test

Download pretrained model by Baidu （code: 2u1l) or Googledrive and put it into tools/snapshot directory.

Download testing datasets and put them into test_dataset directory.

python ./tools/test.py                                
	--dataset UAV123_10fps                  
    --tracker_name TCTrack
	--snapshot snapshot/general_model.pth # pre-train model path

The testing result will be saved in the results/dataset_name/tracker_name directory.

Note: The results of TCTrack can be downloaded (code:kh3e).

3. Train

Prepare training datasets

Download the datasets：

Note: train_dataset/dataset_name/readme.md has listed detailed operations about how to generate training datasets.

Train a model

To train the TCTrack model, run train.py with the desired configs:

cd tools
python train.py

4. Evaluation

If you want to evaluate the results of our tracker, please put those results into results directory.

python eval.py 	                          \
	--tracker_path ./results          \ # result path
	--dataset UAV10fps                  \ # dataset_name
	--tracker_prefix 'general_model'   # tracker_name

Note: The code is implemented based on pysot-toolkit. We would like to express our sincere thanks to the contributors.

Demo video

References

@article{cao2022tctrack,
  title={{TCTrack: Temporal Contexts for Aerial Tracking}},
  author={Cao, Ziang and Huang, Ziyuan and Pan, Liang and Zhang, Shiwei and Liu, Ziwei and Fu, Changhong},
  journal={arXiv preprint arXiv:2203.01885},
  year={2022}
}

Acknowledgement

The code is implemented based on pysot. We would like to express our sincere thanks to the contributors.

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

Related tags

Overview

TCTrack: Temporal Contexts for Aerial Tracking （CVPR2022)

Abstract

1. Environment setup

2. Test

3. Train

Prepare training datasets

Train a model

4. Evaluation

Demo video

References

Acknowledgement

Owner

Intelligent Vision for Robotics in Complex Environment

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

SeisComP/SeisBench interface to enable deep-learning (re)picking in SeisComP

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption

Clairvoyance: a Unified, End-to-End AutoML Pipeline for Medical Time Series

A Strong Baseline for Image Semantic Segmentation

Github Traffic Insights as Prometheus metrics.

This repo provides a demo for the CVPR 2021 paper "A Fourier-based Framework for Domain Generalization" on the PACS dataset.

Official implementation of the paper "Topographic VAEs learn Equivariant Capsules"

Offical code for the paper: "Growing 3D Artefacts and Functional Machines with Neural Cellular Automata" https://arxiv.org/abs/2103.08737

Official implementation of "Membership Inference Attacks Against Self-supervised Speech Models"

Doing fast searching of nearest neighbors in high dimensional spaces is an increasingly important problem

Temporally Coherent GAN SIGGRAPH project.

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

Pairwise model for commonlit competition

A data-driven maritime port simulator

Grounding Representation Similarity with Statistical Testing

The official implementation of CircleNet: Anchor-free Detection with Circle Representation, MICCAI 2030

A Tensorflow implementation of the Text Conditioned Auxiliary Classifier Generative Adversarial Network for Generating Images from text descriptions

Official Implementation of Domain-Aware Universal Style Transfer

Code for ECIR'20 paper Diagnosing BERT with Retrieval Heuristics

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

Related tags

Overview

TCTrack: Temporal Contexts for Aerial Tracking （CVPR2022)

Abstract

1. Environment setup

2. Test

3. Train

Prepare training datasets

Train a model

4. Evaluation

Demo video

References

Acknowledgement

Owner

Intelligent Vision for Robotics in Complex Environment

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

SeisComP/SeisBench interface to enable deep-learning (re)picking in SeisComP

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption

Clairvoyance: a Unified, End-to-End AutoML Pipeline for Medical Time Series

A Strong Baseline for Image Semantic Segmentation

Github Traffic Insights as Prometheus metrics.

This repo provides a demo for the CVPR 2021 paper "A Fourier-based Framework for Domain Generalization" on the PACS dataset.

Official implementation of the paper "Topographic VAEs learn Equivariant Capsules"

Offical code for the paper: "Growing 3D Artefacts and Functional Machines with Neural Cellular Automata" https://arxiv.org/abs/2103.08737

Official implementation of "Membership Inference Attacks Against Self-supervised Speech Models"

Doing fast searching of nearest neighbors in high dimensional spaces is an increasingly important problem

Temporally Coherent GAN SIGGRAPH project.

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

Pairwise model for commonlit competition

A data-driven maritime port simulator

Grounding Representation Similarity with Statistical Testing

The official implementation of CircleNet: Anchor-free Detection with Circle Representation, MICCAI 2030

A Tensorflow implementation of the Text Conditioned Auxiliary Classifier Generative Adversarial Network for Generating Images from text descriptions

Official Implementation of Domain-Aware Universal Style Transfer

Code for ECIR'20 paper Diagnosing BERT with Retrieval Heuristics

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.