Official code for 'Robust Siamese Object Tracking for Unmanned Aerial Manipulator' and offical introduction to UAMT100 benchmark

Last update: Dec 18, 2022

Related tags

Deep Learning SiamSA

Overview

SiamSA: Robust Siamese Object Tracking for Unmanned Aerial Manipulator

Demo video

📹 Our video on Youtube and bilibili demonstrates the evaluation of SiamSA and other 4 state-of-the-art trackers on [email protected] and UAMT100 benchmark.

📹 Real-world tests of SiamSA on a flying UAM platform form first and third perspective are also involved.

UAMT100 benchmark

The UAMT100 benchmark consists of 100 image sequences, which are captured from UAM perspectives. For subsequent tasks of UAM tracking, such as grasping, it represents various possibilities of UAM's tracking the object in an indoor environment.

16 kinds of objects are involved, and 11 attributes are annotated for each sequence. The figure demonstrates four scenarios of UAM tracking in UAMT100. The histogram in the figure is a statistic of attributes in UAMT100.
For more detail, please refer to the benchmark website, which will be released soon.

Environment setup

This code has been tested on Ubuntu 18.04, Python 3.8.3, Pytorch 0.7.0/1.6.0, CUDA 10.2. Please install related libraries before running this code:

pip install -r requirements.txt

Test

Download model from Google Drive or BaiduYun (code: v4r0) and put it into tools/snapshot directory.

Download testing datasets and put them into test_dataset directory. If you want to test the tracker on a new dataset, please refer to pysot-toolkit to set test_dataset.

python test.py 	                    \
	--trackername SiamSA            \ # tracker_name
	--dataset UAV123_10fps          \ # dataset_name
	--snapshot snapshot/model.pth     # model_path

The testing result will be saved in the results/dataset_name/tracker_name directory.

We provide our test results on Google Drive and BaiduYun (code: v4r1).

Train

Prepare training datasets

Download the datasets：

VID
YOUTUBEBB (code: t7j8)
COCO
GOT-10K

Note: train_dataset/dataset_name/readme.md has listed detailed operations about how to generate training datasets.

Train a model

To train the SiamSA model, run train.py with the desired configs:

cd tools
python train.py

Evaluation

If you want to evaluate the tracker mentioned above, please put those results into results directory.

python eval.py 	                      \
	--tracker_path ./results          \ # result path
	--dataset UAV123_10fps            \ # dataset_name
	--tracker_prefix 'model'            # tracker_name

Contact

If you have any questions, please contact me.

Guangze Zheng

Email: [email protected]

Acknowledgement

The code is implemented based on pysot and SiamAPN. We would like to express our sincere thanks to the contributors.
Besides, we would like to thank Ziang Cao for his advice on the code.
As for UAMT100 benchmark, we appreciate the help from Fuling Lin, Haobo Zuo, and Liangliang Yao.
We would like to thank Kunhan Lu for his advice on TensorRT acceleration.

Official code for 'Robust Siamese Object Tracking for Unmanned Aerial Manipulator' and offical introduction to UAMT100 benchmark

Related tags

Overview

SiamSA: Robust Siamese Object Tracking for Unmanned Aerial Manipulator

Demo video

UAMT100 benchmark

Environment setup

Test

Train

Prepare training datasets

Train a model

Evaluation

Contact

Acknowledgement

Owner

Intelligent Vision for Robotics in Complex Environment

Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image

GRaNDPapA: Generator of Rad Names from Decent Paper Acronyms

DeepRec is a recommendation engine based on TensorFlow.

Automatic meme generation model using Tensorflow Keras.

STRIVE: Scene Text Replacement In Videos

RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Marks Lab at Harvard.

Package for extracting emotions from social media text. Tailored for financial data.

Unofficial pytorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions'

The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders".

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

Saliency - Framework-agnostic implementation for state-of-the-art saliency methods (XRAI, BlurIG, SmoothGrad, and more).

A parallel framework for population-based multi-agent reinforcement learning.

Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).

Pytorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Repositorio de los Laboratorios de Análisis Numérico / Análisis Numérico I de FAMAF, UNC.

🌊 Online machine learning in Python

SlideGraph+: Whole Slide Image Level Graphs to Predict HER2 Status in Breast Cancer

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

Official implementation of "Motif-based Graph Self-Supervised Learning forMolecular Property Prediction"