Official implementation of "Dynamic Anchor Learning for Arbitrary-Oriented Object Detection" (AAAI2021).

Last update: Nov 28, 2022

Overview

DAL

This project hosts the official implementation for our AAAI 2021 paper:

Dynamic Anchor Learning for Arbitrary-Oriented Object Detection [arxiv] [comments].

Abstract

In this paper, we propose a dynamic anchor learning (DAL) method, which utilizes the newly defined matching degree to comprehensively evaluate the localization potential of the anchors and carry out a more efficient label assignment process. In this way, the detector can dynamically select high-quality anchors to achieve accurate object detection, and the divergence between classification and regression will be alleviated.
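The matching degree can be illustrated with a minimal sketch. It assumes the form described in the paper: a weighted sum of spatial alignment `sa` (IoU between the anchor and the ground truth) and feature alignment `fa` (IoU between the regressed box and the ground truth), minus a penalty on their gap. The function names, the `alpha`, `gamma`, and `threshold` values, and the simple thresholding rule are illustrative assumptions, not the settings or code used in this repository.

```python
def matching_degree(sa, fa, alpha=0.5, gamma=2.0):
    """Score an anchor from its spatial and feature alignment.

    sa: IoU between the anchor and the ground-truth box (before regression).
    fa: IoU between the regressed box and the ground-truth box.
    alpha, gamma: illustrative values only (assumption, not repo defaults).
    """
    u = abs(sa - fa)  # uncertainty: disagreement between input and output IoU
    return alpha * sa + (1.0 - alpha) * fa - u ** gamma


def select_positive_anchors(candidates, threshold=0.5, alpha=0.5, gamma=2.0):
    """Return indices of (sa, fa) pairs whose matching degree reaches threshold.

    The fixed threshold is a simplification chosen for this sketch.
    """
    return [
        i
        for i, (sa, fa) in enumerate(candidates)
        if matching_degree(sa, fa, alpha, gamma) >= threshold
    ]


if __name__ == "__main__":
    # Three hypothetical anchors: consistent, good input / poor output,
    # poor input / good output.
    anchors = [(0.7, 0.7), (0.8, 0.3), (0.4, 0.9)]
    print(select_positive_anchors(anchors))  # prints [0]
```

In this toy example only the anchor whose input and output IoU agree is kept; the other two are penalized for the large gap between `sa` and `fa`, which is the intuition behind using the matching degree instead of input IoU alone.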

Getting Started

The code builds a Rotated RetinaNet with the proposed DAL method for rotated object detection. The supported datasets are DOTA, HRSC2016, ICDAR2013, ICDAR2015, UCAS-AOD, NWPU VHR-10, and VOC.

Installation

Install requirements:

pip install -r requirements.txt
pip install git+https://github.com/lehduong/torch-warmup-lr.git

Build the Cython and CUDA modules:

cd $ROOT/utils
sh make.sh
cd $ROOT/utils/overlaps_cuda
python setup.py build_ext --inplace

Installation for DOTA_devkit:

cd $ROOT/datasets/DOTA_devkit
sudo apt-get install swig
swig -c++ -python polyiou.i
python setup.py build_ext --inplace
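Before running the build steps above, it can help to confirm that the required tools are visible in the current environment. The following is a minimal sketch using only the Python standard library; the list of modules and tools to check (`torch`, `Cython`, `numpy`, `swig`, `sh`) is an assumption inferred from the commands above, not an official prerequisite list, and the function name is hypothetical.

```python
import importlib.util
import shutil


def check_build_prerequisites(
    modules=("torch", "Cython", "numpy"),  # assumed Python dependencies
    tools=("swig", "sh"),                  # assumed command-line tools
):
    """Report which Python modules and executables can be found."""
    report = {}
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        report[f"module:{name}"] = importlib.util.find_spec(name) is not None
    for name in tools:
        # shutil.which returns None when the executable is not on PATH.
        report[f"tool:{name}"] = shutil.which(name) is not None
    return report


if __name__ == "__main__":
    for item, ok in check_build_prerequisites().items():
        print(f"{item:20s} {'found' if ok else 'MISSING'}")
```

Anything reported as missing should be installed before building the Cython and CUDA modules or DOTA_devkit.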

Inference

You can use the following command to run inference on a dataset. Note that weight, img_dir, dataset, and hyp should be modified as appropriate.

python demo.py

Train

Move the dataset to the $ROOT directory.
Generate imageset files for dataset division via:

cd $ROOT/datasets
python generate_imageset.py

Modify the configuration file hyp.py and arguments in train.py, then start training:

python train.py
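For readers unfamiliar with imageset files, the dataset-division step above can be pictured as writing one text file per split, each listing image names. The sketch below is a generic illustration only: the function name, the 80/20 split, the file names `train.txt` and `test.txt`, and the one-stem-per-line format are all assumptions, and the actual behavior of generate_imageset.py may differ.

```python
import random
from pathlib import Path


def write_imagesets(image_dir, out_dir, train_ratio=0.8, seed=0,
                    exts=(".jpg", ".png", ".bmp", ".tif")):
    """Split image names into train/test lists and write them as text files.

    Hypothetical helper: the layout and file names are illustrative.
    """
    # Collect image names without extension, sorted for reproducibility.
    stems = sorted(
        p.stem for p in Path(image_dir).iterdir() if p.suffix.lower() in exts
    )
    random.Random(seed).shuffle(stems)  # deterministic shuffle for a given seed
    n_train = int(len(stems) * train_ratio)
    splits = {"train": stems[:n_train], "test": stems[n_train:]}

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for name, items in splits.items():
        text = "\n".join(items) + ("\n" if items else "")
        (out / f"{name}.txt").write_text(text)
    return {name: len(items) for name, items in splits.items()}
```

A call such as `write_imagesets("images", "ImageSets")` (both paths hypothetical) would return the number of images placed in each split.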

Evaluation

Different datasets use different evaluation procedures. For UCAS-AOD, HRSC2016, VOC, and NWPU VHR-10, you need to prepare labels in the appropriate format in advance. Taking evaluation on HRSC2016 as an example:

cd $ROOT/datasets/evaluate
python hrsc2gt.py

Then you can run the evaluation:

python eval.py

Note that:

The label-conversion script needs to be executed only once per dataset; when testing on a different dataset, run the corresponding conversion again.
The imageset file used in hrsc2gt.py is generated by generate_imageset.py.

Main Results

Method	Dataset	Bbox	Backbone	Input Size	mAP/F1
DAL	DOTA	OBB	ResNet-101	800 x 800	71.78
DAL	UCAS-AOD	OBB	ResNet-101	800 x 800	89.87
DAL	HRSC2016	OBB	ResNet-50	416 x 416	88.60
DAL	ICDAR2015	OBB	ResNet-101	800 x 800	82.4
DAL	ICDAR2013	HBB	ResNet-101	800 x 800	81.3
DAL	NWPU VHR-10	HBB	ResNet-101	800 x 800	88.3
DAL	VOC 2007	HBB	ResNet-101	800 x 800	76.1

Citation

If you find our work or code useful in your research, please consider citing:

@article{ming2020dynamic,
  title={Dynamic Anchor Learning for Arbitrary-Oriented Object Detection},
  author={Ming, Qi and Zhou, Zhiqiang and Miao, Lingjuan and Zhang, Hongwei and Li, Linhao},
  journal={arXiv preprint arXiv:2012.04150},
  year={2020}
}

If you have any questions, please contact me via issue or email.

Owner

ming71
