Learning Calibrated-Guidance for Object Detection in Aerial Images

Last update: Sep 22, 2022

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

We propose a simple yet effective Calibrated-Guidance (CG) scheme to enhance channel communications in a feature transformer fashion, which can adaptively determine the calibration weights for each channel based on the global feature affinity-pairs. Specifically, given a set of feature maps, CG first computes the feature similarity between each channel and the remaining channels as the intermediary calibration guidance. Then, re-representing each channel by aggregating all the channels weighted together via the guidance. Our CG can be plugged into any deep neural network, which is named as CG-Net. To demonstrate its effectiveness and efficiency, extensive experiments are carried out on both oriented and horizontal object detection tasks of aerial images. Results on two challenging benchmarks (i.e., DOTA and HRSC2016) demonstrate that our CG-Net can achieve state-of-the-art performance in accuracy with a fair computational overhead.

Introduction

This codebase is created to build benchmarks for object detection in aerial images. It is modified from mmdetection. The master branch works with PyTorch 1.1 or higher. If you would like to use PyTorch 0.4.1, please checkout to the pytorch-0.4.1 branch.

Results

Visualization results for oriented object detection on the test set of DOTA.

Comparison to the baseline on DOTA for oriented object detection with ResNet-101. The figures with blue boxes are the results of the baseline and pink boxes are the results of our proposed CG-Net.

Experiment

ImageNet Pretrained Model from Pytorch

The effectiveness of our proposed methods with different backbone network on the test of DOTA.

Backbone	+CG	Weight	mAP(%)
ResNet-50		download	73.26
ResNet-50	+	download	74.21
ResNet-101		download	73.06
ResNet-101	+	download	74.30
ResNet-152		download	72.78
ResNet-152	+	download	73.53

CG-Net Results in DOTA.

Backbone	Aug Rotate	Task	Weight	mAP(%)
ResNet-101	+	Oriented	download	77.89
ResNet-101	+	Horizontal	download	78.26

Installation

Please refer to INSTALL.md for installation.

Get Started

Please see GETTING_STARTED.md for the basic usage of mmdetection.

Contributing

We appreciate all contributions to improve benchmarks for object detection in aerial images.

Citing

If you use our work, please consider citing:

@InProceedings{liang2021learning,
      title={Learning Calibrated-Guidance for Object Detection in Aerial Images}, 
      author={Dong, Liang and Zongqi, Wei and Dong, Zhang and Qixiang, Geng and Liyan, Zhang and Han, Sun and Huiyu, Zhou and Mingqiang, Wei and Pan, Gao},
      booktitle ={arXiv:2103.11399},
      year={2021}
}

Thanks to the Third Party Libs

Pytorch

mmdetection

AerialDetection

Learning Calibrated-Guidance for Object Detection in Aerial Images

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

Introduction

Results

Experiment

Installation

Get Started

Contributing

Citing

Thanks to the Third Party Libs

Owner

PyTorch implementation of Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy

FLSim a flexible, standalone library written in PyTorch that simulates FL settings with a minimal, easy-to-use API

This repository introduces a short project about Transfer Learning for Classification of MRI Images.

Codes for building and training the neural network model described in Domain-informed neural networks for interaction localization within astroparticle experiments.

ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs

HiPAL: A Deep Framework for Physician Burnout Prediction Using Activity Logs in Electronic Health Records

Cluttered MNIST Dataset

The code for SAG-DTA: Prediction of Drug–Target Affinity Using Self-Attention Graph Network.

Discerning Decision-Making Process of Deep Neural Networks with Hierarchical Voting Transformation

Implementation of the paper titled "Using Sampling to Estimate and Improve Performance of Automated Scoring Systems with Guarantees"

Fuzzification helps developers protect the released, binary-only software from attackers who are capable of applying state-of-the-art fuzzing techniques

Neural Caption Generator with Attention

MazeRL is an application oriented Deep Reinforcement Learning (RL) framework

[CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

Bayesian dessert for Lasagne

Code release for NeurIPS 2020 paper "Co-Tuning for Transfer Learning"

Repository of Vision Transformer with Deformable Attention

mPose3D, a mmWave-based 3D human pose estimation model.

Python suite to construct benchmark machine learning datasets from the MIMIC-III clinical database.

A fuzzing framework for SMT solvers