4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

Last update: Nov 09, 2022

Overview

A Two-Stage Shake-Shake Network for Long-tailed Recognition of SAR Aerial View Objects

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR)

Challenge Site

Overview

Synthetic Aperture Radar (SAR) has received more attention due to its complementary superiority on capturing significant information in the remote sensing area. However, for an Aerial View Object Classification (AVOC) task, SAR images still suffer from the long-tailed distribution of the aerial view objects. This disparity dampens the performance of classification methods, especially for the datasensitive deep learning models. In this paper, we propose a two-stage shake-shake network to tackle the long-tailed learning problem. Specifically, it decouples the learning procedure into the representation learning stage and the classification learning stage. Moreover, we apply the test time augmentation (TTA) and a post-processing approach (CAN) to improve the accuracy. In the PBVS 2022 Multi-modal Aerial View Object Classification Challenge Track 1, our method achieves 21.82% and 27.97% accuracy in the development phase and testing phase respectively, which achieves the top-tier among all the participants.

Requirements

Ubuntu (It's only tested on Ubuntu, so it may not work on Windows.)
Python >= 3.7
PyTorch >= 1.4.0
torchvision
```
pip install -r requirements.txt
```

Usage

The first stage training

python train.py --config ./configs/sar10/shake_shake.yaml

You need to change the value of “dataset_dir”, “dataset_dir_val”, under the “dataset” field and “output_dir” under the “train” field in the file “./configs/sar10/shake_shake.yaml”。

The second stage training

python train.py --config ./configs/sar10/shake_shake_fc.yaml

You need to change the value of “dataset_dir”, “dataset_dir_val” under the “dataset” field and “output_dir”, “checkpoint” under the “train” field in the file “./configs/sar10/shake_shake_fc.yaml”。

Test

python predict_TTA.py

You need to change the value of “dataset_dir”, “checkpoint”, under the “test” field in the file “./configs/sar10/shake_shake.yaml”, then you can find the results in file “.result/results.csv”。
You can download the trained model here.

Acknowledge

The codes borrow heavily from hysts/pytorch_image_classification.

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

Related tags

Overview

A Two-Stage Shake-Shake Network for Long-tailed Recognition of SAR Aerial View Objects

Overview

Requirements

Usage

The first stage training

The second stage training

Test

Acknowledge

Owner

LinpengPan

Adds timm pretrained backbone to pytorch's FasterRcnn model

YoloAll is a collection of yolo all versions. you you use YoloAll to test yolov3/yolov5/yolox/yolo_fastest

Compares various time-series feature sets on computational performance, within-set structure, and between-set relationships.

Code for our CVPR2021 paper coordinate attention

Self-Correcting Quantum Many-Body Control using Reinforcement Learning with Tensor Networks

This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Relay" (PRICAI 2021)

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

商品推荐系统

CO-PILOT: COllaborative Planning and reInforcement Learning On sub-Task curriculum

CLASP - Contrastive Language-Aminoacid Sequence Pretraining

mmfewshot is an open source few shot learning toolbox based on PyTorch

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

Towards Fine-Grained Reasoning for Fake News Detection

STARCH compuets regional extreme storm physical characteristics and moisture balance based on spatiotemporal precipitation data from reanalysis or climate model data.

Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)

Continuum Learning with GEM: Gradient Episodic Memory

Code of the paper "Part Detector Discovery in Deep Convolutional Neural Networks" by Marcel Simon, Erik Rodner and Joachim Denzler

Pytorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

The self-supervised goal reaching benchmark introduced in Discovering and Achieving Goals via World Models

A production-ready, scalable Indexer for the Jina neural search framework, based on HNSW and PSQL