Official code for paper "Optimization for Oriented Object Detection via Representation Invariance Loss".

Last update: Nov 28, 2022

Related tags

Overview

Optimization for Oriented Object Detection via Representation Invariance Loss

By Qi Ming, Zhiqiang Zhou, Lingjuan Miao, Xue Yang, and Yunpeng Dong.

The repository hosts the codes for our paper Optimization for Oriented Object Detection via Representation Invariance Loss (paper link), based on mmdetection and s2anet.

Introduction

To be updated.

Installation

conda create -n ridet python=3.7 -y
source activate ridet
conda install pytorch=1.3 torchvision cudatoolkit=10.0 -c pytorch

pip install -r requirements.txt
python setup.py develop
cd mmdet/ops/orn
python setup.py build_ext --inplace

apt-get update
apt-get install swig
apt-get install zip

cd DOTA_devkit
swig -c++ -python polyiou.i
python setup.py build_ext --inplace
cd ..

Getting Started

Datasets

DOTA
HRSC2016
ICDAR2015
UCAS-AOD
VOC2007
MSRA-TD500

Data Preration

cd DOTA_devkit/$DATASET
python prepare_$DATASET.py

Training

Set the following configuration according to your own file directory: $GPUS, $ROOT, $CONFIG, and then start training:

sh train.sh

Testing

Set the following configuration according to your own file directory: $GPUS, $DATASET, $CHECKPOINT, $CONFIG, and then start evaluation:

sh test.sh

Demo

To output the visualization of the detections, the following configuration need to be set: $ROOT, $IMAGES, $CHECKPOINT, $CONFIG, and then start evaluation:

sh demo.sh

Models

All the trained models can be found here with fetch code q9zc.

Notes

The implementation based on mmdetection does not work well on the scene text datasets. Recommend to use my another implementation: RIDet-pytorch.

Citation

To be updated.

Official code for paper "Optimization for Oriented Object Detection via Representation Invariance Loss".

Related tags

Overview

Optimization for Oriented Object Detection via Representation Invariance Loss

Introduction

Installation

Getting Started

Datasets

Data Preration

Training

Testing

Demo

Models

Notes

Citation

Owner

ming71

O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"

Mask-invariant Face Recognition through Template-level Knowledge Distillation

Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.

Open source code for the paper of Neural Sparse Voxel Fields.

banditml is a lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

PyTorch implementation of the TTC algorithm

AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition

Conformer: Local Features Coupling Global Representations for Visual Recognition

Mall-Customers-Segmentation - Customer Segmentation Using K-Means Clustering

Freecodecamp Scientific Computing with Python Certification; Solution for Challenge 2: Time Calculator

Collection of common code that's shared among different research projects in FAIR computer vision team.

GPU Accelerated Non-rigid ICP for surface registration

Use stochastic processes to generate samples and use them to train a fully-connected neural network based on Keras

A simple root calculater for python

This repository contain code on Novelty-Driven Binary Particle Swarm Optimisation for Truss Optimisation Problems.

Web service for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation based on OpenFace 2.0

Transfer Learning for Pose Estimation of Illustrated Characters

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)

A Machine Teaching Framework for Scalable Recognition