Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Last update: Apr 04, 2022

Related tags

Deep Learning FSAC

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

torch >= 1.0

torchvision >= 0.2.0

Python 3

Environmental settings

This repository is developed using python 3.6.12 on Ubuntu 16.04.5 LTS. The CUDA and pytorch version is 11.2 and 1.7.1. We use one NVIDIA 3090 GPU card for training and testing.

Dataset

PASCAL VOC, Watercolor, Cityscapes, Foggycityscapes -> Please follow the instructions in [Link] to prepare the datasets.

Daytime-Sunny, Dusk-Rainy, and Night-Rainy -> Dataset preparation instruction link [Link].

Code

Faster R-CNN -> Thanks for jwyang [Link]; Fourier Domain Adaptation -> Thanks for Yanchao Yang [Link].

Our Augmentation (Mix+Replace+Extend+Disorder).

Train

To train a faster R-CNN model with vgg16 on pascal_voc:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py --dataset pascal_voc --net vgg16 --bs 1 --cuda

And you need to add augmentated data in the loadpath by creating a new dataset_name variable.

Test

To test:

python test_net.py --dataset pascal_voc --net vgg16 --modelpath your modelpath --cuda

Augmentation

Daytime-Sunny -> Dusk-Rainy

Daytime-Sunny -> Night-Rainy

Result

Results on adaptation from Cityscapes to FoggyCityscapes. ‘prsn’, ‘mcycl’, and ‘bcycl’ separately denote ‘person’, ‘motorcycle’, and ‘bicycle’ category.

Results on adaptation from Daytime-sunny to Duskrainy. Here, we directly run the released codes of the compared methods to obtain the results.

Results on Daytime-sunny → Night-rainy.

Results on the compound target domain.

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Related tags

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

Environmental settings

Dataset

Code

Train

Test

Augmentation

Result

Owner

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

This repo provides the official code for TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/pdf/2103.04430.pdf).

Housing Price Prediction

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

Simple improvement of VQVAE that allow to generate x2 sized images compared to baseline

This is a JAX implementation of Neural Radiance Fields for learning purposes.

Code for ECCV 2020 paper "Contacts and Human Dynamics from Monocular Video".

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Graph Convolutional Neural Networks with Data-driven Graph Filter (GCNN-DDGF)

Multi-modal co-attention for drug-target interaction annotation and Its Application to SARS-CoV-2

Official Implementation of "LUNAR: Unifying Local Outlier Detection Methods via Graph Neural Networks"

TEA: A Sequential Recommendation Framework via Temporally Evolving Aggregations

This repository contains the scripts for downloading and validating scripts for the documents

Deep learning PyTorch library for time series forecasting, classification, and anomaly detection

Video Swin Transformer - PyTorch

Ground truth data for the Optical Character Recognition of Historical Classical Commentaries.

Source code for "Pack Together: Entity and Relation Extraction with Levitated Marker"

Source code for CVPR2022 paper "Abandoning the Bayer-Filter to See in the Dark"

Another pytorch implementation of FCN (Fully Convolutional Networks)

Deep Distributed Control of Port-Hamiltonian Systems