TransCD: Scene Change Detection via Transformer-based Architecture

Last update: Dec 11, 2022

Requirements

Python 3.7.0  
PyTorch 1.6.0  
Visdom 0.1.8.9  
Torchvision 0.7.0
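
To confirm that an environment matches the versions listed above, a small check along the following lines may help. This is a minimal sketch and not part of the repository: the helper name `find_mismatches` is hypothetical, and the distribution names `torch`, `torchvision`, and `visdom` are assumed to be the names under which the packages were installed.

```python
try:
    from importlib.metadata import version as installed_version  # Python 3.8+
except ImportError:
    # Python 3.7 fallback via setuptools' pkg_resources.
    import pkg_resources

    def installed_version(name):
        return pkg_resources.get_distribution(name).version

# Versions taken from the Requirements list; the keys are assumed package names.
PINNED = {"torch": "1.6.0", "torchvision": "0.7.0", "visdom": "0.1.8.9"}


def find_mismatches(pinned, lookup=installed_version):
    """Return {package: (wanted, found)} for every package that differs."""
    mismatches = {}
    for name, wanted in pinned.items():
        try:
            found = lookup(name)
        except Exception:
            found = None  # treated as "not installed"
        # A local build tag such as "1.6.0+cu101" still counts as a match.
        if found is None or found.split("+")[0] != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, found) in find_mismatches(PINNED).items():
        print(f"{name}: expected {wanted}, found {found or 'not installed'}")
```

The `lookup` argument exists only so the comparison can be exercised without the packages installed; in normal use the default is enough.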

Datasets

CD2014 dataset
- paper: changedetection.net: A new change detection benchmark dataset
- paper: CDnet 2014: An Expanded Change Detection Benchmark Dataset
- dataset: http://changedetection.net/
VL-CMU-CD
- paper: Street-view change detection with deconvolutional networks
- dataset: https://ghsi.github.io/proj/RSS2016.html

Pretrained Model

Pretrained models for CDNet-2014 and VL-CMU-CD are available. You can download them from the following links.

CDNet-2014: [Baiduyun] (password: 78cp) or [GoogleDrive].
- Six models trained on the CDNet-2014 dataset are provided: SViT_E1_D1_16, SViT_E1_D1_32, SViT_E4_D4_16, SViT_E4_D4_32, Res_SViT_E1_D1_16, and Res_SViT_E4_D4_16.
VL-CMU-CD: [Baiduyun] (password: ydzl) or [GoogleDrive].
- Four models trained on the VL-CMU-CD dataset are provided: SViT_E1_D1_16, SViT_E1_D1_32, Res_SViT_E1_D1_16, and Res_SViT_E1_D1_32.

Test

Before testing, download the datasets and pretrained models. Copy the pretrained models to the folder './dataset_name/outputs/best_weights', then run the following command:

cd TransCD_ROOT
python test.py --net_cfg <net_cfg> --train_cfg <train_cfg>

Use --save_changemap True to save predicted changemaps. For example:

python test.py --net_cfg SVit_E1_D1_32 --train_cfg CDNet_2014 --save_changemap True
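
Copying the downloaded weights into './dataset_name/outputs/best_weights' can also be scripted. The following is a minimal sketch under stated assumptions: the helper names are hypothetical, the folder name `CDNet_2014` used for `dataset_name` is a guess based on the `--train_cfg` value above, and the weight file name in the usage note is a placeholder, since the actual file names are not given here.

```python
import shutil
from pathlib import Path


def best_weights_dir(repo_root, dataset_name):
    """Build <repo_root>/<dataset_name>/outputs/best_weights."""
    return Path(repo_root) / dataset_name / "outputs" / "best_weights"


def install_weights(weight_file, repo_root, dataset_name):
    """Copy one downloaded weight file into the best_weights folder.

    The folder is created if it does not exist yet. Returns the path
    of the copied file.
    """
    target_dir = best_weights_dir(repo_root, dataset_name)
    target_dir.mkdir(parents=True, exist_ok=True)
    # shutil.copy2 returns the path of the newly created file.
    return Path(shutil.copy2(str(weight_file), str(target_dir)))
```

For example, `install_weights("downloads/example_weights.bin", "TransCD_ROOT", "CDNet_2014")` would place the file under `TransCD_ROOT/CDNet_2014/outputs/best_weights/`; both the file name and the dataset folder name here are illustrative only.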

Training

Before training, download the datasets and set the dataset path in configs.py to your local path.

cd TransCD_ROOT
python -m visdom.server
python train.py --net_cfg <net_cfg> --train_cfg <train_cfg>

For example:

python -m visdom.server
python train.py --net_cfg Res_SViT_E1_D1_16 --train_cfg VL_CMU_CD

To monitor training progress, open 'http://localhost:8097' in your browser.
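
Because training reports to a Visdom server, it can be useful to confirm that the server is reachable before launching a run. The sketch below is an illustration, not repository code: the helper names are hypothetical, port 8097 is taken from the URL above, and the check only confirms that something accepts TCP connections on that port, not that it is actually Visdom.

```python
import socket
import subprocess
import sys


def visdom_is_up(host="localhost", port=8097, timeout=1.0):
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def build_train_command(net_cfg, train_cfg):
    """Assemble the train.py invocation shown above as an argument list."""
    return [sys.executable, "train.py", "--net_cfg", net_cfg, "--train_cfg", train_cfg]


def launch_training(net_cfg, train_cfg, repo_root="."):
    """Start training only if the Visdom port is reachable."""
    if not visdom_is_up():
        raise RuntimeError("Visdom not reachable; run `python -m visdom.server` first")
    # cwd is assumed to be the repository root, where train.py lives.
    return subprocess.run(build_train_command(net_cfg, train_cfg), cwd=repo_root, check=True)
```

A call such as `launch_training("Res_SViT_E1_D1_16", "VL_CMU_CD", repo_root="TransCD_ROOT")` mirrors the example command above.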

Citing TransCD

If you use this repository or would like to refer to the paper, please use the following BibTeX entry.

@article{TransCD,
title={TransCD: Scene Change Detection via Transformer-based Architecture},
author={Zhixue Wang and Yu Zhang and Lin Luo and Nan Wang},
journal={Optics Express},
year={2021},
publisher={The Optical Society},
}

Reference

- Akcay, Samet, Amir Atapour-Abarghouei, and Toby P. Breckon. "Ganomaly: Semi-supervised anomaly detection via adversarial training." Asian conference on computer vision. Springer, Cham, 2018.
- Chen, Jieneng, et al. "Transunet: Transformers make strong encoders for medical image segmentation." arXiv preprint arXiv:2102.04306 (2021).


Owner

wangzhixue
