Codes_APN

Official codes of CVPR21 paper: Normal Learning in Videos with Attention Prototype Network (https://arxiv.org/abs/2108.11055)

Overview of our approach based on APU and CAU model:

Introduction

Frame reconstruction (current or future frame) based on Auto-Encoder (AE) is a popular method for video anomaly detection. With models trained on the normal data, the reconstruction errors of anomalous scenes are usually much larger than those of normal ones. Previous methods introduced the memory bank into AE, for encoding diverse normal patterns across the training videos. However, they are memory consuming and cannot cope with unseen new scenarios in the testing data. In this work, we propose a self-attention prototype unit (APU) to encode the normal latent space as prototypes in real time, free from extra memory cost. In addition, we introduce circulative attention mechanism to our backbone to form a novel feature extracting learner, namely Circulative Attention Unit(CAU). It enables the fast adaption capability on new scenes by only consuming a few iterations of update. Extensive experiments are conducted on various benchmarks. The superior performance over the state-of-the-art demonstrates the effectiveness of our method.

Performance

We achieved SOTA on many video anomaly detection datasets.

Unsupervised Anomaly Detection Model Training

bash train.sh

Unsupervised Anomaly Detection Model Testing

bash test.sh

If you find this work helpful, please cite:

@inproceedings{Nv2021APN,
  author    = {Chao Hu and
	       Fan Wu and
               Weijie Wu and
               Weibin Qiu and
               Shengxin Lai},
  title     = {Normal Learning in Videos with Attention Prototype Network},
  booktitle = {Computer Vision and Pattern Recognition},
  year      = {2021}
}

Normal Learning in Videos with Attention Prototype Network

Related tags

Overview

Codes_APN

Introduction

Performance

Unsupervised Anomaly Detection Model Training

Unsupervised Anomaly Detection Model Testing

Owner

The source code of CVPR17 'Generative Face Completion'.

Hashformers is a framework for hashtag segmentation with transformers.

PyTorch Implementation of Region Similarity Representation Learning (ReSim)

Revisiting, benchmarking, and refining Heterogeneous Graph Neural Networks.

Unofficial implementation of the Involution operation from CVPR 2021

Massively parallel Monte Carlo diffusion MR simulator written in Python.

Voice Gender Recognition

Generative Modelling of BRDF Textures from Flash Images [SIGGRAPH Asia, 2021]

DIT is a DTLS MitM proxy implemented in Python 3. It can intercept, manipulate and suppress datagrams between two DTLS endpoints and supports psk-based and certificate-based authentication schemes (RSA + ECC).

Contains code for the paper "Vision Transformers are Robust Learners".

PyTorch Implementation of Spatially Consistent Representation Learning(SCRL)

[ICCV 2021] Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation

Implementation of "Scaled-YOLOv4: Scaling Cross Stage Partial Network" using PyTorch framwork.

Codes for our IJCAI21 paper: Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization

Segmentation models with pretrained backbones. PyTorch.

Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.

Official Implementation of Swapping Autoencoder for Deep Image Manipulation (NeurIPS 2020)

Compute FID scores with PyTorch.

[ICRA 2022] An opensource framework for cooperative detection. Official implementation for OPV2V.

Predict the latency time of the deep learning models