Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Last update: Dec 07, 2022

Related tags

Deep Learning SELFY

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

This is the official implementation of the paper "Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition" by H.Kwon, M.Kim, S.Kwak, and M.Cho. For more information, checkout the project website and the paper on arXiv.

Environment:

Cuda: 9.0
gcc: 7.3.0
Python 3.6.8
PyTorch 1.0.1
TorchVison: 0.2.2
Spatial Correlation Sampler
Others: environment.yml

Anaconda environment setting

git clone https://github.com/arunos728/SELFY.git
cd selfy
conda env create -f environment.yml
conda activate selfy

Installing Correlation sampler

cd Pytorch-Correlation-extension
python setup.py install

# check whether SpatialCorrelationSampler is installed correctly.
python check.py forward
python check.py backward
python checkCorrelationSampler.py

Please check this repo for the detailed instructions.

Dataset preparation

Please refer to TSM repo for the detailed data preparation instructions.

File lists (.txt files in ./data) specify configurations of each video clips (path, #frames, class). We upload our Something-Something-V1 & V2 video file lists in ./data. The path of the file lists should be added into the scripts for training (or testing).

Training & Testing

For training SELFYNet on Something-Something, use the following command:

    ./scripts/train_SELFY_Something.sh

For testing your trained model on Something-Something, use the following command:

    ./scripts/test_SELFY_Something.sh

Citation

If you use this code or ideas from the paper for your research, please cite our paper:

@inproceedings{kwon2021learning,
  title={Learning self-similarity in space and time as generalized motion for video action recognition},
  author={Kwon, Heeseung and Kim, Manjin and Kwak, Suha and Cho, Minsu},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={13065--13075},
  year={2021}
}

Contact

Heeseung Kwon([email protected]), Manjin Kim([email protected])

Questions can also be left as issues in the repository. We will be happy to answer them.

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Related tags

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

Environment:

Anaconda environment setting

Installing Correlation sampler

Dataset preparation

Training & Testing

Citation

Contact

Owner

A PyTorch implementation of "Multi-Scale Contrastive Siamese Networks for Self-Supervised Graph Representation Learning", IJCAI-21

VLG-Net: Video-Language Graph Matching Networks for Video Grounding

🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)

Experiments on Flood Segmentation on Sentinel-1 SAR Imagery with Cyclical Pseudo Labeling and Noisy Student Training

TVNet: Temporal Voting Network for Action Localization

CoANet: Connectivity Attention Network for Road Extraction From Satellite Imagery

Interactive Image Generation via Generative Adversarial Networks

Converts given image (png, jpg, etc) to amogus gif.

Losslandscapetaxonomy - Taxonomizing local versus global structure in neural network loss landscapes

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

We evaluate our method on different datasets (including ShapeNet, CUB-200-2011, and Pascal3D+) and achieve state-of-the-art results, outperforming all the other supervised and unsupervised methods and 3D representations, all in terms of performance, accuracy, and training time.

Implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

Source code of our work: "Benchmarking Deep Models for Salient Object Detection"

上海交通大学全自动抢课脚本，支持准点开抢与抢课后持续捡漏两种模式。2021/06/08更新。

PyTorch and Tensorflow functional model definitions

Good Semi-Supervised Learning That Requires a Bad GAN

😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc

The official homepage of the COCO-Stuff dataset.