A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Last update: Nov 29, 2022

Related tags

Deep Learning CVPR2021_VSPW_Implement

Overview

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Preparation

Download VSPW dataset

The VSPW dataset with extracted frames and masks is available here. Now you can directly download VSPW_480P dataset.

Dependencies

Python 3.7
Pytorch 1.3.1
Numpy

Download the ImageNet-pretrained models at this link. Put it in the root folder and decompress it.

Train and Test

Resize the frames and masks of the VSPW dataset to 480p.

python change2_480p.py

Edit the .sh files in scripts/ and change the $DATAROOT to your path to VSPW_480p.

Image-based methods

PSPNet

sh scripts/run_psp.sh

OCRNet

sh scripts/run_ocr.sh

Video-based methods

TCB-PSP

sh run_temporal_psp.sh

TCB-OCR

sh run_temporal_ocr.sh

Evaluation on TC and VC

Change dataroot and prediction root in TC_cal.py and VC_perclip.py.

python TC_cal.py

python VC_perclip.py

This implementation utilized this code and RAFT.

Citation

@inproceedings{miao2021vspw,

  title={VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild},

  author={Miao, Jiaxu and Wei, Yunchao and  Wu, Yu and Liang, Chen and Li, Guangrui and Yang, Yi},

  booktitle={Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition},

  year={2021}

}

A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Related tags

Overview

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

Preparation

Download VSPW dataset

Dependencies

Train and Test

Image-based methods

Video-based methods

Evaluation on TC and VC

Citation

Owner

This is the official code of L2G, Unrolling and Recurrent Unrolling in Learning to Learn Graph Topologies.

Image processing in Python

《DeepViT: Towards Deeper Vision Transformer》(2021)

Large dataset storage format for Pytorch

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Spearmint Bayesian optimization codebase

PyTorch code for the ICCV'21 paper: "Always Be Dreaming: A New Approach for Class-Incremental Learning"

AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)

Code for "Learning Graph Cellular Automata"

Implementation of the state-of-the-art vision transformers with tensorflow

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

Data manipulation and transformation for audio signal processing, powered by PyTorch

A unofficial pytorch implementation of PAN(PSENet2): Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

[ACM MM2021] MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification

Adversarial vulnerability of powerful near out-of-distribution detection

Official code for the paper "Self-Supervised Prototypical Transfer Learning for Few-Shot Classification"

📝 Wrapper library for text generation / language models at char and word level with RNN in TensorFlow

Code for 'Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning' (AAAI 2022)

Improving Compound Activity Classification via Deep Transfer and Representation Learning

Out-of-boundary View Synthesis towards Full-frame Video Stabilization