SwiftNet

The official PyTorch implementation of SwiftNet: Real-time Video Object Segmentation, accepted at CVPR 2021.

Requirements

Python >= 3.6
PyTorch 1.5
Numpy
Pillow
opencv-python
scipy
tqdm
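
A quick way to confirm that the packages listed above are importable is sketched below. The module names (`torch`, `numpy`, `PIL`, `cv2`, `scipy`, `tqdm`) are the usual import names for these packages; the helper `find_missing` is a hypothetical name used for illustration and is not part of this repository.

```python
import importlib.util

# Mapping from pip package name to the module name it installs.
REQUIRED = {
    "torch": "torch",
    "numpy": "numpy",
    "Pillow": "PIL",
    "opencv-python": "cv2",
    "scipy": "scipy",
    "tqdm": "tqdm",
}

def find_missing(required=REQUIRED):
    """Return the pip names of packages whose module cannot be found."""
    return [pip_name for pip_name, module in required.items()
            if importlib.util.find_spec(module) is None]

if __name__ == "__main__":
    missing = find_missing()
    print("missing:", ", ".join(missing) if missing else "none")
```

The check only tests that each module can be located; it does not verify versions such as PyTorch 1.5.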

Training

The training pipeline of SwiftNet is similar to that of STM, which can be found in our reproduced STM training code.

Inference

Usage

python eval.py -g 0 -y 17 -s val -D 'path to davis'
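
When running the evaluation from a script, the command above can be assembled programmatically. The sketch below is a minimal example; the flag meanings (`-g` GPU id, `-y` DAVIS year, `-s` split, `-D` dataset root) are assumptions inferred from the command itself, and `build_eval_command` and the path `/data/DAVIS` are hypothetical.

```python
import shlex

def build_eval_command(gpu, year, split, davis_root, script="eval.py"):
    """Assemble the argument list for the evaluation command shown above.

    Assumed flag meanings (not confirmed by the source):
    -g GPU id, -y DAVIS year, -s split, -D dataset root.
    """
    return ["python", script,
            "-g", str(gpu), "-y", str(year), "-s", split, "-D", davis_root]

cmd = build_eval_command(0, 17, "val", "/data/DAVIS")  # hypothetical path
print(" ".join(shlex.quote(part) for part in cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` to launch the evaluation.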

Performance

Performance on the DAVIS-17 val set.

backbone	J&F	J	F	FPS	weights
resnet-18	77.6	75.5	79.7	65	`link`
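
In the DAVIS 2017 benchmark, the J&F score is the average of the region similarity J and the contour accuracy F. The short sketch below checks that the values in the table above are consistent with this definition; the function name `j_and_f` is illustrative only.

```python
def j_and_f(j_mean, f_mean):
    """Average of region similarity (J) and contour accuracy (F)."""
    return (j_mean + f_mean) / 2.0

# Values taken from the table above (ResNet-18 backbone).
score = j_and_f(75.5, 79.7)
print(round(score, 1))
```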

Note: FPS is measured on a single P100 GPU and excludes image loading and evaluation time.
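
A measurement of this kind can be sketched as follows. This is not the authors' benchmarking code: `measure_fps`, its parameters, and the stand-in workload are assumptions for illustration. Frames are loaded before timing starts so that I/O is excluded, and an optional `sync` callable (for example `torch.cuda.synchronize` when running on a GPU) makes sure queued device work has finished before the clock is read.

```python
import time

def measure_fps(step, frames, warmup=5, sync=None):
    """Time `step(frame)` over preloaded frames and return frames per second.

    `frames` must already be in memory so that loading is excluded.
    `sync` is an optional callable that blocks until queued device work
    finishes (e.g. torch.cuda.synchronize when running on a GPU).
    """
    frames = list(frames)
    for frame in frames[:warmup]:   # warm-up runs are not timed
        step(frame)
    if sync is not None:
        sync()
    start = time.perf_counter()
    for frame in frames:
        step(frame)
    if sync is not None:
        sync()                      # wait for pending work before stopping the clock
    elapsed = time.perf_counter() - start
    return len(frames) / elapsed

# Stand-in workload; a real measurement would call the model's forward pass.
fps = measure_fps(lambda x: sum(range(1000)), frames=range(50))
print(f"{fps:.1f} FPS")
```

Without the synchronization step, asynchronous GPU execution can make the timed loop appear faster than it is.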

Acknowledgement

This repository is partially based on the official STM repository.

Citation

If you find this repository helpful and want to cite SwiftNet in your own projects, please use the following citation info.

@inproceedings{wang2021swiftnet,
  title={SwiftNet: Real-time Video Object Segmentation},
  author={Wang, Haochen and Jiang, Xiaolong and Ren, Haibing and Hu, Yao and Bai, Song},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={1296--1305},
  year={2021}
}
