PyTorch GUI (demo) for iVOS (interactive VOS) and GIS (Guided iVOS)

Last update: Dec 09, 2022

Overview

GUI for iVOS (interactive VOS) and GIS (Guided iVOS)

GUI implementation of:

CVPR2021 paper "Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps"

ECCV2020 paper "Interactive Video Object Segmentation Using Global and Local Transfer Modules"

GitHub repositories:
CVPR2021 / ECCV2020

Project pages:
CVPR2021 / ECCV2020

Code in this repository:

Real-world GUI evaluation on DAVIS2017 based on the DAVIS framework
GUI for other videos

Prerequisite

cuda 11.0
python 3.6
pytorch 1.6.0
davisinteractive 1.0.4
numpy, cv2 (OpenCV), PyQt5, and other common Python 3 libraries
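The versions listed above can be checked before launching the GUI. The following is a minimal sketch, not part of the repository: the helper names are hypothetical, and it assumes each package exposes a `__version__` attribute (torch does; for the others this is an assumption, and "unknown" is reported when the attribute is absent).

```python
import importlib

# Versions copied from the prerequisite list; module names are the import names.
REQUIRED = {"torch": "1.6.0", "davisinteractive": "1.0.4"}
OPTIONAL_MODULES = ["numpy", "cv2", "PyQt5"]

def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", "unknown")

def find_mismatches(required, lookup=installed_version):
    """Return {name: (wanted, found)} for packages whose version differs."""
    mismatches = {}
    for name, wanted in required.items():
        found = lookup(name)
        # torch reports builds such as "1.6.0+cu110", so compare the prefix only.
        if found is None or not found.startswith(wanted):
            mismatches[name] = (wanted, found)
    return mismatches

def missing_modules(names, lookup=installed_version):
    """Return the module names that cannot be imported."""
    return [name for name in names if lookup(name) is None]
```

Calling `find_mismatches(REQUIRED)` and `missing_modules(OPTIONAL_MODULES)` in the target environment returns empty containers when everything matches the list above.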

Directory Structure

root/apps: QWidget apps.
root/checkpoints: place our checkpoints (.pth files) here.
root/dataset_torch: pytorch datasets.
root/libs: library of utility files.
root/model_CVPR2021: networks and GUI models for CVPR2021.
- detailed explanations in [Github:CVPR2021]
root/model_ECCV2020: networks and GUI models for ECCV2020.
- detailed explanations (including building the correlation package) in [Github:ECCV2020]
root/eval_GIS_RS1.py: DAVIS2017 evaluation of GIS (RS1 setting) based on the DAVIS framework.
root/eval_GIS_RS4.py: DAVIS2017 evaluation of GIS (RS4 setting) based on the DAVIS framework.
root/eval_IVOS.py: DAVIS2017 evaluation of iVOS based on the DAVIS framework.
root/IVOS_demo_customvideo.py: GUI for custom videos.
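To confirm that a checkout matches the layout above, a small check such as the following may help. It is only a sketch: the function name is hypothetical, and the entry names are taken from the listing, with `root` standing for the repository directory.

```python
from pathlib import Path

# Entries named in the directory listing; "root" is the repository checkout.
EXPECTED_DIRS = ["apps", "checkpoints", "dataset_torch", "libs",
                 "model_CVPR2021", "model_ECCV2020"]
EXPECTED_FILES = ["eval_GIS_RS1.py", "eval_GIS_RS4.py", "eval_IVOS.py",
                  "IVOS_demo_customvideo.py"]

def missing_entries(root):
    """Return the expected directories and scripts that are absent under root."""
    root = Path(root)
    missing = [name for name in EXPECTED_DIRS if not (root / name).is_dir()]
    missing += [name for name in EXPECTED_FILES if not (root / name).is_file()]
    return missing
```

An empty list from `missing_entries(".")` means every directory and script named above is present.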

Instruction

To run

Edit eval_GIS_RS1.py, eval_GIS_RS4.py, eval_IVOS.py, and IVOS_demo_customvideo.py to set the directory of your DAVIS2017 dataset and other configurations.
Download our parameters and place the file at root/checkpoints/GIS-ckpt_standard.pth.
- For CVPR2021 evaluation [Google-Drive]
- For ECCV2020 evaluation [Google-Drive]
Run eval_GIS_RS1.py, eval_GIS_RS4.py, or eval_IVOS.py for real-world GUI evaluation on DAVIS2017, or
run IVOS_demo_customvideo.py to apply our method to other videos.
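Before running any of the scripts, it can be useful to verify the two things the steps above depend on: the dataset directory and the checkpoint file. The sketch below is an illustration only. The `preflight` function and its arguments are hypothetical, and it checks only that the paths exist, not that their contents are valid.

```python
from pathlib import Path

# Checkpoint location quoted from the instructions, relative to the repository root.
CHECKPOINT = Path("checkpoints") / "GIS-ckpt_standard.pth"

def preflight(root, davis_dir):
    """Return a list of human-readable problems; an empty list means ready to run."""
    problems = []
    if not Path(davis_dir).is_dir():
        problems.append(f"DAVIS2017 directory not found: {davis_dir}")
    checkpoint = Path(root) / CHECKPOINT
    if not checkpoint.is_file():
        problems.append(f"checkpoint not found: {checkpoint}")
    return problems
```

For example, `preflight(".", "/path/to/DAVIS")` could be called from the repository root, where the dataset path is whatever you configured in the evaluation scripts.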

To use

Left-click to mark the target object and right-click to mark the background.

Select a frame to interact with by dragging the slider under the main image.
Provide an interaction.
Run VOS.
Find the worst frame and interact again. With GIS, one candidate frame (RS1) or several candidate frames (RS4) are suggested.
Repeat until you are satisfied with the VOS results.
Click the Satisfied button to record your evaluation result (time and frames consumed) in root/results.
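The bookkeeping behind the recorded result (time and frames consumed) can be pictured with a small sketch. This is not the GUI's actual code: the `SessionLog` class, the JSON format, and the file name are all assumptions made for illustration, and the real output written to root/results may look different.

```python
import json
import time
from pathlib import Path

class SessionLog:
    """Tracks elapsed time and the frames a user interacted with (illustrative)."""

    def __init__(self, clock=time.perf_counter):
        self._clock = clock
        self._start = clock()
        self.frames = []  # one frame index per interaction round

    def add_interaction(self, frame_index):
        """Record that the user interacted with the given frame."""
        self.frames.append(int(frame_index))

    def summary(self):
        """Return elapsed seconds, number of rounds, and the frames used."""
        return {
            "seconds": self._clock() - self._start,
            "rounds": len(self.frames),
            "frames": list(self.frames),
        }

    def save(self, results_dir, name="session.json"):
        # File name and JSON format are assumptions, not the GUI's real output.
        results_dir = Path(results_dir)
        results_dir.mkdir(parents=True, exist_ok=True)
        path = results_dir / name
        path.write_text(json.dumps(self.summary(), indent=2))
        return path
```

In this sketch, `add_interaction` would be called once per round, and `save("results")` when the user is satisfied.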

Reference

Please cite our papers if the implementation is useful in your work:

@Inproceedings{
Yuk2021GIS,
title={Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps},
author={Yuk Heo and Yeong Jun Koh and Chang-Su Kim},
booktitle={CVPR},
year={2021},
url={https://openaccess.thecvf.com/content/CVPR2021/papers/Heo_Guided_Interactive_Video_Object_Segmentation_Using_Reliability-Based_Attention_Maps_CVPR_2021_paper.pdf}
}

@Inproceedings{
Yuk2020IVOS,
title={Interactive Video Object Segmentation Using Global and Local Transfer Modules},
author={Yuk Heo and Yeong Jun Koh and Chang-Su Kim},
booktitle={ECCV},
year={2020},
url={https://openreview.net/forum?id=bo_lWt_aA}
}

Our real-world evaluation demo is based on the GUI of IPNet:

@Inproceedings{
Oh2019IVOS,
title={Fast User-Guided Video Object Segmentation by Interaction-and-Propagation Networks},
author={Seoung Wug Oh and Joon-Young Lee and Seon Joo Kim},
booktitle={CVPR},
year={2019},
url={https://openaccess.thecvf.com/content_ICCV_2019/papers/Oh_Video_Object_Segmentation_Using_Space-Time_Memory_Networks_ICCV_2019_paper.pdf}
}

Owner

Yuk Heo
