Transparent Transformer Segmentation

Last update: Jan 02, 2023

Related tags

Overview

Transparent Transformer Segmentation

Introduction

This repository contains the data and code for IJCAI 2021 paper Segmenting transparent object in the wild with transformer.

Environments

python 3
torch = 1.4.0
torchvision
pyyaml
Pillow
numpy

INSTALL

python setup.py develop --user

Data Preparation

create dirs './datasets/transparent/Trans10K_v2'
put the train/validation/test data under './datasets/transparent/Trans10K_v2'. Data Structure is shown below.

Trans10K_v2
├── test
│   ├── images
│   └── masks_12
├── train
│   ├── images
│   └── masks_12
└── validation
    ├── images
    └── masks_12

Download Dataset: Google Drive. Baidu Drive. code: oqms

Network Define

The code of Network pipeline is in segmentron/models/trans2seg.py.

The code of Transformer Encoder-Decoder is in segmentron/modules/transformer.py.

Train

Our experiments are based on one machine with 8 V100 GPUs with 32g memory, about 1 hour training time.

bash tools/dist_train.sh $CONFIG-FILE $GPUS

For example:

bash tools/dist_train.sh configs/trans10kv2/trans2seg/trans2seg_medium.yaml 8

Test

bash tools/dist_train.sh $CONFIG-FILE $GPUS --test TEST.TEST_MODEL_PATH $MODEL_PATH

Citations

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follows.

@article{xie2021segmenting,
  title={Segmenting transparent object in the wild with transformer},
  author={Xie, Enze and Wang, Wenjia and Wang, Wenhai and Sun, Peize and Xu, Hang and Liang, Ding and Luo, Ping},
  journal={arXiv preprint arXiv:2101.08461},
  year={2021}
}

Transparent Transformer Segmentation

Related tags

Overview

Transparent Transformer Segmentation

Introduction

Environments

INSTALL

Data Preparation

Network Define

Train

Test

Citations

Owner

谢恩泽

Team nan solution repository for FPT data-centric competition. Data augmentation, Albumentation, Mosaic, Visualization, KNN application

The King is Naked: on the Notion of Robustness for Natural Language Processing

PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

PyTorch implementation of a Real-ESRGAN model trained on custom dataset

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Python version of the amazing Reaction Mechanism Generator (RMG).

HTSeq is a Python library to facilitate processing and analysis of data from high-throughput sequencing (HTS) experiments.

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"

PyTorch implementation of Lip to Speech Synthesis with Visual Context Attentional GAN (NeurIPS2021)

Generic ecosystem for feature extraction from aerial and satellite imagery

Regulatory Instruments for Fair Personalized Pricing.

Emulation and Feedback Fuzzing of Firmware with Memory Sanitization

TensorFlow-based neural network library

Code of our paper "Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning"

Multitask Learning Strengthens Adversarial Robustness

Implementation of ConvMixer-Patches Are All You Need? in TensorFlow and Keras

All the essential resources and template code needed to understand and practice data structures and algorithms in python with few small projects to demonstrate their practical application.

A Planar RGB-D SLAM which utilizes Manhattan World structure to provide optimal camera pose trajectory while also providing a sparse reconstruction containing points, lines and planes, and a dense surfel-based reconstruction.

Tensorflow implementation of ID-Unet: Iterative Soft and Hard Deformation for View Synthesis.