Deep Learning for Human Part Discovery in Images - Chainer implementation

Last update: Sep 25, 2022

Overview

Deep Learning for Human Part Discovery in Images - Chainer implementation

NOTE: This is not official implementation. Original paper is Deep Learning for Human Part Discovery in Images.

We are now reproducing the experiments in the original paper. Any contribution will be welcomed!

Requirements

Python 2.7.11+
- Chainer 1.10+
- numpy 1.9+
- scipy 0.16+
- six
- matplotlib
- tqdm
- cv2 (opencv)

Preparation

Data

bash prepare.sh

This script downloads VOC 2010 dataset (http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar) and the authors' original dataset (http://www2.informatik.uni-freiburg.de/~oliveira/datasets/Sitting.tar.gz).

Model

You can download pre-trained FCN model from here.

We will use weights of this model and train new model on VOC dataset.

Start training

python train.py -g 0 -b 3 -e 3000 -l on -s on

Possible options

python train.py --help

GPU memory requirement

Citation from the original paper:

Each minibatch consists of just one image. The learning rate and momentum are fixed to 1e 10 and 0.99, respectively. We train the refinement layer by layer, which takes two days per refinement layer. Thus, the overall training starting from the pre-trained VGG network took 10 days on a single GPU.

Current maximum batchsize is 3 for 12 GB memory GPU.

Also it was confirmed that MBP (Late 2016, memory 16 GiB) can run with batchsize 1.

Result

Now in prep.

Visualize Prediction

python visualize.py -f PATH_TO_IMAGE_FILE

LICENSE

MIT LICENSE.

Author

shiba24, August 2016.

Contributors

bobye

Deep Learning for Human Part Discovery in Images - Chainer implementation

Related tags

Overview

Deep Learning for Human Part Discovery in Images - Chainer implementation

Requirements

Preparation

Data

Model

Start training

Possible options

GPU memory requirement

Result

Visualize Prediction

LICENSE

Author

Contributors

Owner

Shintaro Shiba

Source code of AAAI 2022 paper "Towards End-to-End Image Compression and Analysis with Transformers".

Godot RL Agents is a fully Open Source packages that allows video game creators

Unsupervised MRI Reconstruction via Zero-Shot Learned Adversarial Transformers

TransGAN: Two Transformers Can Make One Strong GAN

A micro-game "flappy bird".

Simple tutorials on Pytorch DDP training

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time"

TensorFlow implementation for Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How

Official implementation of "MetaSDF: Meta-learning Signed Distance Functions"

Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"

Official Implementation for Fast Training of Neural Lumigraph Representations using Meta Learning.

High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.

Pytorch implementation for A-NeRF: Articulated Neural Radiance Fields for Learning Human Shape, Appearance, and Pose

Towards Multi-Camera 3D Human Pose Estimation in Wild Environment

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)

reimpliment of DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation

Codes for TIM2021 paper "Anchor-Based Spatio-Temporal Attention 3-D Convolutional Networks for Dynamic 3-D Point Cloud Sequences"

Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels

Unsupervised Learning of Multi-Frame Optical Flow with Occlusions