A simple baseline for 3d human pose estimation in PyTorch.

Last update: Jan 06, 2023

Overview

3d_pose_baseline_pytorch

A PyTorch implementation of a simple baseline for 3d human pose estimation. You can also check the original TensorFlow implementation written by Julieta Martinez et al. Some of the data-processing code is taken from the original version; thanks to the authors.

This is the code for the paper:

@inproceedings{martinez_2017_3dbaseline,
  title={A simple yet effective baseline for 3d human pose estimation},
  author={Martinez, Julieta and Hossain, Rayat and Romero, Javier and Little, James J.},
  booktitle={ICCV},
  year={2017}
}
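
As a rough orientation, the network described in the paper is a small fully connected model that lifts 2d joint coordinates to 3d: a linear input layer, a few residual blocks of linear layers with batch normalization, ReLU and dropout, and a linear output layer. The sketch below illustrates that idea only; the class names `ResidualBlock` and `LiftingNet`, the 16-joint input/output layout, and the default sizes are assumptions made for this example and are not taken from this repository's code.

```python
import torch
import torch.nn as nn


class ResidualBlock(nn.Module):
    """Two linear layers with batch norm, ReLU and dropout, plus a skip connection."""

    def __init__(self, linear_size: int, p_dropout: float):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(linear_size, linear_size),
            nn.BatchNorm1d(linear_size),
            nn.ReLU(),
            nn.Dropout(p_dropout),
            nn.Linear(linear_size, linear_size),
            nn.BatchNorm1d(linear_size),
            nn.ReLU(),
            nn.Dropout(p_dropout),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual connection: add the block input to the block output.
        return x + self.layers(x)


class LiftingNet(nn.Module):
    """Hypothetical 2d-to-3d lifting network (assumed: 16 joints in and out)."""

    def __init__(self, n_joints: int = 16, linear_size: int = 1024,
                 num_stage: int = 2, p_dropout: float = 0.5):
        super().__init__()
        self.inp = nn.Sequential(
            nn.Linear(n_joints * 2, linear_size),  # flattened (x, y) per joint
            nn.BatchNorm1d(linear_size),
            nn.ReLU(),
            nn.Dropout(p_dropout),
        )
        self.blocks = nn.Sequential(
            *[ResidualBlock(linear_size, p_dropout) for _ in range(num_stage)]
        )
        self.out = nn.Linear(linear_size, n_joints * 3)  # flattened (x, y, z) per joint

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.out(self.blocks(self.inp(x)))


if __name__ == "__main__":
    model = LiftingNet()
    model.eval()  # use running batch-norm statistics and disable dropout
    with torch.no_grad():
        poses_3d = model(torch.zeros(4, 32))  # batch of 4 flattened 2d poses
    print(poses_3d.shape)
```

The `linear_size`, `num_stage` and `p_dropout` arguments are named to mirror the `--linear_size`, `--num_stage` and `--dropout` options listed under Usage; whether they map one-to-one onto the repository's model is an assumption.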

WIP

Training code
Testing code

Datasets

Human3.6M
HumanEva

Dependencies

~~h5py~~
PyTorch >= 1.0.0

Installation

First, clone this repository:

git clone --recursive https://github.com/weigq/3d_pose_baseline_pytorch.git

Download the pre-processed Human3.6M dataset (3d joints), then unpack it and remove the archive:
```
unzip human36m.zip
rm human36m.zip
```

Usage

Data preprocessing

Train

Train on Human3.6M ground-truth 2d joints:

# optional arguments, you can access more details in opt.py
main.py [-h] [--data_dir DATA_DIR] [--exp EXP] [--ckpt CKPT]
           [--load LOAD] [--test] [--resume]
           [--action {all,All}]
           [--max_norm] [--linear_size LINEAR_SIZE]
           [--num_stage NUM_STAGE] [--use_hg] [--lr LR]
           [--lr_decay LR_DECAY] [--lr_gamma LR_GAMMA] [--epochs EPOCHS]
           [--dropout DROPOUT] [--train_batch TRAIN_BATCH]
           [--test_batch TEST_BATCH] [--job JOB] [--no_max] [--max]
           [--procrustes]
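
The `--lr`, `--lr_decay` and `--lr_gamma` options suggest a step-based exponential learning-rate decay. The sketch below shows one common form of such a schedule; the exact formula used in `main.py` is not given here, so the function `decayed_lr` and the example values are assumptions for illustration only.

```python
def decayed_lr(base_lr: float, step: int, decay_steps: int, gamma: float) -> float:
    """Exponential decay (assumed form): the rate shrinks by a factor of
    `gamma` every `decay_steps` optimizer steps, interpolated smoothly."""
    return base_lr * gamma ** (step / decay_steps)


if __name__ == "__main__":
    # Illustrative values only; check opt.py for the actual defaults.
    for step in (0, 50_000, 100_000, 200_000):
        print(step, decayed_lr(1e-3, step, decay_steps=100_000, gamma=0.96))
```

In PyTorch, a value computed this way can be applied by assigning it to `param_group["lr"]` for each entry of `optimizer.param_groups`.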

Train the model:

python main.py --exp example

You will get training and testing loss curves.

~~Train on Human3.6M 2d joints detected by stacked hourglass:~~

Test

For a quick demo, you can download the model pretrained on ground-truth 2d poses and run:

python main.py --load $PATH_TO_gt_ckpt_best.pth.tar --test

and you will get the results:

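| error (mm) | direct. | discuss. | eat. | greet. | phone | photo | pose | purch. | sit | sitd. | smoke | wait | walkd. | walk | walkT | avg |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| original version | 37.7 | 44.4 | 40.3 | 42.1 | 48.2 | 54.9 | 44.4 | 42.1 | 54.6 | 58.0 | 45.1 | 46.4 | 47.6 | 36.4 | 40.4 | 45.5 |
| pytorch version | 35.7 | 42.3 | 39.4 | 40.7 | 44.5 | 53.3 | 42.8 | 40.1 | 52.5 | 53.9 | 42.8 | 43.1 | 44.1 | 33.4 | 36.3 | - |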
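
The `avg` entry for the PyTorch row is left blank in the table above. Assuming `avg` is the unweighted mean over the 15 actions (an assumption, though it does reproduce the 45.5 reported for the original version), the missing value can be estimated from the per-action numbers:

```python
# Per-action errors copied from the table above, in column order.
original = [37.7, 44.4, 40.3, 42.1, 48.2, 54.9, 44.4, 42.1,
            54.6, 58.0, 45.1, 46.4, 47.6, 36.4, 40.4]
pytorch = [35.7, 42.3, 39.4, 40.7, 44.5, 53.3, 42.8, 40.1,
           52.5, 53.9, 42.8, 43.1, 44.1, 33.4, 36.3]


def mean(values):
    """Unweighted arithmetic mean over actions."""
    return sum(values) / len(values)


if __name__ == "__main__":
    print(round(mean(original), 1))  # 45.5, matches the reported avg
    print(round(mean(pytorch), 1))   # 43.0
```

This is a derived figure rather than a reported result, so it should be treated as an estimate.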

License

MIT
