Code for "Human Pose Regression with Residual Log-likelihood Estimation", ICCV 2021 Oral

Last update: Dec 24, 2022

Overview

Human Pose Regression with Residual Log-likelihood Estimation

[Paper] [arXiv] [Project Page]

Human Pose Regression with Residual Log-likelihood Estimation
Jiefeng Li, Siyuan Bian, Ailing Zeng, Can Wang, Bo Pang, Wentao Liu, Cewu Lu
ICCV 2021 Oral

TODO

Provide minimal implementation of RLE loss.
Provide implementation on Human3.6M dataset.
Provide implementation on COCO dataset.

Installation

Install PyTorch >= 1.1.0 following the official instructions.
Install rlepose:

pip install cython
python setup.py develop

Install COCOAPI:

pip install -U 'git+https://github.com/cocodataset/cocoapi.git#subdirectory=PythonAPI'

Initialize the data directory:

mkdir data

Download the COCO data and arrange it under the data directory as follows:

|-- data
`-- |-- coco
    `-- |-- annotations
        |   |-- person_keypoints_train2017.json
        |   `-- person_keypoints_val2017.json
        `-- images
            |-- train2017
            |   |-- 000000000009.jpg
            |   |-- 000000000025.jpg
            |   |-- 000000000030.jpg
            |   |-- ... 
            `-- val2017
                |-- 000000000139.jpg
                |-- 000000000285.jpg
                |-- 000000000632.jpg
                |-- ...
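
Once the steps above are done, a quick sanity check can catch a misplaced annotation file or an outdated PyTorch before training starts. The following is a minimal sketch, not part of rlepose: the helper names missing_coco_paths and meets_min_version are hypothetical, the paths are copied from the tree above, and the version comparison deliberately ignores suffixes such as "+cu117".

```python
import re
from pathlib import Path

# Relative paths taken from the directory tree shown above.
EXPECTED_COCO_PATHS = (
    "coco/annotations/person_keypoints_train2017.json",
    "coco/annotations/person_keypoints_val2017.json",
    "coco/images/train2017",
    "coco/images/val2017",
)


def missing_coco_paths(data_root="data"):
    """Return the expected COCO paths that do not exist under data_root.

    Hypothetical helper: it only checks that the files and folders exist,
    not that the images or annotations are complete.
    """
    root = Path(data_root)
    return [rel for rel in EXPECTED_COCO_PATHS if not (root / rel).exists()]


def meets_min_version(version, minimum=(1, 1, 0)):
    """Compare the leading numeric part of a version string with a minimum.

    Hypothetical helper: local or pre-release suffixes are ignored, which is
    a simplification rather than a full version parser.
    """
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    parts = tuple(int(p) if p is not None else 0 for p in match.groups())
    return parts >= minimum
```

Run from the repository root, missing_coco_paths() returns an empty list when the layout matches the tree, and meets_min_version(torch.__version__) reports whether the installed PyTorch satisfies the 1.1.0 minimum (this assumes torch has been imported first).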

Train from scratch

./scripts/train.sh ./configs/256x192_res50_regress-flow.yaml train_rle

Evaluation

Download the pretrained model from Google Drive, then pass its path to the validation script:

./scripts/validate.sh ./configs/256x192_res50_regress-flow.yaml ./coco-laplace-rle.pth

Citing

If our code helps your research, please consider citing the following paper:

@inproceedings{li2021human,
    title={Human Pose Regression with Residual Log-likelihood Estimation},
    author={Li, Jiefeng and Bian, Siyuan and Zeng, Ailing and Wang, Can and Pang, Bo and Liu, Wentao and Lu, Cewu},
    booktitle={ICCV},
    year={2021}
}
