Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Last update: Dec 31, 2022

Overview

ONNX-Mobile-Human-Pose-3D

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model.

Original image for inference: (https://static2.diariovasco.com/www/pre2017/multimedia/noticias/201412/01/media/DF0N5391.jpg)

❗ ⚠️ Known issues

The models works well when the person is looking forward and without occlusions, it will start to fail as soon as the person is occluded.
The model is fast, but the 3D representation is slow due to matplotlib, this will be fixed. The 3d representation can be ommitted for faster inference by setting draw_3dpose to False

Requirements

OpenCV, imread-from-url, scipy, onnx and onnxruntime. Also, pafy and youtube-dl are required for youtube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl

ONNX model

The original models were converted to different formats (including .onnx) by PINTO0309, download the models from his repository and save them into the models folder.

YOLOv5s: You will also need an object detector to first detect the people in the image. Download the model from the model zoo and save the .onnx version into the models folder.

Original model

The original model was taken from the original repository.

Examples

Image inference:

python imagePoseEstimation.py

Video inference:

python videoPoseEstimation.py

Webcam inference:

python webcamPoseEstimation.py

Inference video Example

References:

Mobile human pose model: https://github.com/SangbumChoi/MobileHumanPose
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
3DMPPE_POSENET_RELEASE repository: https://github.com/mks0601/3DMPPE_POSENET_RELEASE
Original YOLOv5 repository: https://github.com/ultralytics/yolov5
Original paper: https://openaccess.thecvf.com/content/CVPR2021W/MAI/html/Choi_MobileHumanPose_Toward_Real-Time_3D_Human_Pose_Estimation_in_Mobile_Devices_CVPRW_2021_paper.html

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Related tags

Overview

ONNX-Mobile-Human-Pose-3D

❗ ⚠️ Known issues

Requirements

Installation

ONNX model

Original model

Examples

Inference video Example

References:

Owner

Ibai Gorordo

Repository for Multimodal AutoML Benchmark

Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

A tensorflow implementation of Fully Convolutional Networks For Semantic Segmentation

Code for Mesh Convolution Using a Learned Kernel Basis

Repo for FUZE project. I will also publish some Linux kernel LPE exploits for various real world kernel vulnerabilities here. the samples are uploaded for education purposes for red and blue teams.

Repository accompanying the "Sign Pose-based Transformer for Word-level Sign Language Recognition" paper

Code in conjunction with the publication 'Contrastive Representation Learning for Hand Shape Estimation'

BuildingNet: Learning to Label 3D Buildings

Image segmentation with private İstanbul Dataset

[ACMMM 2021 Oral] Enhanced Invertible Encoding for Learned Image Compression

CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching（CVPR2021）

The all new way to turn your boring vector meshes into the new fad in town; Voxels!

Paper: De-rendering Stylized Texts

A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run.

Simple Pixelbot for Diablo 2 Resurrected written in python and opencv.

Python interface for the DIGIT tactile sensor

Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Efficient face emotion recognition in photos and videos

No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency