Cascaded Pyramid Network (CPN) based on Keras (TensorFlow backend)

Last update: Nov 22, 2021

Overview

ML2 Takehome Project

A reimplementation of the paper "Cascaded Pyramid Network for Multi-Person Pose Estimation".

Dataset

The model uses the COCO dataset, which can be downloaded by running:

chmod +x coco.sh
./coco.sh

The data is saved in the coco/ folder.

I misunderstood the assignment at first and did not realize it until I looked at a PyTorch implementation on GitHub for reference.

The data does not need to be cropped from the original images in advance. That is, the person crops do not have to be written to disk as separate images; the program only needs to cut them out during training. In any case, I did the cropping and saved the necessary information, such as the keypoints and visibility flags (0, 1, 2), to a dataframe for the training and validation data.

python dataprocessing/process_data.py
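The on-the-fly alternative described above, cutting each person out of the full image at training time instead of saving crops to disk, could look roughly like the sketch below. It is not the code in dataprocessing/process_data.py: the function names `split_keypoints` and `crop_person` are hypothetical, and it only assumes the standard COCO annotation layout, where `bbox` is [x, y, width, height] and `keypoints` is a flat list of (x, y, v) triplets with v = 0 (not labeled), 1 (labeled but not visible), or 2 (labeled and visible).

```python
import numpy as np


def split_keypoints(flat):
    """Split COCO's flat [x1, y1, v1, x2, y2, v2, ...] list.

    Returns (K, 2) float coordinates and (K,) integer visibility flags.
    """
    kps = np.asarray(flat, dtype=np.float32).reshape(-1, 3)
    return kps[:, :2], kps[:, 2].astype(np.int64)


def crop_person(image, bbox, keypoints, visibility):
    """Crop one person from `image` and move keypoints into crop coordinates.

    `bbox` is COCO-style [x, y, width, height] in pixels. The box is
    rounded outward and clipped to the image borders.
    """
    h, w = image.shape[:2]
    x0 = int(max(0, np.floor(bbox[0])))
    y0 = int(max(0, np.floor(bbox[1])))
    x1 = int(min(w, np.ceil(bbox[0] + bbox[2])))
    y1 = int(min(h, np.ceil(bbox[1] + bbox[3])))

    crop = image[y0:y1, x0:x1]

    # Shift keypoints so that (0, 0) is the top-left corner of the crop.
    shifted = keypoints - np.array([x0, y0], dtype=np.float32)
    # Unlabeled keypoints (v == 0) have no real position; keep them at 0.
    shifted[visibility == 0] = 0.0
    return crop, shifted
```

A data generator could call `crop_person` once per annotation when it builds a batch, so only the original images and the dataframe of boxes and keypoints need to be stored.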

Training

python train.py

Test

Download the checkpoint here and unzip it.

python test.py

The results are shown below. I know they are not perfect, but I think the model will get better with more time.

Input	Prediction

Failed cases

Input	Prediction

Notes

The model had not finished training, so I was not able to test it.
There was a typo in the code that created the dataset. I only found it on Friday, so everything was essentially a fresh start. I will keep training and update the weight file, the test code, and the results.

Reference

This repo is heavily based on the PyTorch version, the TensorFlow version, and the official Keras tutorial on keypoint estimation.

Owner

Vo Van Tu
