A learning-based data collection tool for human segmentation

Last update: Jun 24, 2022

Overview

FullBodyFilter

A Learning-Based Data Collection Tool For Human Segmentation

Overview

Human segmentation is a difficult machine learning task of identifying and extracting the human in a picture. Most of the time this is done by using a convolutional neural network. In order to achieve an accurate and robust model, large amounts of data with varying human poses need to be collected to train the model. Collecting and labeling train data by hand takes lots of time and resources. This project explores another option to use automtation to collect and label pre-existing data from internet videos.

The model that was focused on is the DTEN ME model used for Zoom meetings virtual background.

Openpose is used to filter the video for suitable frames, in particular single person full body frames. Mask R-CNN is the teacher model that generates training labels. To find which images perform poorly on ME model, a comparison is done between ME masks and Mask R-CNN masks. The result is a set of images and masks that can be used as training data.

Overview of Program

A full report of the system design and implemenation details can be found in doc

Sample Results

Examples of train data saved. In each image bottom left is Mask R-CNN mask and bottom right is ME mask.

Usage

This project relies on Openpose and Mask R-CNN and all their dependencies. Instructions on how to set up each are found in there respective directories here.

Documentation on how to use scripts are located in doc.

A learning-based data collection tool for human segmentation

Related tags

Overview

FullBodyFilter

Contents

Overview

Sample Results

Usage

Owner

Robert Jiang

DeepCAD: A Deep Generative Network for Computer-Aided Design Models

A python package for generating, analyzing and visualizing building shadows

Implementation of paper "Self-supervised Learning on Graphs:Deep Insights and New Directions"

A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.

Code and Datasets from the paper "Self-supervised contrastive learning for volcanic unrest detection from InSAR data"

We simulate traveling back in time with a modern camera to rephotograph famous historical subjects.

Neural-Pull: Learning Signed Distance Functions from Point Clouds by Learning to Pull Space onto Surfaces(ICML 2021)

Shitty gaze mouse controller

最新版本yolov5+deepsort目标检测和追踪，支持5.0版本可训练自己数据集

Structured Edge Detection Toolbox

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Callable PyTrees and filtered JIT/grad transformations => neural networks in JAX.

Source code for Zalo AI 2021 submission

RodoSol-ALPR Dataset

Hough Transform and Hough Line Transform Using OpenCV

Official PyTorch Implementation of GAN-Supervised Dense Visual Alignment

Moiré Attack (MA): A New Potential Risk of Screen Photos [NeurIPS 2021]

Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.

RDA: Robust Domain Adaptation via Fourier Adversarial Attacking

Nested cross-validation is necessary to avoid biased model performance in embedded feature selection in high-dimensional data with tiny sample sizes