performing moving objects segmentation using image processing techniques with opencv and numpy

Last update: Dec 12, 2022

Overview

Moving Objects Segmentation

On this project I tried to perform moving objects segmentation using background subtraction technique. the introduced method relies on two important functions:

Create clean background
Generate mask sequence

Create clean background

The main idea is taking several frames from the video sequence and estimating the covered regions by any distracting elements that come in front of the clean background, this method only works if the frames are aligned and the camera is not moving otherwise the method is not going to work.

createCleanBG(input_path, memorize, skipping, save_result, output_path)

input_path (string): path of input video sequence.

memorize (int): number of frames that are used to generate a clean background, usually around 10 to 15 frames works fine.

skipping (int): number of frames to skip before caching another frame to generate the clean background, if the motion on the video is too slow consider increasing this number.

save_result (boolean): boolean parameter to decide whether to save the generated result or not.

output_path (string): output path of the generated clean background.

Generate mask sequence

Mask sequence is generated by subtracting each video frame from the clean background, image adjustments are used to generate better masking results.

Vibrance

Vibrance adjustment increases the intensity of the low saturated colors in an image and leaves the saturated colors as it is, this adjustment is inspired by Adobe Photoshop filter, although the implementation is different because the desired target is not generating a plausible colors by human eyes but shifting the colors in the low saturated pixels to be more saturated, so after subtraction any slight difference in the colors would become more clearer after increasing the vibrance.

Brightening Shadows

This adjustment only affects the dark pixels of the image and increases its luminance, applying this adjustment would also create a wider range of values after the subtraction and it would become clearer to threshold and distinguish the difference in color values after brightening the dark pixels.

The figure below shows that applying image adjustments before segmentation would generate better segmentation results.

Applications

The generated masks could be used for many different purposes, for example it could be used to localize moving objects on a video footage.

The generated masks could be used also on VFX production as luma matte to mask objects from the background.

performing moving objects segmentation using image processing techniques with opencv and numpy

Related tags

Overview

Moving Objects Segmentation

Create clean background

Generate mask sequence

Vibrance

Brightening Shadows

Applications

Owner

Mohamed Magdy

This repo includes our code for evaluating and improving transferability in domain generalization (NeurIPS 2021)

Balancing Principle for Unsupervised Domain Adaptation

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

CNNs for Sentence Classification in PyTorch

A whale detector design for the Kaggle whale-detector challenge!

[NeurIPS 2021]: Are Transformers More Robust Than CNNs? (Pytorch implementation & checkpoints)

Reproduce partial features of DeePMD-kit using PyTorch.

Anonymize BLM Protest Images

Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

This tutorial repository is to introduce the functionality of KGTK to first-time users

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator

Official implementation of Pixel-Level Bijective Matching for Video Object Segmentation

Implementation of ICCV21 paper: PnP-DETR: Towards Efficient Visual Analysis with Transformers

SOTA easy to use PyTorch-based DL training library

Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild

Drone Task1 - Drone Task1 With Python

Official Implementation of Swapping Autoencoder for Deep Image Manipulation (NeurIPS 2020)

TFOD-MASKRCNN - Tensorflow MaskRCNN With Python

Massively parallel Monte Carlo diffusion MR simulator written in Python.

根据midi文件演奏“风物之诗琴”的脚本 "Windsong Lyre" auto play