Python scripts performing class agnostic object localization using the Object Localization Network model in ONNX.

Last update: Oct 14, 2022

Overview

ONNX Object Localization Network


Original image: https://en.wikipedia.org/wiki/File:Interior_design_865875.jpg

Important

I added some logic to the box color selection to make the result look nicer. Because it runs K-Means on every box, it can be slow. If you only care about speed, either draw all boxes in the same color or use random colors.
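The two faster alternatives mentioned above could look roughly like the sketch below. The function names `fixed_colors` and `random_colors` are hypothetical and do not come from this repository. The sketch also assumes that colors are BGR tuples, as OpenCV drawing functions expect.

```python
import random

# Hypothetical helpers, not part of the repository's scripts.
# Assumption: each color is a (B, G, R) tuple with values in 0..255.

def fixed_colors(num_boxes, color=(0, 255, 0)):
    """Return the same color for every box (the cheapest option)."""
    return [color] * num_boxes


def random_colors(num_boxes, seed=None):
    """Return one random color per box; pass a seed for repeatable colors."""
    rng = random.Random(seed)
    return [tuple(rng.randrange(256) for _ in range(3)) for _ in range(num_boxes)]
```

Either list can be indexed by box number when drawing, which avoids the per-box K-Means step entirely.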

Requirements

Check the requirements.txt file.
For ONNX, install onnxruntime-gpu if you have an NVIDIA GPU; otherwise use the onnxruntime library.
Additionally, pafy and youtube-dl are required for YouTube video inference.

Installation

git clone https://github.com/ibaiGorordo/ONNX-Object-Localization-Network.git
cd ONNX-Object-Localization-Network
pip install -r requirements.txt

ONNX Runtime

For computers with an NVIDIA GPU: pip install onnxruntime-gpu

Otherwise: pip install onnxruntime
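To check which of the two packages is actually in use, you can ask ONNX Runtime for its available execution providers. The sketch below is only an illustration: `pick_providers` and `create_session` are hypothetical helper names, and the scripts in this repository may select providers differently.

```python
# Hypothetical helpers, not taken from the repository's scripts.

def pick_providers(available):
    """Prefer the CUDA provider when present, then fall back to the CPU."""
    preferred = ["CUDAExecutionProvider", "CPUExecutionProvider"]
    chosen = [p for p in preferred if p in available]
    # Always return at least the CPU provider so a session can be created.
    return chosen or ["CPUExecutionProvider"]


def create_session(model_path):
    """Create an inference session using the providers chosen above."""
    # Imported here so pick_providers can be used without onnxruntime installed.
    import onnxruntime as ort

    providers = pick_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)
```

With onnxruntime-gpu installed and a working CUDA setup, `onnxruntime.get_available_providers()` should include `CUDAExecutionProvider`. With the plain onnxruntime package, only CPU providers are listed.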

For YouTube video inference

pip install youtube_dl
pip install git+https://github.com/zizo-pro/[email protected]

ONNX model

The original model was converted to ONNX by PINTO0309. Download the models using the download script in his repository and save them in the models folder.

The models are licensed under the Apache-2.0 License: https://github.com/mcahny/object_localization_network/blob/main/LICENSE
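Before running the example scripts, it can help to confirm that the downloaded files really ended up in the models folder. The following is a minimal sketch. `find_onnx_models` is a hypothetical helper, and the exact model file names depend on what the download script produces.

```python
from pathlib import Path

# Hypothetical helper, not part of the repository.
def find_onnx_models(models_dir="models"):
    """Return every .onnx file found under models_dir, sorted by path."""
    folder = Path(models_dir)
    if not folder.is_dir():
        # The folder is missing, so no models have been downloaded yet.
        return []
    return sorted(p for p in folder.rglob("*.onnx") if p.is_file())


if __name__ == "__main__":
    found = find_onnx_models()
    if not found:
        print("No ONNX models found in the models folder.")
    for path in found:
        print(path)
```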

PyTorch model

The original PyTorch model can be found in this repository: https://github.com/mcahny/object_localization_network

Examples

Image inference:

python image_object_localization.py

Webcam inference:

python webcam_object_localization.py

Video inference: https://youtu.be/n9qhQJXYUWo

python video_object_localization.py

Original video: https://youtu.be/vgJUXvkdS78

References:

Object-Localization-Network model: https://github.com/mcahny/object_localization_network
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
Original paper: https://arxiv.org/abs/2108.06753

Owner

Ibai Gorordo
