English | 中文

OpenIVA

OpenIVA is an end-to-end intelligent video analytics development toolkit based on different inference backends, designed to help individual users and start-ups quickly launch their own video AI services.
OpenIVA implements varied mainstream facial recognition, object detection, segmentation and landmark detection algorithms. And it provides an efficient and lightweight service deployment framework with a modular design. Users only need to replace the algorithm model used for their own tasks.

Features

Common mainstream algorithms

Provides latest fast accurate pre-trained models for facial recognition, object detection, segmentation and landmark detection tasks

Multi inference backends

Supports TensorlayerX/ TensorRT/ onnxruntime

High performance

Achieves high performance on CPU/GPU/Ascend platforms, achieve inference speed above 3000it/s

Asynchronous & multithreading

Use multithreading and queue to achieve high device utilization for inference and pre/post-processing

Lightweight service

Use Flask for lightweight intelligent application services

Modular design

You can quickly start your intelligent analysis service, only need to replace the AI models

GUI visualization tools

Start analysis tasks only by clicking buttons, and show visualized results in GUI windows, suitable for multiple tasks

Performance benchmark

Testing environments

i5-10400 6c12t
RTX3060
Ubuntu18.04
CUDA 11.1
TensorRT-7.2.3.4
onnxruntime with EPs:
- CPU(Default)
- CUDA(Manually Compiled)
- OpenVINO(Manually Compiled)
- TensorRT(Manually Compiled)

Performance

Facial recognition

Run
python test_landmark.py
batchsize=8, top_k=68, 67 faces in the image

Face detection
Model face_detector_640_dy_sim

onnxruntime EPs FPS faces per sec

CPU 32 2075

OpenVINO 81 5374

CUDA 105 7074

TensorRT(FP32) 124 7948

TensorRT(FP16) 128 8527
Face landmark
Model landmarks_68_pfld_dy_sim

onnxruntime EPs faces per sec

CPU 69

OpenVINO 890

CUDA 2061

TensorRT(FP32) 2639

TensorRT(FP16) 3131

onnxruntime EPs	FPS	faces per sec
CPU	32	2075
OpenVINO	81	5374
CUDA	105	7074
TensorRT(FP32)	124	7948
TensorRT(FP16)	128	8527

onnxruntime EPs	faces per sec
CPU	69
OpenVINO	890
CUDA	2061
TensorRT(FP32)	2639
TensorRT(FP16)	3131

Run
python test_face.py
batchsize=8

Face embedding
Model arc_mbv2_ccrop_sim

onnxruntime EPs faces per sec

CPU 212

OpenVINO 865

CUDA 1790

TensorRT(FP32) 2132

TensorRT(FP16) 2812

onnxruntime EPs	faces per sec
CPU	212
OpenVINO	865
CUDA	1790
TensorRT(FP32)	2132
TensorRT(FP16)	2812

Objects detection

Run
python test_yolo.py
batchsize=8 , 4 objects in the image

YOLOX objects detect
Model yolox_s(ms_coco)

onnxruntime EPs FPS Objects per sec

CPU 9.3 37.2

OpenVINO 13 52

CUDA 77 307

TensorRT(FP32) 95 380

TensorRT(FP16) 128 512

Model yolox_m(ms_coco)

onnxruntime EPs FPS Objects per sec

CPU 4 16

OpenVINO 5.5 22

CUDA 46.8 187

TensorRT(FP32) 64 259

TensorRT(FP16) 119 478

Model yolox_nano(ms_coco)

onnxruntime EPs FPS Objects per sec

CPU 47 188

OpenVINO 80 320

CUDA 210 842

TensorRT(FP32) 244 977

TensorRT(FP16) 269 1079

Model yolox_tiny(ms_coco)

onnxruntime EPs FPS Objects per sec

CPU 33 133

OpenVINO 43 175

CUDA 209 839

TensorRT(FP32) 248 995

TensorRT(FP16) 327 1310

Intelligent Video Analytics toolkit based on different inference backends.

Related tags

Overview

OpenIVA

Features

Performance benchmark

Testing environments

Performance

Facial recognition

Objects detection

Progress

Owner

Quantum Liu

AoT is a system for automatically generating off-target test harness by using build information.

Neural Scene Flow Prior (NeurIPS 2021 spotlight)

Code accompanying our paper Feature Learning in Infinite-Width Neural Networks

Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations

Deep Learning Algorithms for Hedging with Frictions

This is the official code for the paper "Tracker Meets Night: A Transformer Enhancer for UAV Tracking".

This project aims at building a real-time wide band channel sounder using USRPs

git《FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding》(CVPR 2021) GitHub: [fig8]

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

BADet: Boundary-Aware 3D Object Detection from Point Clouds (Pattern Recognition 2022)

Tensors and neural networks in Haskell

Python package for downloading ECMWF reanalysis data and converting it into a time series format.

Tensorflow-seq2seq-tutorials - Dynamic seq2seq in TensorFlow, step by step

MWPToolkit is a PyTorch-based toolkit for Math Word Problem (MWP) solving.

This repository contains the database and code used in the paper Embedding Arithmetic for Text-driven Image Transformation

A multi-scale unsupervised learning for deformable image registration

BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting

an Evolutionary Algorithm assisted GAN

FEDn is an open-source, modular and ML-framework agnostic framework for Federated Machine Learning

University of Rochester 2021 Summer REU focusing on music sentiment transfer using CycleGAN

onnxruntime EPs	FPS	Objects per sec
CPU	9.3	37.2
OpenVINO	13	52
CUDA	77	307
TensorRT(FP32)	95	380
TensorRT(FP16)	128	512

onnxruntime EPs	FPS	Objects per sec
CPU	4	16
OpenVINO	5.5	22
CUDA	46.8	187
TensorRT(FP32)	64	259
TensorRT(FP16)	119	478