This is a vision-based 3D model manipulation and control UI.

Last update: Oct 23, 2022

Overview

Manipulation of 3D Models Using Hand Gestures

This program lets users manipulate 3D models (.obj format) with their hands. The project supports both the OAK-D and OAK-D-Lite.

Install dependencies

On an Intel-based macOS or Linux machine, run the following command in the terminal:

git clone https://github.com/cortictechnology/vision_ui.git
cd vision_ui
python3 -m pip install -r requirements.txt
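After the install step, it can be useful to confirm that the packages listed in requirements.txt are actually present in the active Python environment. The following is a minimal sketch, not part of the project: the helper names `requirement_names` and `missing_packages` are hypothetical, and the parsing only handles simple `name` or `name==version` style lines (pip options and URL requirements are skipped).

```python
import re
from importlib import metadata
from pathlib import Path


def requirement_names(text):
    """Extract bare distribution names from requirements.txt content."""
    names = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        # Skip blanks, pip options such as "-r other.txt", and URL requirements.
        if not line or line.startswith("-") or "://" in line:
            continue
        # Keep only the leading name, before any version specifier or extras.
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0))
    return names


def missing_packages(names):
    """Return the names that are not installed in the current environment."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    req_file = Path("requirements.txt")
    if req_file.exists():
        missing = missing_packages(requirement_names(req_file.read_text()))
        print("Missing packages:", ", ".join(missing) if missing else "none")
    else:
        print("requirements.txt not found; run this from the vision_ui folder")
```

Run it from inside the vision_ui folder. An empty result only means the distributions are installed; it does not check that the installed versions satisfy the pinned ones.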

On Linux only, make sure your OAK-D device is unplugged, then run the following:

echo 'SUBSYSTEM=="usb", ATTRS{idVendor}=="03e7", MODE="0666"' | sudo tee /etc/udev/rules.d/80-movidius.rules
sudo udevadm control --reload-rules && sudo udevadm trigger
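To double-check that the rule written by the commands above is in place, the rules file can be inspected for a matching line. This is a rough sketch only: the function name `has_vendor_rule` is hypothetical, and the check is a plain substring match on the rule text shown above, not a full udev rule parser.

```python
from pathlib import Path

# Path and vendor ID are taken from the commands in this README.
RULES_FILE = Path("/etc/udev/rules.d/80-movidius.rules")
VENDOR_ID = "03e7"


def has_vendor_rule(rules_text, vendor_id=VENDOR_ID):
    """Return True if a non-comment line targets USB devices with this vendor ID."""
    needle = f'ATTRS{{idVendor}}=="{vendor_id}"'
    for line in rules_text.splitlines():
        line = line.strip()
        if line.startswith("#"):  # ignore commented-out rules
            continue
        if 'SUBSYSTEM=="usb"' in line and needle in line:
            return True
    return False


if __name__ == "__main__":
    if RULES_FILE.exists():
        ok = has_vendor_rule(RULES_FILE.read_text())
        print("udev rule present" if ok else "rules file exists but has no matching rule")
    else:
        print(f"{RULES_FILE} not found")
```

The script only reads the file, so it does not need sudo. It does not confirm that udev has reloaded the rules.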

To run

Make sure the OAK-D/OAK-D-Lite device is plugged into the computer.
In the terminal, run:

python3 main.py

AI Model description

The ai_models folder includes two models optimized for the Intel Myriad X:

palm_detection_sh4.blob: The palm detection model.
hand_landmark_sh4.blob: The model that detects hand landmarks, using the output of the palm detection model.
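Since both model files are needed, a quick pre-flight check before launching can catch an incomplete checkout. The sketch below is an illustration, not project code: `missing_blobs` is a hypothetical helper, and it assumes the script is run from the repository root so that `ai_models` resolves as a relative path.

```python
from pathlib import Path

# File names as listed in this README.
EXPECTED_BLOBS = ("palm_detection_sh4.blob", "hand_landmark_sh4.blob")


def missing_blobs(model_dir="ai_models"):
    """Return the expected blob file names that are absent from model_dir."""
    folder = Path(model_dir)
    return [name for name in EXPECTED_BLOBS if not (folder / name).is_file()]


if __name__ == "__main__":
    missing = missing_blobs()
    if missing:
        print("Missing model files:", ", ".join(missing))
    else:
        print("Both model files found")
```

The check only looks at file presence; it does not validate that the blobs are intact or compatible with the connected device.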

Owner

Cortic Technology Corp.
