3D Avatar Lip Syncronization from speech (JALI based face-rigging)

Last update: Dec 20, 2022

Overview

visemenet-inference

Inference Demo of "VisemeNet-tensorflow"
- VisemeNet is an audio-driven animator centric speech animation driving a JALI or standard FACS-based face-rigging from input audio.
- The original repo is outdated and difficult to setup the environment for testing the pretrained model. This code is to provide a super-clean inference module based on the original author's repo.

How to freeze graph

This repo does not need bazel-build for "freeze-graph" function
Thanks to https://github.com/lighttransport/VisemeNet-infer for giving some examples.

Requirements

Python 3.6.x using "pyenv"
Tensorflow 1.1.0

Setup the envs and packages

# Install Virtualenv using pyenv
pyenv install 3.6.5
pyenv virtualenv 3.6.5 visemenet-freeze
pyenv activate visemenet-freeze

# Install packages
pip install tensorflow==1.1.0

Clone the repo

# Clone Visemenet repo and the pretrained model
git clone https://github.com/yzhou359/VisemeNet_tensorflow.git
curl -L https://www.dropbox.com/sh/7nbqgwv0zz8pbk9/AAAghy76GVYDLqPKdANcyDuba?dl=0 > pretrained_model.zip
unzip prtrained_model.zip -d VisemeNet_tensorflow/data/ckpt/pretrain_biwi/

Freeze Graph and Save as pb

# Freeze Graph
python freeze_graph.py

Model Inference

Colab Demo

This code provides the simple and clean inference code without any needless ones
It's compatible with TF 2.0 Version

Requirements

Tensorflow 2.x
numpy
scipy
python_speech_features

How to run inference

import numpy as np
from inference import VisemeRegressor

pb_filepath = "./visemenet_frozen.pb"
wav_file_path = "./test_audio.wav"
out_txt_path = "./maya_viseme_outputs.txt"

viseme_regressor = VisemeRegressor(pb_filepath=pb_filepath)

viseme_outputs = viseme_regressor.predict_outputs(wav_file_path=wav_file_path)

np.savetxt(out_txt_path, viseme_outputs, '%.4f')

3D Avatar Lip Syncronization from speech (JALI based face-rigging)

Related tags

Overview

visemenet-inference

How to freeze graph

Requirements

Model Inference

Requirements

How to run inference

Owner

Junhwan Jang

This project deploys a yolo fastest model in the form of tflite on raspberry 3b+. The model is from another repository of mine called -Trash-Classification-Car

BabelCalib: A Universal Approach to Calibrating Central Cameras. In ICCV (2021)

Implementation of ConvMixer-Patches Are All You Need? in TensorFlow and Keras

Open standard for machine learning interoperability

Bayesian Inference Tools in Python

Real life contra a deep learning project built using mediapipe and openc

Plover-tapey-tape: an alternative to Plover’s built-in paper tape

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

Official implementation of "Learning Proposals for Practical Energy-Based Regression", 2021.

This is the official implementation for the paper "(Almost) Free Incentivized Exploration from Decentralized Learning Agents" in NeurIPS 2021.

[CVPR 2021] Generative Hierarchical Features from Synthesizing Images

Incomplete easy-to-use math solver and PDF generator.

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Re-TACRED: Addressing Shortcomings of the TACRED Dataset

Experiments with Fourier layers on simulation data.

This repository contains the official code of the paper Equivariant Subgraph Aggregation Networks (ICLR 2022)

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation (CVPR 2021)

Tool for working with Y-chromosome data from YFull and FTDNA

YoloV5 implemented by TensorFlow2 , with support for training, evaluation and inference.