PaddleBoBo is a project for quickly generating virtual anchors, built on PaddlePaddle and development kits such as PaddleSpeech and PaddleGAN

Last update: Jan 08, 2023

Overview

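PaddleBoBo - In the metaverse era, you too can build a virtual anchor.

PaddleBoBo is a project for quickly generating virtual anchors, built on the PaddlePaddle deep learning framework and development kits such as PaddleSpeech and PaddleGAN. It aims to be simple, efficient, and highly reusable: given one image containing a portrait and a piece of text, it quickly generates a video of a virtual anchor. With a small amount of secondary development to change the text input, it can also support real-time video generation and live streaming.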

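Application examples

Runtime environment

Run online on PaddlePaddle AI Studio (strongly recommended; go for the Tesla V100!)
Self-built local environment
- Windows 10
- Python 3.7+
- PaddlePaddle >= 2.2.1
- NVIDIA GPU with 16 GB+ of video memory (untested; feedback from anyone who has such a card is welcome)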
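
For a local setup, the version requirements above can be checked before installing anything else. The following is a minimal sketch, not part of PaddleBoBo: the helper names `parse_version`, `meets_minimum`, and `check_environment` are made up for illustration, and only `paddle.__version__` comes from PaddlePaddle itself.

```python
import re
import sys


def parse_version(version):
    """Return the leading numeric components of a version string as a tuple."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component, e.g. "post101"
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(version, minimum):
    """Return True if `version` is at least `minimum` (numeric comparison)."""
    have, need = parse_version(version), parse_version(minimum)
    width = max(len(have), len(need))
    # Pad with zeros so that "2.3" compares as "2.3.0".
    have += (0,) * (width - len(have))
    need += (0,) * (width - len(need))
    return have >= need


def check_environment():
    """Collect human-readable problems; an empty list means the checks passed."""
    problems = []
    if sys.version_info < (3, 7):
        problems.append("Python 3.7+ is required")
    try:
        import paddle
    except ImportError:
        problems.append("PaddlePaddle is not installed")
    else:
        if not meets_minimum(paddle.__version__, "2.2.1"):
            problems.append(
                "PaddlePaddle >= 2.2.1 is required, found " + paddle.__version__
            )
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("WARNING:", problem)
```

The comparison is numeric rather than string-based, so a version such as 2.10.0 is correctly treated as newer than 2.2.1.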

Quick start

1. Install dependencies

pip install ppgan paddlespeech

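2. Configuration file (default.yaml)

GANDRIVING:
  FOM_INPUT_IMAGE: './file/input/test.png' # static image containing a face
  FOM_DRIVING_VIDEO: './file/input/zimeng.mp4' # reference video used for expression transfer
  FOM_OUTPUT_VIDEO: './file/input/test.mp4' # output path of the video after expression transfer

SAVEPATH:
  VIDEO_SAVE_PATH: './file/output/video/' # path for saving the generated virtual anchor videos
  AUDIO_SAVE_PATH: './file/output/audio/' # path for saving audio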
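
Before running the later steps, it can help to confirm that the input files named in the configuration exist and that the output folders are present. The sketch below assumes the configuration has already been parsed into a dictionary with the same keys as default.yaml; the function name `prepare_paths` is hypothetical, and whether the project's own scripts create these folders themselves is not stated here.

```python
import os


def prepare_paths(config):
    """Create the output folders and return the list of missing input files."""
    gan = config["GANDRIVING"]
    # The face image and the driving video must exist before step 3 can run.
    missing = [
        gan[key]
        for key in ("FOM_INPUT_IMAGE", "FOM_DRIVING_VIDEO")
        if not os.path.isfile(gan[key])
    ]
    # Create the video and audio output folders if they are absent.
    for folder in config["SAVEPATH"].values():
        os.makedirs(folder, exist_ok=True)
    # The expression-transfer output also needs an existing parent folder.
    os.makedirs(os.path.dirname(gan["FOM_OUTPUT_VIDEO"]) or ".", exist_ok=True)
    return missing
```

If PyYAML is installed, the dictionary could be obtained with `yaml.safe_load` on the opened default.yaml file; an empty return value from `prepare_paths` then means both input files were found.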

3. Animate the static face

python create_virtual_human.py --config default.yaml

4. General-purpose generation

python general_demo.py \
    --human ./file/input/test.mp4 \
    --output output.mp4 \
    --text 各位开发者大家好，欢迎使用飞桨。

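Parameter	Description
human	Path to the face video generated in step 3
output	Output path of the generated virtual anchor video
text	Text to be spoken by the virtual anchor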
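
When the text changes often, for example in a secondary development that feeds new text in repeatedly, the same command can be launched from Python. This is a rough sketch that only wraps the command line shown above; the function names `build_general_demo_cmd` and `generate` are illustrative and not part of the project.

```python
import subprocess
import sys


def build_general_demo_cmd(human, output, text):
    """Build the argument list for general_demo.py from the three parameters."""
    return [
        sys.executable, "general_demo.py",
        "--human", human,
        "--output", output,
        "--text", text,
    ]


def generate(human, output, text):
    """Run general_demo.py and raise CalledProcessError if it fails."""
    subprocess.run(build_general_demo_cmd(human, output, text), check=True)


# Example call (run from the repository root, after step 3 has finished):
# generate("./file/input/test.mp4", "output.mp4", "各位开发者大家好，欢迎使用飞桨。")
```

Passing the arguments as a list means the text reaches the script as a single argument even if it contains spaces or punctuation, with no shell quoting needed.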

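Examples

AI financial news anchor

* Run news_app.py to continuously collect news data from Tonghuashun (同花顺) and generate videos
* Run play.py to play the generated videos in real time and in a loop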
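
One possible way to choose which generated videos to play in a loop is sketched below. It is not taken from play.py, whose implementation is not described here; the function names are hypothetical, and actual playback is left out. The folder is re-scanned on every pass so that newly generated videos are picked up.

```python
import os


def list_videos(folder, extension=".mp4"):
    """Return the video paths in `folder`, oldest first by modification time."""
    paths = [
        os.path.join(folder, name)
        for name in os.listdir(folder)
        if name.lower().endswith(extension)
    ]
    return sorted(paths, key=os.path.getmtime)


def playlist(folder, rounds):
    """Yield video paths for `rounds` passes, re-scanning the folder each pass."""
    for _ in range(rounds):
        for path in list_videos(folder):
            yield path
```

A player loop would iterate over `playlist('./file/output/video/', rounds)` and hand each path to whatever video player is in use.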

TODO LIST

I have been a bit tired lately. If you have any ideas, feel free to open an Issue; PRs are also welcome.

https://github.com/JiehangXie/PaddleBoBo/issues

More application examples are under development; contributions from developers are welcome.
