PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Last update: Dec 26, 2022

Overview

Dancing to Music

Paper

Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz
Dancing to Music
Neural Information Processing Systems (NeurIPS) 2019
[Paper] [YouTube] [Project] [Blog] [Supp]

Example Videos

Beat-Matching
1st row: generated dance sequences, 2nd row: music beats, 3rd row: kinematic beats
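
To make the comparison between music beats and kinematic beats concrete, here is a minimal sketch in plain Python. It is not taken from this repository. It assumes that a kinematic beat is a frame where the average joint motion reaches a strict local minimum, and that a kinematic beat "hits" when it lies within a small frame tolerance of a music beat. The function names (`motion_magnitude`, `kinematic_beats`, `beat_hit_rate`) and the `tolerance` parameter are hypothetical; the definitions used in the paper and code may differ.

```python
import math


def motion_magnitude(poses):
    """Mean joint displacement between consecutive frames.

    poses: list of frames, each frame a list of (x, y) joint positions.
    Returns a list with one value per pair of consecutive frames.
    """
    magnitudes = []
    for prev, curr in zip(poses, poses[1:]):
        steps = [math.hypot(cx - px, cy - py)
                 for (px, py), (cx, cy) in zip(prev, curr)]
        magnitudes.append(sum(steps) / len(steps))
    return magnitudes


def kinematic_beats(poses):
    """Indices (into the motion-magnitude list) of strict local minima.

    Assumption: a kinematic beat is where motion briefly slows down.
    """
    m = motion_magnitude(poses)
    return [i for i in range(1, len(m) - 1)
            if m[i] < m[i - 1] and m[i] < m[i + 1]]


def beat_hit_rate(kin_beats, music_beats, tolerance=1):
    """Fraction of kinematic beats within `tolerance` frames of a music beat."""
    if not kin_beats or not music_beats:
        return 0.0
    hits = sum(1 for k in kin_beats
               if min(abs(k - b) for b in music_beats) <= tolerance)
    return hits / len(kin_beats)


# Toy example: one joint moving along x, slowing down twice.
xs = [0.0, 3.0, 4.0, 7.0, 10.0, 10.5, 13.5]
toy_poses = [[(x, 0.0)] for x in xs]
toy_beats = kinematic_beats(toy_poses)
toy_rate = beat_hit_rate(toy_beats, music_beats=[1, 8])
```

In the toy example the joint slows down at two points, so two kinematic beats are found, and one of them coincides with a music beat.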

Multimodality
Generate various dance sequences with the same music and the same initial pose.

Long-Term Generation
Seamlessly generate a dance sequence of arbitrary length.

Photo-Realistic Videos
Map generated dance sequences to photo-realistic videos.

Train Decomposition

python train_decomp.py --name Decomp

Train Composition

python train_comp.py --name Decomp --decomp_snapshot DECOMP_SNAPSHOT

Demo

python demo.py --decomp_snapshot DECOMP_SNAPSHOT --comp_snapshot COMP_SNAPSHOT --aud_path AUD_PATH --out_file OUT_FILE --out_dir OUT_DIR --thr THR

Flags
- aud_path: input .wav file
- out_file: location of output .mp4 file
- out_dir: directory of output frames
- thr: threshold based on motion magnitude
- modulate: whether to do beat warping

Example

python demo.py --decomp_snapshot snapshot/Stage1.ckpt --comp_snapshot snapshot/Stage2.ckpt --aud_path demo/demo.wav --out_file demo/out.mp4 --out_dir demo/out_frame
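
When the demo is launched from another script, building the argument list in Python avoids quoting and dash typos. The sketch below is only an illustration: the helper `build_demo_command` is hypothetical and not part of this repository. It uses only the flags documented above and leaves out `modulate`, because the README does not say whether that flag takes a value.

```python
import shlex


def build_demo_command(decomp_snapshot, comp_snapshot, aud_path,
                       out_file, out_dir, thr=None):
    """Assemble the demo.py argument list from the documented flags.

    Hypothetical helper: it only formats the command, it does not run it.
    """
    cmd = [
        "python", "demo.py",
        "--decomp_snapshot", decomp_snapshot,
        "--comp_snapshot", comp_snapshot,
        "--aud_path", aud_path,
        "--out_file", out_file,
        "--out_dir", out_dir,
    ]
    # --thr is optional here; it is appended only when a value is given.
    if thr is not None:
        cmd += ["--thr", str(thr)]
    return cmd


# Same paths as in the example command above.
cmd = build_demo_command(
    decomp_snapshot="snapshot/Stage1.ckpt",
    comp_snapshot="snapshot/Stage2.ckpt",
    aud_path="demo/demo.wav",
    out_file="demo/out.mp4",
    out_dir="demo/out_frame",
)
print(shlex.join(cmd))
```

To actually run the demo, the list can be passed to `subprocess.run(cmd, check=True)` from the repository root.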

Citation

If you find this code useful for your research, please cite our paper:

@inproceedings{lee2019dancing2music,
  title={Dancing to Music},
  author={Lee, Hsin-Ying and Yang, Xiaodong and Liu, Ming-Yu and Wang, Ting-Chun and Lu, Yu-Ding and Yang, Ming-Hsuan and Kautz, Jan},
  booktitle={NeurIPS},
  year={2019}
}

License

Copyright (C) 2020 NVIDIA Corporation. All rights reserved. This work is made available under the NVIDIA Source Code License (1-Way Commercial). To view a copy of this license, visit https://nvlabs.github.io/Dancing2Music/LICENSE.txt.

Owner

NVIDIA Research Projects
