Lucid Sonic Dreams syncs GAN-generated visuals to music.

Last update: Jan 02, 2023

Related tags

Overview

Lucid Sonic Dreams

Lucid Sonic Dreams syncs GAN-generated visuals to music. By default, it uses NVLabs StyleGAN2, with pre-trained models lifted from Justin Pinkney's consolidated repository. Custom weights and other GAN architectures can be used as well.

Sample output can be found on YouTube and Instagram.

Installation

This implementation has been teston on Python 3.6 and 3.7. As per NVLabs' TensorFlow implementation of StyleGAN2, TensorFlow 1.15 is required. TensorFlow 2.x is not supported.

To install, simply run:

pip install lucidsonicdreams

Usage

You may refer to the Lucid Sonic Dreams Tutorial Notebook for full parameter descriptions and sample code templates. A basic visualization snippet is also found below.

Basic Visualization

from lucidsonicdreams import LucidSonicDream


L = LucidSonicDream(song = 'song.mp3',
                    style = 'abstract photos')

L.hallucinate(file_name = 'song.mp4')

Comments

where to place .pkl model files?

Hi,

Thanks for the fantastic repo,

I really want to drop a custom .pkl model file into LucidSonicDreams and it isn't obvious to me where I should put it? I'm working in Colab for the time being.

Thanks,

Mark

opened by markhanslip 1
Installing in Ubuntu 20.04 ?

Hi I tried now for hours to install Lucid Sonic Dreams in Ubuntu 20.04. How to install it correctly so that it works ? I tried it with anaconda but no luck....a little desperate now ! Update: Installed everything without errors. Used the setup.py to install dependencies. But now i am stuck. Where and how to execute this:

from lucidsonicdreams import show_styles

Show valid default style names. show_styles()

or this ?

"from lucidsonicdreams import LucidSonicDream

L = LucidSonicDream(song = 'song.mp3', style = 'abstract photos')

L.hallucinate(file_name = 'song.mp4') " ??

Can someone enlighten me please ?

opened by Colliwomple 0
Fix for broken Deps possibly? Please advise! LSD colab is BROKEN! Thanks!

See - https://github.com/mikaelalafriz/lucid-sonic-dreams/compare/main...pollinations:lucid-sonic-dreams:main suggestion for pollinations to mod to self refer so their fixes they made can be used, otherwise its referring to the same broken changes that you have that are breaking the colabs for LSD.

[fuse bias errors mostly to do with incompatibilities in breaking changes to several depenancies and potential v2 v3 python issues with v1/v2 tensorflow.]

ITs fixable but we need to specify the old working dependencies from what i can see, not the new breaking ones. All this began after the default code attempted to integrate ADA from what i could see? Correct me if i am wrong thanks!

opened by cleancoindev 1
Real time support

Hi,

First, thank you for your great work - it's incredible!

I was wondering if, in your opinion, it would be possible to extend your work to generate the visuals in real-time. This would mean using streaming of audio data (or, possibly, MIDI) rather than pre-rendered files. I guess the frame rate can be a little low at 1024, but it would be still great to have this option for someone who has a lot of GPUs. Do you think it would be anyhow realistic?

Keep up the amazing work!

opened by lowlypalace 0
ModuleNotFoundError: No module named 'lucidsonicdreams'

Im trying to run a test and this is the way i have the python file typed. Any help would be appreciated

(command i input)= python proud.py (to run the python below)

from lucidsonicdreams import LucidSonicDream

L = LucidSonicDream(song = 'proud.mp3', style = 'abstract photos')

L.hallucinate(file_name = 'proud.mp4', resolution = 360, start = 30, duration = 45)

files.download("proud of you.mp4")

Error im getting

Traceback (most recent call last): File "proud.py" line 1 in from lucidsonicdreams import LucidSonicDream ModuleNotFoundError: No module named 'lucidsonicdreams

'

opened by Texagon 3

index out of bounds

Hi! I am trying out the script in order to sync some images I have generated using VQGAN+CLIP to my audio. Here's the code:

def load_imgs(noise_batch, class_batch):
    # just loads N images randomly
    return images

L = LucidSonicDream('audio_5.mp3',
                    style = load_imgs, 
                    input_shape = 592,
                    num_possible_classes = 1000)

L.hallucinate('video_sync.mp4',
              output_audio = 'audio_sync.mp3',
              speed_fpm = 3,
              classes = [13, 14, 22, 24, 301, 84, 99, 100, 134, 143, 393, 394],
              class_shuffle_seconds = 10, 
              class_shuffle_strength = 0.1,
              class_complexity = 0.5,
              class_smooth_seconds = 4,
              motion_react = 0.35,
              flash_strength = 1)
              #contrast_strength = 0.5)

The error appears just at the end of the process:

IndexError                                Traceback (most recent call last)
<ipython-input-15-aeedb41e1387> in <module>()
     15               class_smooth_seconds = 4,
     16               motion_react = 0.35,
---> 17               flash_strength = 1)
     18               #contrast_strength = 0.5)

2 frames
/usr/local/lib/python3.7/dist-packages/lucidsonicdreams/main.py in apply_effect(self, array, index)
    742     '''Apply effect to image (array)'''
    743 
--> 744     amplitude = self.spec[index]
    745     return self.func(array=array, strength = self.strength, amplitude=amplitude)

IndexError: index 207 is out of bounds for axis 0 with size 207

Any idea on how to avoid it? Thanks in advance!

opened by shoegazerstella 0

Releases(v_04)

v_04(Mar 13, 2021)

Source code(tar.gz)
Source code(zip)

Owner

GitHub Repository

Oriented Response Networks, in CVPR 2017

Oriented Response Networks [Home] [Project] [Paper] [Supp] [Poster] Torch Implementation The torch branch contains: the official torch implementation

217 Dec 12, 2022

Official repository for "Exploiting Session Information in BERT-based Session-aware Sequential Recommendation", SIGIR 2022 short.

Session-aware BERT4Rec Official repository for "Exploiting Session Information in BERT-based Session-aware Sequential Recommendation", SIGIR 2022 shor

22 Dec 13, 2022

Image Super-Resolution by Neural Texture Transfer

SRNTT: Image Super-Resolution by Neural Texture Transfer Tensorflow implementation of the paper Image Super-Resolution by Neural Texture Transfer acce

413 Nov 30, 2022

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

ContextNet ContextNet has CNN-RNN-transducer architecture and features a fully convolutional encoder that incorporates global context information into

24 Nov 24, 2022

Lucid Sonic Dreams syncs GAN-generated visuals to music.

Related tags

Overview

Lucid Sonic Dreams

Installation

Usage

Basic Visualization

Comments

where to place .pkl model files?

Installing in Ubuntu 20.04 ?

Fix for broken Deps possibly? Please advise! LSD colab is BROKEN! Thanks!

Real time support

ModuleNotFoundError: No module named 'lucidsonicdreams'

index out of bounds

Releases(v_04)

v_04(Mar 13, 2021)

Owner

Oriented Response Networks, in CVPR 2017

Official repository for "Exploiting Session Information in BERT-based Session-aware Sequential Recommendation", SIGIR 2022 short.

Image Super-Resolution by Neural Texture Transfer

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

Code for the paper "Benchmarking and Analyzing Point Cloud Classification under Corruptions"

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

A tutorial on DataFrames.jl prepared for JuliaCon2021

StableSims is an open-source project aimed at simulating MakerDAO's Dai stablecoin system

Easy-to-use micro-wrappers for Gym and PettingZoo based RL Environments

The final project for "Applying AI to Wearable Device Data" course from "AI for Healthcare" - Udacity.

The Ludii general game system, developed as part of the ERC-funded Digital Ludeme Project.

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

tf2-keras implement yolov5

Finetuner allows one to tune the weights of any deep neural network for better embeddings on search tasks

[NeurIPS 2021] Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

A synthetic texture-invariant dataset for object detection of UAVs

Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)

Code for paper entitled "Improving Novelty Detection using the Reconstructions of Nearest Neighbours"

System-oriented IR evaluations are limited to rather abstract understandings of real user behavior

This is the code for HOI Transformer