Optimizing synthesizer parameters using gradient approximation

NASH 2021 Hackathon!

These are some experiments I conducted during NASH 2021, the Neural Audio Synthesis Hackathon that took place on the 18th & 19th of December.

Over the weekend I explored implementing gradient approximation for torchsynth, so that synthesizers could be included in deep learning models & training without having to have the full synth be differentiable. It uses simultaneous perturbation stochastic approximation (SPSA) to estimate the gradients for synthesizer parameters. This technique was used by Marco A. Martínez Ramírez et al. in their work on Differentiable Signal Processing With Black-Box Audio Effects.

I was able to start optimizing on a few parameters for a simple synthesizer, but ran into issues as soon as oscillator tuning or FM was introduced. There is a known issue with audio loss functions for calculating loss with pitch (Turian and Henry, 2020), so this is not surprising.

Nonetheless, techniques like SPSA seem promising for including traditional DSP synthesis into neural nets and deep learning!

Fun weekend puttering around with this! Thank you to Ben Hayes for organing the event.

Optimizing synthesizer parameters using gradient approximation

Related tags

Overview

Optimizing synthesizer parameters using gradient approximation

NASH 2021 Hackathon!

Owner

Jordie Shier

Benchmarks for semi-supervised domain generalization.

A 3D Dense mapping backend library of SLAM based on taichi-Lang designed for the aerial swarm.

DTCN IJCAI - Sequential prediction learning framework and algorithm

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

Codebase for testing whether hidden states of neural networks encode discrete structures.

Net2net - Network-to-Network Translation with Conditional Invertible Neural Networks

Drone detection using YOLOv5

MADE (Masked Autoencoder Density Estimation) implementation in PyTorch

Paper Title: Heterogeneous Knowledge Distillation for Simultaneous Infrared-Visible Image Fusion and Super-Resolution

An example showing how to use jax to train resnet50 on multi-node multi-GPU

A small tool to joint picture including gif

QueryInst: Parallelly Supervised Mask Query for Instance Segmentation

Viewmaker Networks: Learning Views for Unsupervised Representation Learning

This dlib-based facial login system

一些经典的CTR算法的复现; LR, FM, FFM, AFM, DeepFM，xDeepFM, PNN, DCN, DCNv2, DIFM, AutoInt, FiBiNet,AFN,ONN,DIN, DIEN ... （pytorch, tf2.0）

Anchor-free Oriented Proposal Generator for Object Detection

Analysing poker data from home games with friends

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)

HiPAL: A Deep Framework for Physician Burnout Prediction Using Activity Logs in Electronic Health Records