Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Last update: Dec 17, 2022

Overview

Fre-GAN Vocoder

Training:

python train.py --config config.json
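
The command above passes a JSON file to the training script through a `--config` flag. As a rough illustration of how such a flag is commonly handled, here is a minimal sketch. It is not taken from this repository's `train.py`: the function names `parse_args` and `load_config`, and the default value `config.json`, are assumptions made for the example, and the actual script may read its configuration differently.

```python
import argparse
import json


def parse_args(argv=None):
    """Parse the command line; only the --config flag from the README is modeled."""
    parser = argparse.ArgumentParser(description="Hypothetical training entry point")
    # Assumption: the flag takes a path to a JSON file and defaults to config.json.
    parser.add_argument("--config", default="config.json",
                        help="Path to a JSON configuration file")
    return parser.parse_args(argv)


def load_config(path):
    """Read the JSON configuration file and return it as a plain dict."""
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)
```

With these helpers, a script could call `args = parse_args()` followed by `config = load_config(args.config)` and then read its hyperparameters from the resulting dictionary. Which keys the real `config.json` contains is not stated here, so none are assumed.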

Citation:

@misc{kim2021fregan,
      title={Fre-GAN: Adversarial Frequency-consistent Audio Synthesis}, 
      author={Ji-Hoon Kim and Sang-Hoon Lee and Ji-Hyun Lee and Seong-Whan Lee},
      year={2021},
      eprint={2106.02297},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}

Owner

Rishikesh (ऋषिकेश)

