Code for Paper "Evidential Softmax for Sparse MultimodalDistributions in Deep Generative Models"

Last update: Jun 06, 2022

Related tags

Overview

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Abstract

Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training because the log-likelihood can be undefined for sparse probability distributions. Furthermore, many sparse normalization functions often collapse the multimodality of distributions. In this work, we present ev-softmax, a sparse normalization function that preserves the multimodality of probability distributions. We derive its properties, including its gradient in closed-form, and introduce a continuous family of approximations to ev-softmax that have full support and can thus be trained with probabilistic loss functions such as negative log-likelihood and Kullback-Leibler divergence. We evaluate our method on a variety of generative models, including variational autoencoders and auto-regressive models. Our method outperforms existing dense and sparse normalization techniques in distributional accuracy and classification performance. We demonstrate that ev-softmax successfully reduces the dimensionality of output probability distributions while maintaining multimodality.

Setup

Required packages are listed in requirements.txt.

Running

The implementation for the ev-softmax function and its loss function can be found in evsoftmax.py.

The MNIST CVAE and VQ-VAE experiments can be run using run_mnist_cvae.sh and run_vqvae.sh, respectively. Instructions for the SSVAE experiment can be found in mnist_ssvae/README.md, and scripts used for preprocessing, training, and evaluating can be found in mnist_ssvae/scripts. Instructions for the translation experiment can be found in translation/README.md, and scripts used for preprocessing, training, and evaluating can be found in translation/scripts/iwslt.

Code for Paper "Evidential Softmax for Sparse MultimodalDistributions in Deep Generative Models"

Related tags

Overview

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Abstract

Setup

Running

Owner

Stanford Intelligent Systems Laboratory

Learning Compatible Embeddings, ICCV 2021

Codes and scripts for "Explainable Semantic Space by Grounding Languageto Vision with Cross-Modal Contrastive Learning"

Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

Diabet Feature Engineering - Predict whether people have diabetes when their characteristics are specified

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Statistical-Rethinking-with-Python-and-PyMC3 - Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath

CMSC320 - Introduction to Data Science - Fall 2021

Deep Compression for Dense Point Cloud Maps.

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

基于Pytorch实现优秀的自然图像分割框架！(包括FCN、U-Net和Deeplab)

Python based Advanced AI Assistant

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

ConformalLayers: A non-linear sequential neural network with associative layers

AITom is an open-source platform for AI driven cellular electron cryo-tomography analysis.

[ICCV 2021] Learning A Single Network for Scale-Arbitrary Super-Resolution

ServiceX Transformer that converts flat ROOT ntuples into columnwise data

ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.

Optimized code based on M2 for faster image captioning training

GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and Heterophily

Training DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo)