Pytorch implementation of VAEs for heterogeneous likelihoods.

Last update: Nov 29, 2022

Related tags

Overview

Heterogeneous VAEs

Beware: This repository is under construction 🛠️

Pytorch implementation of different VAE models to model heterogeneous data. Here, we call heterogeneous data those for which we assume that each feature is of a different type, and therefore each feature is assumed to have a different likelihood. Heterogeneous data is also known as mixed-type data and tabular data.

Usage

This repository is not meant to be a library which you can install and use as it is, but rather as a ML project code which you can freely fork and modify to fit your particular needs.

Dependencies

We are working on providing a conda requirements file. For the moment, there is a Dockerfile which you can build and use, or simply look at the project dependencies from there.

Example

You can find information about all the available arguments via python main.py --help. For example, you can train the Wine dataset on a heterogeneous VAE with default arguments using:

python main.py -model=vae -dataset=datasets/Wine -seed=2 -miss-perc=20 -miss-suffix=1

Models

This repository contains implementations of the following models, adapted for heterogeneous likelihoods (if you use them in your work, make sure to cite the original authors):

Autoencoding Variational Bayes (VAE): https://arxiv.org/abs/1312.6114
Importance Weighted Autoencoder (IWAE): http://arxiv.org/abs/1509.00519
Doubly Reparametrized Gradient Estimators for Monte Carlo objectives (DReG): http://arxiv.org/abs/1810.04152
Handling Incomplete Heterogeneous Data using VAEs (HI-VAE): http://arxiv.org/abs/1807.03653

Likelihoods

The code supports the following likelihoods at the moment:

Gaussian, for real-valued features.
Log-normal, for positive-valued real features.
Bernoulli, for binary features.
Categorical, for categorical features.
Poisson, for positive-valued integer (count) features.

Datasets

We provide with this code some example datasets taken from UCI and R package datasets. You can use any dataset as long as the format is the same.

Contributing

The code can be further simplified and polished, and we still have some legacy code. Pull requests and issues are more than welcome, as long as it contributes to making the code clean, simple, general, and elegant.

Pytorch implementation of VAEs for heterogeneous likelihoods.

Related tags

Overview

Heterogeneous VAEs

Usage

Dependencies

Example

Models

Likelihoods

Datasets

Contributing

Owner

Adrián Javaloy

ANN model for prediction a spatio-temporal distribution of supercooled liquid in mixed-phase clouds using Doppler cloud radar spectra.

Text completion with Hugging Face and TensorFlow.js running on Node.js

Transformer part of 12th place solution in Riiid! Answer Correctness Prediction

PyGRANSO: A PyTorch-enabled port of GRANSO with auto-differentiation

OBBDetection: an oriented object detection toolbox modified from MMdetection

Code for CVPR2021 paper 'Where and What? Examining Interpretable Disentangled Representations'.

A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

[NeurIPS 2021] Galerkin Transformer: a linear attention without softmax

Split Variational AutoEncoder

Neural Koopman Lyapunov Control

An OpenAI-Gym Package for Training and Testing Reinforcement Learning algorithms with OpenSim Models

FedGS: A Federated Group Synchronization Framework Implemented by LEAF-MX.

It's final year project of Diploma Engineering. This project is based on Computer Vision.

A system for quickly generating training data with weak supervision

a reccurrent neural netowrk that when trained on a peice of text and fed a starting prompt will write its on 250 character text using LSTM layers

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Generating Band-Limited Adversarial Surfaces Using Neural Networks

A GridMixup augmentation, inspired by GridMask and CutMix

using STGCN to achieve egg classification task