This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes dataset. This code is implemented in Pytorch.

Last update: Dec 26, 2022

Related tags

Deep Learning slate

Overview

SLATE

This is the official source code for SLATE. We provide the code for the model, the training code and a dataset loader for the 3D Shapes dataset. This code is implemented in Pytorch.

Arxiv: https://arxiv.org/pdf/2110.11405.pdf
Project Page: https://sites.google.com/view/slate-autoencoder

Dataset

The current release provides a boilerplate code to train the model on the 3D Shapes dataset. The dataset class is provided in shapes_3d.py. You can edit or replace this class if you need to run the code on a different dataset. The 3D Shapes dataset can be downloaded from the official URL https://console.cloud.google.com/storage/browser/3d-shapes. This should produce a dataset file 3dshapes.h5. During training, the path to this dataset file needs to be provided using the argument --data_path.

Training

To train the model, simply execute:

python train.py

Check train.py to see the full list of training arguments.

Outputs

The training code produces Tensorboard logs. To see these logs, run Tensorboard on the logging directory that was provided in the training argument --log_path. These logs contain the training loss curves and visualizations of reconstructions and object attention maps.

Hyperparameters of Interest

Learning Rate can be tuned using the training argument --lr_main and different choices can affect the characteristics of the object attention maps.
Number of Slots can be tuned using the training argument --num_slots. Number of slots should be set higher than the number of objects you expect to see in the images.
Number of Slot Attention Iterations can be tuned using the training argument --num_iterations. In general, keep the number of iterations as small as possible because too many iterations can prevent slots from learning to diversify and attach to different objects.

Code Files

This repository provides the following files.

train.py contains the main code for running the training.
slate.py provides the model class for SLATE.
shapes_3d.py contains the dataset class for 3D Shapes dataset.
dvae.py provides the encoder and the decoder for Discrete VAE.
slot_attn.py provides the model class for Slot Attention encoder.
transformer.py provides the model classes for Transformer.
utils.py provides helper classes and functions for the implementation.

This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes dataset. This code is implemented in Pytorch.

Related tags

Overview

SLATE

Dataset

Training

Outputs

Hyperparameters of Interest

Code Files

Owner

Gautam Singh

implicit displacement field

[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets"

This repository provides a basic implementation of our GCPR 2021 paper "Learning Conditional Invariance through Cycle Consistency"

Implementation of "Distribution Alignment: A Unified Framework for Long-tail Visual Recognition"(CVPR 2021)

Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2021)

Mahadi-Now - This Is Pakistani Just Now Login Tools

A New Approach to Overgenerating and Scoring Abstractive Summaries

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

This is the research repository for Vid2Doppler: Synthesizing Doppler Radar Data from Videos for Training Privacy-Preserving Activity Recognition.

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

PyTorch implementation for ComboGAN

OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis

Implementation of Bottleneck Transformer in Pytorch

Offical implementation for "Trash or Treasure? An Interactive Dual-Stream Strategy for Single Image Reflection Separation".

Code for Ditto: Building Digital Twins of Articulated Objects from Interaction

💊 A 3D Generative Model for Structure-Based Drug Design (NeurIPS 2021)

(CVPR 2022 Oral) Official implementation for "Surface Representation for Point Clouds"

YOLOX + ROS(1, 2) object detection package

Official codebase for ICLR oral paper Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling