Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Last update: Dec 07, 2022

Introduction

This is the official repository for the PyTorch implementation of "Canonical Capsules: Unsupervised Capsules in Canonical Pose" by Weiwei Sun*, Andrea Tagliasacchi*, Boyang Deng, Sara Sabour, Soroosh Yazdani, Geoffrey Hinton, Kwang Moo Yi.

Download links

Project Website
PDF (arXiv)
PDF (github copy)

Citation

⚠️ If you use this source code or data in your research (in any shape or form), we require you to cite our paper as:

@conference{sun2020canonical,
   title={Canonical Capsules: Unsupervised Capsules in Canonical Pose},
   author={Weiwei Sun and Andrea Tagliasacchi and Boyang Deng and 
           Sara Sabour and Soroosh Yazdani and Geoffrey Hinton and
           Kwang Moo Yi},
   booktitle={Neural Information Processing Systems},
   year={2021}
}

Requirements

Please install dependencies with the provided environment.yml:

conda env create -f environment.yml

Datasets

We use the ShapeNet dataset as in AtlasNetV2: download the data from AtlasNetV2's official repo and convert the downloaded data into h5 files with the provided script (i.e., data_utils/ShapeNetLoader.py).
For faster experimentation, please use our 2D planes dataset, which we generated from ShapeNet (if you use this dataset, please cite both our paper and ShapeNet).

Training/testing (2D)

To train the model on 2D planes (training takes only 50 epochs, and one epoch takes approximately 2.5 minutes on an NVIDIA GTX 1080 Ti):

./main.py --log_dir=plane_dim2 --indim=2 --scheduler=5
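
As a rough guide to planning a 2D run, the figures quoted above (50 epochs at roughly 2.5 minutes each on a GTX 1080 Ti) can be turned into a wall-clock estimate. The helper below is a hypothetical sketch and is not part of this repository; actual times will vary with hardware and data loading.

```python
# Hypothetical helper (not part of the repository): estimate total training
# time from the epoch count and per-epoch duration quoted in this README.

def estimate_training_minutes(epochs: int, minutes_per_epoch: float) -> float:
    """Return the approximate total training time in minutes."""
    return epochs * minutes_per_epoch


total = estimate_training_minutes(epochs=50, minutes_per_epoch=2.5)
hours, minutes = divmod(total, 60)
print(f"about {int(hours)} h {int(minutes)} min")  # about 2 h 5 min
```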

To visualize the decomposition and reconstruction:

./main.py --save_dir=gifs_plane2d --indim=2 --scheduler=5 --mode=vis --pt_file=logs/plane_dim2/checkpoint.pth

Training/testing (3D)

To train the model on the 3D dataset:

./main.py --log_dir=plane_dim3 --indim=3 --cat_id=-1

To test the model:

./main.py --log_dir=plane_dim3 --indim=3 --cat_id=-1 --mode=test

Note that the option cat_id selects the category whose h5 files are loaded, according to the following look-up table:

id	category
-1	all
0	bench
1	cabinet
2	car
3	cellphone
4	chair
5	couch
6	firearm
7	lamp
8	monitor
9	plane
10	speaker
11	table
12	watercraft
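
When scripting runs over several categories, it can be convenient to go from a category name to the matching flag. The sketch below simply mirrors the look-up table above; the names CATEGORY_BY_ID, ID_BY_CATEGORY, and cat_id_flag are hypothetical and do not exist in this repository.

```python
# Hypothetical helper mirroring the look-up table above; not part of main.py.
CATEGORY_BY_ID = {
    -1: "all",
    0: "bench",
    1: "cabinet",
    2: "car",
    3: "cellphone",
    4: "chair",
    5: "couch",
    6: "firearm",
    7: "lamp",
    8: "monitor",
    9: "plane",
    10: "speaker",
    11: "table",
    12: "watercraft",
}

# Reverse mapping: category name -> id.
ID_BY_CATEGORY = {name: cat_id for cat_id, name in CATEGORY_BY_ID.items()}


def cat_id_flag(category: str) -> str:
    """Return the --cat_id flag string for a category name from the table."""
    if category not in ID_BY_CATEGORY:
        raise ValueError(f"unknown category: {category!r}")
    return f"--cat_id={ID_BY_CATEGORY[category]}"


print(cat_id_flag("plane"))  # --cat_id=9
```

For example, cat_id_flag("all") yields --cat_id=-1, which is the value used in the 3D training and testing commands above.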

Pre-trained models (3D)

We release the 3D pretrained models for both the single-category (airplanes) and the multi-category (all 13 classes) settings.

Classification

To use our classification script:

python classification.py --data_dir=/path/to/saved/features --feature_type=caca --method_type=svm --use_kpts
