ShapeGlot: Learning Language for Shape Differentiation

Last update: Dec 23, 2022

Overview

ShapeGlot: Learning Language for Shape Differentiation

Created by Panos Achlioptas, Judy Fan, Robert X.D. Hawkins, Noah D. Goodman, Leonidas J. Guibas.

Introduction

This work is based on our ICCV-2019 paper. There, we proposed speaker & listener neural models that reason and differentiate objects according to their shape via language (hence the term shape--glot). These models can operate on 2D images and/or 3D point-clouds and do learn about natural properties of shapes, including the part-based compositionality of 3D objects, from language alone. The latter fact, makes them remarkably robust, enabling a plethora of zero-shot-transfer learning applications. You can check our project's webpage for a quick introduction and produced results.

Dependencies

Main Requirements:

Python 3x (with numpy, pandas, matplotlib, nltk)
Pytorch (version 1.0+)

Our code has been tested with Python 3.6.9, Pytorch 1.3.1, CUDA 10.0 on Ubuntu 14.04.

Installation

Clone the source code of this repository and pip install it inside your (virtual) environment.

git clone https://github.com/optas/shapeglot
cd shapeglot
pip install -e .

Data Set

We provide 78,782 utterances referring to a ShapeNet chair that was contrasted against two distractor chairs via the reference game described in our accompanying paper (dataset termed as ChairsInContext). We further provide the data used in the Zero-Shot experiments which include 300 images of real-world chairs, and 1200 referential utterances for ShapeNet lamps & tables & sofas, and 400 utterances describing ModelNet beds. Last, we include image-based (VGG-16) and point-cloud-based (PC-AE) pretrained features for all ShapeNet chairs to facilitate the training of the neural speakers and listeners.

To download the data (~232 MB) please run the following commands. Notice, that you first need to accept the Terms Of Use here. Upon review we will email to you the necessary link that you need to put inside the desingated location of the download_data.sh file.

cd shapeglot/
./download_data.sh

The downloaded data will be stored in shapeglot/data

Usage

To easily expose the main functionalities of our paper, we prepared some simple, instructional notebooks.

To tokenize, prepare and visualize the chairsInContext dataset, please look/run:

    shapeglot/notebooks/prepare_chairs_in_context_data.ipynb

To train a neural listener (only ~10 minutes on a single modern GPU):

    shapeglot/notebooks/train_listener.ipynb

Note: This repo contains limited functionality compared to what was presented in the paper. This is because our original (much heavier) implementation is in low-level TensorFlow and python 2.7. If you need more functionality (e.g. pragmatic-speakers) and you are OK with Tensorflow, please email [email protected] .

Citation

If you find our work useful in your research, please consider citing:

@article{shapeglot,
  title={ShapeGlot: Learning Language for Shape Differentiation},
  author={Achlioptas, Panos and Fan, Judy and Hawkins, Robert X. D. and Goodman, Noah D. and Guibas, Leonidas J.},
  journal={CoRR},
  volume={abs/1905.02925},
  year={2019}
}

License

This provided code is licensed under the terms of the MIT license (see LICENSE for details).

ShapeGlot: Learning Language for Shape Differentiation

Related tags

Overview

ShapeGlot: Learning Language for Shape Differentiation

Introduction

Dependencies

Installation

Data Set

Usage

Citation

License

Owner

Panos

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

NLG evaluation via Statistical Measures of Similarity: BaryScore, DepthScore, InfoLM

Readings for "A Unified View of Relational Deep Learning for Polypharmacy Side Effect, Combination Therapy, and Drug-Drug Interaction Prediction."

PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

PyExplainer: A Local Rule-Based Model-Agnostic Technique (Explainable AI)

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

Low-dose Digital Mammography with Deep Learning

Neural network-based build time estimation for additive manufacturing

Graph InfoClust: Leveraging cluster-level node information for unsupervised graph representation learning

Learning to Reconstruct 3D Manhattan Wireframes from a Single Image

An open-source Deep Learning Engine for Healthcare that aims to treat & prevent major diseases

Understanding Convolution for Semantic Segmentation

pytorchのスライス代入操作をonnxに変換する際にScatterNDならないようにするサンプル

Code for the prototype tool in our paper "CoProtector: Protect Open-Source Code against Unauthorized Training Usage with Data Poisoning".

Training RNNs as Fast as CNNs

SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)

PyTorch implementation of ARM-Net: Adaptive Relation Modeling Network for Structured Data.

The official PyTorch code for NeurIPS 2021 ML4AD Paper, "Does Thermal data make the detection systems more reliable?"

PECOS - Prediction for Enormous and Correlated Spaces

Torch implementation of "Enhanced Deep Residual Networks for Single Image Super-Resolution"