The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Last update: Apr 20, 2022

Related tags

Overview

The Code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Setting up and using the repo

Get the dataset. Follow the steps in data/README.md. This includes the steps to get the pretrained BERT embeddings and visual representations.
Install cuda 11.0 if it's not available already.
Install anaconda if it's not available already, and create a new environment. You need to install a few things, namely, pytorch 1.7.1, torchvision, and allennlp.

wget https://repo.anaconda.com/archive/Anaconda3-5.2.0-Linux-x86_64.sh
conda update -n base -c defaults conda
conda create --name MCC python=3.6
source activate MCC

conda install numpy pyyaml setuptools cmake cffi tqdm pyyaml scipy ipython mkl mkl-include cython typing h5py pandas nltk spacy numpydoc scikit-learn jpeg

conda install pytorch==1.7.1 torchvision==0.8.2 cudatoolkit=11.0 -c pytorch

pip install -r allennlp-requirements.txt
pip install --no-deps allennlp==0.8.0
python -m spacy download en_core_web_sm


# this one is optional but it should help make things faster
pip uninstall pillow && CC="cc -mavx2" pip install -U --force-reinstall pillow-simd

That's it! Now to set up the environment, run source activate MCC.

Train/Evaluate models

Please refer to models/README.md.

Acknowledgement

We refer to the repo r2c and tab-vcr for preprocessing codes.

Cite

@inproceedings{zhang2021multi,
  title={Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning},
  author={Zhang, Xi and Zhang, Feifei and Xu, Changsheng},
  booktitle={Proceedings of the 29th ACM International Conference on Multimedia},
  pages={1793--1802},
  year={2021}
}

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Related tags

Overview

The Code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Setting up and using the repo

Train/Evaluate models

Acknowledgement

Cite

Owner

My solution for the 7th place / 245 in the Umoja Hack 2022 challenge

Deep learning toolbox based on PyTorch for hyperspectral data classification.

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

DAT4 - General Assembly's Data Science course in Washington, DC

Paddle implementation for "Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation" (NAACL 2021)

NHS AI Lab Skunkworks project: Long Stayer Risk Stratification

[CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception

OpenAi's gym environment wrapper to vectorize them with Ray

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

PenguinSpeciesPredictionML - Basic model to predict Penguin species based on beak size and sex.

Aerial Single-View Depth Completion with Image-Guided Uncertainty Estimation (RA-L/ICRA 2020)

Deep motion generator collections

[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization

X-VLM: Multi-Grained Vision Language Pre-Training

Deep Federated Learning for Autonomous Driving

IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling

Time Series Forecasting with Temporal Fusion Transformer in Pytorch

Scheduling BilinearRewards

Sibur challange 2021 competition - 6 place

MAU: A Motion-Aware Unit for Video Prediction and Beyond, NeurIPS2021