This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

Last update: Dec 26, 2022

Overview

MultiModal-InfoMax

🔥 If you are interested in other multimodal work from our DeCLaRe Lab, you are welcome to visit the clustered repository.

Introduction

MultiModal-InfoMax (MMIM) synthesizes fusion results from multi-modality input through a two-level mutual information (MI) maximization. We use the BA (Barber-Agakov) lower bound and contrastive predictive coding as the objective functions to be maximized. To facilitate computing the BA lower bound and the training process, we design an entropy estimation module with an associated memory of history data.
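To make the contrastive predictive coding term more concrete, here is a minimal sketch of an InfoNCE-style loss in plain Python. It is an illustration only and is not taken from this repository. It assumes in-batch negatives and cosine-similarity scoring, and the names `info_nce`, `fusion`, `modality` and `temperature` are hypothetical. The PyTorch code in this repository may score pairs differently.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(dot(v, v)) or 1.0
    return [a / norm for a in v]


def info_nce(fusion, modality, temperature=1.0):
    """InfoNCE loss over a batch.

    Row i of `fusion` is treated as the positive match for row i of
    `modality`; every other row in the batch serves as a negative.
    """
    f = [normalize(x) for x in fusion]
    m = [normalize(x) for x in modality]
    n = len(f)
    total = 0.0
    for i in range(n):
        # Cosine similarity between fusion vector i and every modality vector.
        scores = [dot(f[i], m[j]) / temperature for j in range(n)]
        # Log-softmax of the positive score, computed in a numerically stable way.
        top = max(scores)
        log_denominator = top + math.log(sum(math.exp(s - top) for s in scores))
        total += scores[i] - log_denominator
    return -total / n


# Toy batch of two samples with two-dimensional features.
aligned = info_nce([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
shuffled = info_nce([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
print(aligned < shuffled)
```

Minimizing this loss maximizes a lower bound on mutual information, since log(N) minus the loss bounds the MI from below for a batch of N samples. In the toy batch, correctly paired rows give a smaller loss than mismatched rows.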

Usage

Download the CMU-MOSI and CMU-MOSEI datasets from Google Drive or Baidu Disk (extraction code: g3m2). Place them under the folder Multimodal-Infomax/datasets.
Set up the environment (requires conda)

conda env create -f environment.yml
conda activate MMIM

Start training

python main.py --dataset mosi --contrast
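The steps above can also be wrapped in a small launcher that checks for the dataset folder before starting a run. This is a sketch rather than part of the repository. The helper names `build_train_command` and `run_training` are hypothetical, and only the `--dataset mosi` and `--contrast` arguments shown above are used.

```python
import subprocess
from pathlib import Path

# Folder named in the download step above.
DATASET_DIR = Path("Multimodal-Infomax") / "datasets"


def build_train_command(dataset):
    """Return the training command from the usage section as an argument list."""
    return ["python", "main.py", "--dataset", dataset, "--contrast"]


def run_training(dataset, dataset_dir=DATASET_DIR):
    """Fail early if the datasets folder is missing, then launch training."""
    if not dataset_dir.is_dir():
        raise FileNotFoundError(f"Expected datasets under {dataset_dir}")
    # check=True raises CalledProcessError if main.py exits with an error.
    return subprocess.run(build_train_command(dataset), check=True)


if __name__ == "__main__":
    run_training("mosi")
```

Run it from the directory that contains main.py, with the MMIM conda environment activated.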

Citation

Please cite our paper if you find our work useful for your research:

@article{han2021improving,
  title={Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis},
  author={Han, Wei and Chen, Hui and Poria, Soujanya},
  journal={Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  year={2021}
}

Contact

Should you have any questions, feel free to contact me through [email protected]

Owner

Deep Cognition and Language Research (DeCLaRe) Lab
