Predicting Event Memorability from Contextual Visual Semantics

Last update: Oct 06, 2021

Overview

Predicting-Event-Memorability-from-Contextual-Visual-Semantics

This repository contains pytorch implementation of five configurations in our paper "Predicting Event Memorability from Contextual Visual Semantics".

Raw images are to be put in '../datasets/r3/images/'
Train and validation (val) splits for different configurations are under '../datasets/r3/splits/'; the set of train_1.txt, val_1.txt, etc. contains image names and memorability scores for the respective split.
Configurations of ablation study are with individual folders, e.g., './no_face', './no_activity', etc. './full_set' is for full configuration without removing features.
Complete extrinsic features and the memory test outcome is available in 'R3_data.csv' file. Description of the features is given in 'R3_data_notes.txt'. Both can be downloaded together with the original image cues @ https://drive.google.com/drive/folders/1Bx_ePv7ui6DCIXkESCpoyuvd0H3B9o6d?usp=sharing
The AMNet implementation is adpated from https://github.com/ok1zjf/AMNet

########################################################################################

To train AMNet and CEMNet_wt_AMNet:

python3 main.py --train-batch-size 128 --test-batch-size 128 --cnn ResNet50FC --dataset lamem --train-split train_1 --val-split val_1

To predict:

python3 main.py --cnn ResNet50FC --model-weights /path/to/model/weights_xx.pkl --eval-images /path/to/evl_images --csv-out memorabilities.txt

To train other models (ICNet, MLP, CEMNet_wt_ICNet):

[Go the the respective folder, e.g., '../ICNet']

python main.py

To predict (please select corresponding splits and model in predict.py):

python predict.py

[Where necessary, change Dataset.py to the corresponding directory of split]

########################################################################################

System configuration:

platform: UBUNTU 16.04

GPU: GeForce GTX 1080

CUDA:9.0

########################################################################################

Python packages:

python 3.5.6

pytorch 0.2.0

Torchvison 0.1.9

Numpy 1.15.2

Opencv 3.1.0

PIL 6.1.0

########################################################################################

To cite the paper: Xu Q., Fang F., del Molino A.G, Subbaraju V., Lim J.H., Predicting Event Memorability from Contextual Visual Semantics, NeurIPS 2021.

If you have any questions, please feel free to contact Dr Xu Qianli: [email protected]

Predicting Event Memorability from Contextual Visual Semantics

Related tags

Overview

Predicting-Event-Memorability-from-Contextual-Visual-Semantics

Owner

A novel pipeline framework for multi-hop complex KGQA task. About the paper title: Improving Multi-hop Embedded Knowledge Graph Question Answering by Introducing Relational Chain Reasoning

Disentangled Lifespan Face Synthesis

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

Code and training data for our ECCV 2016 paper on Unsupervised Learning

TensorFlow-based implementation of "ICNet for Real-Time Semantic Segmentation on High-Resolution Images".

A scikit-learn-compatible module for estimating prediction intervals.

This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing at EMNLP 2021.

A tensorflow implementation of an HMM layer

Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"

ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation

Classification models 1D Zoo - Keras and TF.Keras

In this project, we create and implement a deep learning library from scratch.

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

COCO Style Dataset Generator GUI

Codes and scripts for "Explainable Semantic Space by Grounding Languageto Vision with Cross-Modal Contrastive Learning"

code for EMNLP 2019 paper Text Summarization with Pretrained Encoders

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

The Noise Contrastive Estimation for softmax output written in Pytorch

[NeurIPS 2021] "Delayed Propagation Transformer: A Universal Computation Engine towards Practical Control in Cyber-Physical Systems"