PyTorch implementation for "Latent Structures Mining with Contrastive Modality Fusion for Multimedia Recommendation"

Last update: Dec 08, 2022

Overview

MICRO

PyTorch implementation for the paper: Latent Structures Mining with Contrastive Modality Fusion for Multimedia Recommendation

Dependencies

Python 3.6
torch==1.5.0
scikit-learn==0.24.2
torch-scatter==2.0.8

Dataset Preparation

Download the 5-core reviews data, metadata, and image features from the Amazon product dataset. Put the files into the meta-data/ directory of the corresponding dataset (for example, data/sports/meta-data/, as in the layout below).

Install sentence-transformers and download a pretrained model to extract textual features. Unzip the pretrained model into the directory sentence-transformers/:

├─ data/
    ├── sports/
        ├── meta-data/
            ├── image_features_Sports_and_Outdoors.b
            ├── meta-Sports_and_Outdoors.json.gz
            ├── reviews_Sports_and_Outdoors_5.json.gz
    ├── sentence-transformers/
        ├── stsb-roberta-large
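
As a rough illustration of what extracting textual features with sentence-transformers can look like, the sketch below joins a few metadata fields per item into one sentence and passes the sentences to an encoder. It is not the code in build_data.py: the field list (title, brand, categories, description), the helper names, and the model path are assumptions made for this example. Only `SentenceTransformer(...)` and its `encode` method come from the sentence-transformers library.

```python
def build_item_sentence(meta, fields=("title", "brand", "categories", "description")):
    """Join the available text fields of one item into a single string.

    The field list is an assumption for illustration; the repository's
    preprocessing script may use different fields.
    """
    parts = []
    for field in fields:
        value = meta.get(field)
        if not value:
            continue
        if isinstance(value, (list, tuple)):
            # Categories can be nested lists; flatten one level.
            flat = []
            for v in value:
                flat.extend(v if isinstance(v, (list, tuple)) else [v])
            value = " ".join(str(v) for v in flat)
        parts.append(str(value).strip())
    return " ".join(parts)


def encode_items(metas, encoder, batch_size=32):
    """Encode one sentence per item with any object exposing encode()."""
    sentences = [build_item_sentence(m) for m in metas]
    return encoder.encode(sentences, batch_size=batch_size)


def load_encoder(model_dir="data/sentence-transformers/stsb-roberta-large"):
    """Load the unzipped pretrained model (path assumed from the layout above)."""
    # Imported lazily so the helpers above work without the package installed.
    from sentence_transformers import SentenceTransformer
    return SentenceTransformer(model_dir)
```

With the package installed and the model unzipped, `encode_items(metas, load_encoder())` would return one embedding per item, where `metas` is a list of metadata dictionaries.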

Run python build_data.py to preprocess the data.
Run python cold_start.py to build the cold-start data.
We also provide the processed data on Baidu Yun (access code: m37q) and Google Drive.
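
For readers unfamiliar with the "5-core" setting, the following sketch shows one way to read user-item pairs from a gzipped reviews file and to filter an interaction set so that every remaining user and item has at least k interactions. It is a minimal illustration, not the logic of build_data.py. It assumes one JSON object per line with `reviewerID` and `asin` fields, and the function names are hypothetical.

```python
import gzip
import json
from collections import Counter


def load_interactions(path):
    """Read (user, item) pairs from a gzipped file with one JSON review per line."""
    pairs = []
    with gzip.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue
            review = json.loads(line)
            # Field names assumed from the Amazon review format.
            pairs.append((review["reviewerID"], review["asin"]))
    return pairs


def k_core(pairs, k=5):
    """Repeatedly drop users and items that have fewer than k interactions."""
    pairs = sorted(set(pairs))  # deduplicate and fix the order
    while True:
        user_count = Counter(u for u, _ in pairs)
        item_count = Counter(i for _, i in pairs)
        kept = [(u, i) for u, i in pairs
                if user_count[u] >= k and item_count[i] >= k]
        if len(kept) == len(pairs):
            return kept  # nothing was removed, so the k-core is stable
        pairs = kept
```

The filter has to loop because removing a sparse item can push a user below the threshold, and vice versa. The downloaded 5-core files should pass through `k_core(pairs, k=5)` without losing unique pairs, which makes this a simple sanity check on a download.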

Usage

Start training and inference as:

cd codes
python main.py --dataset {DATASET}

For cold-start settings:

python main.py --dataset {DATASET} --core 0 --verbose 1 --lr 1e-5

Citation

If you use our code in your research, please cite:

@article{MICRO21,
  title         = {Latent Structures Mining with Contrastive Modality Fusion for Multimedia Recommendation},
  author        = {Zhang, Jinghao and
                   Zhu, Yanqiao and
                   Liu, Qiang and
                   Zhang, Mengqi and
                   Wu, Shu and
                   Wang, Liang},
  journal       = {arXiv.org},
  year          = {2021},
  eprint        = {2111.00678},
  archivePrefix = {arXiv},
  primaryClass  = {cs.IR}
}

Acknowledgement

The structure of this code is largely based on LightGCN. We thank the authors for their work.

Owner

Big Data and Multi-modal Computing Group, CRIPAC
