UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Last update: Dec 03, 2022

Related tags

Deep Learning UMEC

Overview

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Code for this paper UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Jiayi Shen, Haotao Wang*, Shupeng Gui*, Jianchao Tan, Zhangyang Wang, and Ji Liu

Overview

We propose a unified model and embedding compression (UMEC) framework to hammer an efficient neural network-based recommendation system. Our framework jointly learns input feature selection and neural network compression together, and solve them as an end-to-end resource-constrained optimization problem using ADMM.

Main Results

Implementation

We perform the compression process on DLRM, which is a public recommendation model. Our proposed algorithm is mainly implemented inrc_optimizer.py and rc_utils.py. Other files are inherited from the original DLRM code repo, with several lines of modifications, such as joint_train.py, input_selection.py, and finetune.py, in order to plug in our algorithm. To run the code in this repo, you have to first follow the instructions in the original repo to download the dataset, and run the corresponding training part, to finish the data preprocessing process.

Unified Framework

To implement to joint training and compressing under the resource constraint, please see the script in script/joint_train.sh.

Input feature selection

To implement to joint training and compressing under the resource constraint, please see the script in script/input_selection.sh.

Acknowledgement

We thank the author of DLRM for providing a recommendation model benchmark.

Citation

@inproceedings{
shen2021umec,
title={{\{}UMEC{\}}: Unified model and embedding compression for efficient recommendation systems},
author={Jiayi Shen and Haotao Wang and Shupeng Gui and Jianchao Tan and Zhangyang Wang and Ji Liu},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=BM---bH_RSh}
}

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Related tags

Overview

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Overview

Main Results

Implementation

Unified Framework

Input feature selection

Acknowledgement

Citation

Owner

VITA

DC3: A Learning Method for Optimization with Hard Constraints

Chinese license plate recognition

RGB-stacking 🛑 🟩 🔷 for robotic manipulation

Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the operation. Intel iHD GPU (iGPU) support. NVIDIA GPU (dGPU) support.

[EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

PyTorch implementation of UNet++ (Nested U-Net).

State of the Art Neural Networks for Deep Learning

Code repo for EMNLP21 paper "Zero-Shot Information Extraction as a Unified Text-to-Triple Translation"

Code for "Diversity can be Transferred: Output Diversification for White- and Black-box Attacks"

Official Implementation and Dataset of "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency", CVPR 2021

Convert Pytorch model to onnx or tflite, and the converted model can be visualized by Netron

Efficient Two-Step Networks for Temporal Action Segmentation (Neurocomputing 2021)

[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"

Alpha-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression

Towards Interpretable Deep Metric Learning with Structural Matching

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones Code

Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper

Crab is a ﬂexible, fast recommender engine for Python that integrates classic information ﬁltering recommendation algorithms in the world of scientiﬁc Python packages (numpy, scipy, matplotlib).

Dual Attention Network for Scene Segmentation (CVPR2019)

ICLR21 Tent: Fully Test-Time Adaptation by Entropy Minimization