UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

VITA

Last update: Dec 3, 2022

Related tags

Deep Learning UMEC

Overview

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Code for this paper UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Jiayi Shen, Haotao Wang*, Shupeng Gui*, Jianchao Tan, Zhangyang Wang, and Ji Liu

Overview

We propose a unified model and embedding compression (UMEC) framework to hammer an efficient neural network-based recommendation system. Our framework jointly learns input feature selection and neural network compression together, and solve them as an end-to-end resource-constrained optimization problem using ADMM.

Main Results

Implementation

We perform the compression process on DLRM, which is a public recommendation model. Our proposed algorithm is mainly implemented inrc_optimizer.py and rc_utils.py. Other files are inherited from the original DLRM code repo, with several lines of modifications, such as joint_train.py, input_selection.py, and finetune.py, in order to plug in our algorithm. To run the code in this repo, you have to first follow the instructions in the original repo to download the dataset, and run the corresponding training part, to finish the data preprocessing process.

Unified Framework

To implement to joint training and compressing under the resource constraint, please see the script in script/joint_train.sh.

Input feature selection

To implement to joint training and compressing under the resource constraint, please see the script in script/input_selection.sh.

Acknowledgement

We thank the author of DLRM for providing a recommendation model benchmark.

Citation

@inproceedings{
shen2021umec,
title={{\{}UMEC{\}}: Unified model and embedding compression for efficient recommendation systems},
author={Jiayi Shen and Haotao Wang and Shupeng Gui and Jianchao Tan and Zhangyang Wang and Ji Liu},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=BM---bH_RSh}
}

You might also like...

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

730 Jan 9, 2023

Paddle implementation for "Highly Efficient Knowledge Graph Embedding Learning with Closed-Form Orthogonal Procrustes Analysis" (NAACL 2021)

ProcrustEs-KGE Paddle implementation for Highly Efficient Knowledge Graph Embedding Learning with Orthogonal Procrustes Analysis 🙈 A more detailed re

4 Jun 9, 2021

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery Lorien is an infrastructure to massively explore/benchmark the best sc

45 Dec 12, 2022

Code for the ICCV 2021 Workshop paper: A Unified Efficient Pyramid Transformer for Semantic Segmentation.

Unified-EPT Code for the ICCV 2021 Workshop paper: A Unified Efficient Pyramid Transformer for Semantic Segmentation. Installation Linux, CUDA=10.0,

29 Aug 23, 2022

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

SeerNet This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is

3 May 1, 2022

Code for Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data? (SDM 2022)

Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data? (SDM 2022) We consider how a user of a web servi

20 Aug 21, 2022

Comments

Can I change the model flops calc to MLP model prediction ?

As different model has different s_ub channels and each of them may have some dependency, so it is hard for us to get a truly flops given by s. So I think it is easy to get flops by MLP model. But when I replaced it to mlp, loss1 value is 0 and least_norm_s are 0. I can only update s. Why?

opened by ailias 0

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Related tags

Overview

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Overview

Main Results

Implementation

Unified Framework

Input feature selection

Acknowledgement

Citation

You might also like...

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

Paddle implementation for "Highly Efficient Knowledge Graph Embedding Learning with Closed-Form Orthogonal Procrustes Analysis" (NAACL 2021)

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

Code for the ICCV 2021 Workshop paper: A Unified Efficient Pyramid Transformer for Semantic Segmentation.

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

DA2Lite is an automated model compression toolkit for PyTorch.

Pytorch implementation for Patient Knowledge Distillation for BERT Model Compression

Spatio-Temporal Entropy Model (STEM) for end-to-end leaned video compression.

Code for Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data? (SDM 2022)

Comments

Can I change the model flops calc to MLP model prediction ?

Owner

VITA

Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme (NeurIPS2021)

An efficient and easy-to-use deep learning model compression framework

Deep Text Search is an AI-powered multilingual text search and recommendation engine with state-of-the-art transformer-based multilingual text embedding (50+ languages).

MMRazor: a model compression toolkit for model slimming and AutoML

Product-based-recommendation-system - A product based recommendation system which uses Machine learning algorithm such as KNN and cosine similarity

Code for KDD'20 "An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph"

Recommendationsystem - Movie-recommendation - matrixfactorization colloborative filtering recommendation system user

PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].

Best Practices on Recommendation Systems

paper list in the area of reinforcenment learning for recommendation systems

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Related tags

Overview

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Overview

Main Results

Implementation

Unified Framework

Input feature selection

Acknowledgement

Citation

You might also like...

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

Paddle implementation for "Highly Efficient Knowledge Graph Embedding Learning with Closed-Form Orthogonal Procrustes Analysis" (NAACL 2021)

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

Code for the ICCV 2021 Workshop paper: A Unified Efficient Pyramid Transformer for Semantic Segmentation.

This is the pytorch implementation for the paper: *Learning Accurate Performance Predictors for Ultrafast Automated Model Compression*, which is in submission to TPAMI

DA2Lite is an automated model compression toolkit for PyTorch.

Pytorch implementation for Patient Knowledge Distillation for BERT Model Compression

Spatio-Temporal Entropy Model (STEM) for end-to-end leaned video compression.

Code for Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data? (SDM 2022)

Comments

Can I change the model flops calc to MLP model prediction ?

Owner

VITA

Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme (NeurIPS2021)

An efficient and easy-to-use deep learning model compression framework

Deep Text Search is an AI-powered multilingual text search and recommendation engine with state-of-the-art transformer-based multilingual text embedding (50+ languages).

MMRazor: a model compression toolkit for model slimming and AutoML

Product-based-recommendation-system - A product based recommendation system which uses Machine learning algorithm such as KNN and cosine similarity

Code for KDD'20 "An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph"

Recommendationsystem - Movie-recommendation - matrixfactorization colloborative filtering recommendation system user

PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].

Best Practices on Recommendation Systems

paper list in the area of reinforcenment learning for recommendation systems

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI