Learned image compression

Last update: Dec 04, 2022

Overview

PyTorch code for our recent work, A Unified End-to-End Framework for Efficient Deep Image Compression.

As a first step, we release the code for Variational image compression with a scale hyperprior. We will update the code to the full implementation of our paper.

Prerequisites

Install the libraries required by this repo:

pip install -r requirements.txt

Data Preparation

First, prepare the training and validation data. The training data is from flickr.com; you can obtain it by following the description in CompressionData.

The validation data is the popular Kodak dataset:

bash data/download_kodak.sh

Training

For high bitrates (4096, 6144, 8192), out_channel_N is 192 and out_channel_M is 320 in 'config_high.json'. For low bitrates (256, 512, 1024, 2048), out_channel_N is 128 and out_channel_M is 192 in 'config_low.json'.
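The mapping above can be written down as a small lookup. This is only a sketch: the helper name `channels_for_rate` and its return shape are hypothetical and not part of this repo; the numbers and file names are copied from the paragraph above.

```python
# Rate settings listed in this README, grouped by the config file that covers them.
HIGH_RATES = (4096, 6144, 8192)
LOW_RATES = (256, 512, 1024, 2048)


def channels_for_rate(rate):
    """Return (config file name, out_channel_N, out_channel_M) for a rate setting.

    Hypothetical helper for illustration; the repo itself reads these
    values from the JSON config files.
    """
    if rate in HIGH_RATES:
        return "config_high.json", 192, 320
    if rate in LOW_RATES:
        return "config_low.json", 128, 192
    raise ValueError(f"unsupported rate setting: {rate}")


if __name__ == "__main__":
    for rate in LOW_RATES + HIGH_RATES:
        print(rate, channels_for_rate(rate))
```

Rejecting unknown values with a ValueError keeps a typo in the rate setting from silently picking the wrong network width.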

Details

PSNR experiments

For the high bitrate setting of 8192, we first train from scratch as follows.

CUDA_VISIBLE_DEVICES=0 python train.py --config examples/example/config_high.json -n baseline_8192 --train flicker_path --val kodak_path
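If you prefer to launch the run from a script, the command above could be assembled as follows. This is a minimal sketch: the helper names `build_train_command` and `build_env` are hypothetical, only the flags shown in the command above are used, and `flicker_path` and `kodak_path` remain placeholders for your own data directories.

```python
import os


def build_train_command(config, name, train_dir, val_dir):
    """Assemble the train.py invocation using only the flags shown in this README."""
    return [
        "python", "train.py",
        "--config", config,
        "-n", name,
        "--train", train_dir,
        "--val", val_dir,
    ]


def build_env(gpu_id="0"):
    """Copy the current environment and pin the run to one GPU."""
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = gpu_id
    return env


if __name__ == "__main__":
    cmd = build_train_command(
        "examples/example/config_high.json",
        "baseline_8192",
        "flicker_path",  # placeholder: path to the training images
        "kodak_path",    # placeholder: path to the Kodak images
    )
    print(" ".join(cmd))
    # To actually start training from the repo root:
    # import subprocess
    # subprocess.run(cmd, env=build_env("0"), check=True)
```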

For the other high bitrates (4096, 6144), we use the converged 8192 model as the pretrained model and set the learning rate to 1e-5. The number of training iterations is set to 500000.

Training for the low bitrates (256, 512, 1024, 2048) follows the same strategy.

MS-SSIM experiments

Change the distortion loss to (1-MS_SSIM) and fine-tune the pretrained model optimized for PSNR to accelerate training. You can find more details in our released paper. The training strategy is similar to that of the PSNR experiments.

If you find our code helpful for your research, please cite our paper. This code is for research use only.
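To make the change concrete, here is a minimal sketch of a rate-distortion objective with the distortion term swapped to (1-MS_SSIM). It assumes the common form loss = rate + lam * distortion; the function names and the weight `lam` are hypothetical, and the weighting used in this repo's train.py may differ.

```python
def msssim_distortion(ms_ssim_value):
    """Distortion term for MS-SSIM training: higher similarity gives lower distortion."""
    return 1.0 - ms_ssim_value


def rate_distortion_loss(bpp, distortion, lam):
    """Assumed objective: bits per pixel plus a weighted distortion term.

    `lam` is a hypothetical trade-off weight, not a value taken from this repo.
    """
    return bpp + lam * distortion


if __name__ == "__main__":
    # Example numbers are made up for illustration only.
    d = msssim_distortion(0.98)
    print(rate_distortion_loss(0.5, d, 10.0))
```

In a real training loop both inputs would be tensors (the estimated bitrate and an MS-SSIM score computed on the reconstruction), but the arithmetic is the same.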

@article{liu2020unified,
  title={A Unified End-to-End Framework for Efficient Deep Image Compression},
  author={Liu, Jiaheng and Lu, Guo and Hu, Zhihao and Xu, Dong},
  journal={arXiv preprint arXiv:2002.03370},
  year={2020}
}
