Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Last update: Jan 07, 2023

Related tags

Overview

Trainable multi-codebook quantization

This repository implements a utility for use with PyTorch, and ideally GPUs, for training an efficient quantizer based on multiple single-byte codebooks. The prototypical scenario is that you have some distribution over vectors in some space, say, of dimension 512, that might come from a neural net embedding, and you want a means of encoding a vector into a short sequence of bytes (say, 4 or 8 bytes) that can be used to reconstruct the vector with minimal expected loss, measured as squared distance, i.e. squared l2 loss.

This repository provides Quantizer object that lets you do this quantization, and an associated QuantizerTrainer object that you can use to train the Quantizer. For example, you might invoke the QuantizerTrainer with 20,000 minibatches of vectors.

Usage

Installation

python3 setup.py install

Example

import torch
import quantization

trainer = quantization.QuantizerTrainer(dim=256, bytes_per_frame=4,
                                        device=torch.device('cuda'))
while not trainer.done():
   # let x be some tensor of shape (*, dim), that you will train on
   # (should not be the same on each minibatch)
   trainer.step(x)
quantizer = trainer.get_quantizer()

# let x be some tensor of shape (*, dim)..
encoded = quantizer.encode(x)  # (*, 4), dtype=uint8
x_approx = quantizer.decode(quantizer.encode(x))

To avoid versioning issues and so on, it may be easier to just include quantization.py in your repository directly (and add its requirements to your requirements.txt).

Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Related tags

Overview

Trainable multi-codebook quantization

Usage

Installation

Example

Owner

Daniel Povey

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

Springer Link Download Module for Python

Applying PVT to Semantic Segmentation

CVPR 2021 Challenge on Super-Resolution Space

Zero-Cost Proxies for Lightweight NAS

PassAPI is a password generator in hash format and fully developed in Python, with the aim of teaching how to handle and build

Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

A Python Package for Portfolio Optimization using the Critical Line Algorithm

Pyramid addon for OpenAPI3 validation of requests and responses.

DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral

Official repo for our 3DV 2021 paper "Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized Refinements".

Computer Vision Paper Reviews with Key Summary of paper, End to End Code Practice and Jupyter Notebook converted papers

ManipNet: Neural Manipulation Synthesis with a Hand-Object Spatial Representation - SIGGRAPH 2021

Enabling dynamic analysis of Legacy Embedded Systems in full emulated environment

Identifying a Training-Set Attack’s Target Using Renormalized Influence Estimation

Fast RFC3339 compliant Python date-time library

A few stylization coreML models that I've trained with CreateML

Underwater industrial application yolov5m6

Fake-user-agent-traffic-geneator - Python CLI Tool to generate fake traffic against URLs with configurable user-agents