[CVPR 2022 Oral] Rethinking Minimal Sufficient Representation in Contrastive Learning

Last update: Nov 23, 2022

Related tags

Overview

Rethinking Minimal Sufficient Representation in Contrastive Learning

PyTorch implementation of
Rethinking Minimal Sufficient Representation in Contrastive Learning
Haoqing Wang, Xun Guo, Zhi-hong Deng, Yan Lu

CVPR 2022 Oral

Abstract

Contrastive learning between different views of the data achieves outstanding success in the field of self-supervised representation learning and the learned representations are useful in broad downstream tasks. Since all supervision information for one view comes from the other view, contrastive learning approximately obtains the minimal sufficient representation which contains the shared information and eliminates the non-shared information between views. Considering the diversity of the downstream tasks, it cannot be guaranteed that all task-relevant information is shared between views. Therefore, we assume the non-shared task-relevant information cannot be ignored and theoretically prove that the minimal sufficient representation in contrastive learning is not sufficient for the downstream tasks, which causes performance degradation. This reveals a new problem that the contrastive learning models have the risk of over-fitting to the shared information between views. To alleviate this problem, we propose to increase the mutual information between the representation and input as regularization to approximately introduce more task-relevant information, since we cannot utilize any downstream task information during training. Extensive experiments verify the rationality of our analysis and the effectiveness of our method. It significantly improves the performance of several classic contrastive learning models in downstream tasks.

Citation

If you use this code for your research, please cite our paper:

@inproceedings{wang2022rethinking,
  title={Rethinking Minimal Sufficient Representation in Contrastive Learning},
  author={Wang, Haoqing and Deng, Zhi-hong and Guo, Xun and Lu, Yan},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={xx--xx},
  year={2022}
}

Note

This code is built upon the implementation from moco and CLAE.
The dataset, model, and code are for non-commercial research purposes only.

[CVPR 2022 Oral] Rethinking Minimal Sufficient Representation in Contrastive Learning

Related tags

Overview

Rethinking Minimal Sufficient Representation in Contrastive Learning

Abstract

Citation

Note

Owner

Official Repository for our ICCV2021 paper: Continual Learning on Noisy Data Streams via Self-Purified Replay

Exporter for Storage Area Network (SAN)

TGS Salt Identification Challenge

🔮 Execution time predictions for deep neural network training iterations across different GPUs.

LowRankModels.jl is a julia package for modeling and fitting generalized low rank models.

This repository contains code used to audit the stability of personality predictions made by two algorithmic hiring systems

Groceries ARL: Association Rules (Birliktelik Kuralı)

A Keras implementation of CapsNet in the paper: Sara Sabour, Nicholas Frosst, Geoffrey E Hinton. Dynamic Routing Between Capsules

TensorFlow implementation of PHM (Parameterization of Hypercomplex Multiplication)

这是一个mobilenet-yolov4-lite的库，把yolov4主干网络修改成了mobilenet，修改了Panet的卷积组成，使参数量大幅度缩小。

Bare bones use-case for deploying a containerized web app (built in streamlit) on AWS.

a reimplementation of Holistically-Nested Edge Detection in PyTorch

A custom-designed Spider Robot trained to walk using Deep RL in a PyBullet Simulation

Vehicle detection using machine learning and computer vision techniques for Udacity's Self-Driving Car Engineer Nanodegree.

Learning to Adapt Structured Output Space for Semantic Segmentation, CVPR 2018 (spotlight)

Code for paper Adaptively Aligned Image Captioning via Adaptive Attention Time

PyTorch implementation for our paper Learning Character-Agnostic Motion for Motion Retargeting in 2D, SIGGRAPH 2019

Official PyTorch implementation of PICCOLO: Point-Cloud Centric Omnidirectional Localization (ICCV 2021)

Show-attend-and-tell - TensorFlow Implementation of "Show, Attend and Tell"

这是一个deeplabv3-plus-pytorch的源码，可以用于训练自己的模型。