Embedding Transfer with Label Relaxation for Improved Metric Learning

Official PyTorch implementation of CVPR 2021 paper Embedding Transfer with Label Relaxation for Improved Metric Learning.

Embedding trnasfer with Relaxed Contrastive Loss improves performance, or reduces sizes and output dimensions of embedding model effectively.

This repository provides source code of experiments on three datasets (CUB-200-2011, Cars-196 and Stanford Online Products) including relaxed contrastive loss, relaxed MS loss, and 6 other knowledge distillation or embedding transfer methods such as:

FitNet, Fitnets: hints for thin deep nets
Attention, Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
CRD, Contrastive Representation Distillation
DarkRank, Darkrank: Accelerating Deep Metric Learning via Cross Sample Similarities Transfer
PKT, Learning Deep Representations with Probabilistic Knowledge Transfer
RKD, Relational Knowledge Distillation

Overview

Relaxed Contrastive Loss

Relaxed contrastive loss exploits pairwise similarities between samples in the source embedding space as relaxed labels, and transfers them through a contrastive loss used for learning target embedding models.

Experimental Restuls

Our method achieves the state of the art when embedding dimension is 512, and is as competitive as recent metric learning models even with a substantially smaller embedding dimension. In all experiments, it is superior to other embedding transfer techniques.

Requirements

Python3
PyTorch (> 1.0)
NumPy
tqdm
wandb
Pytorch-Metric-Learning

Prepare Datasets

Download three public benchmarks for deep metric learning.
- CUB-200-2011
- Cars-196 (Img, Annotation)
- Stanford Online Products (Link)
Extract the tgz or zip file into ./data/ (Exceptionally, for Cars-196, put the files in a ./data/cars196)

Prepare Pretrained Source models

Download the pretrained source models using ./scripts/download_pretrained_source_models.sh.

sh scripts/download_pretrained_source_models.sh

Training Target Embedding Network with Relaxed Contrastive Loss

Self-transfer Setting

Transfer the knowledge of source model to target model with the same architecture and embedding dimension for performance improvement.
Source Embedding Network (BN–Inception, 512 dim) 🠢 Target Embedding Network (BN–Inception, 512 dim)

CUB-200-2011

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model bn_inception \
--embedding-size 512 --batch-size 90 --IPC 2 --dataset cub --epochs 90 \
--source-ckpt ./pretrained_source/bn_inception/cub_bn_inception_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

Cars-196

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model bn_inception \ 
--embedding-size 512 --batch-size 90 --IPC 2 --dataset cars --epochs 90 \
--source-ckpt ./pretrained_source/bn_inception/cars_bn_inception_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

SOP

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model bn_inception \
--embedding-size 512 --batch-size 90 --IPC 2 --dataset SOP --epochs 150 \
--source-ckpt ./pretrained_source/bn_inception/SOP_bn_inception_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

		CUB-200-2011			Cars-196			SOP
Method	Backbone	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]
Source: PA	BN⁵¹²	69.1	78.9	86.1	86.4	91.9	95.0	79.2	90.7	96.2
FitNet	BN⁵¹²	69.9	79.5	86.2	87.6	92.2	95.6	78.7	90.4	96.1
Attention	BN⁵¹²	66.3	76.2	84.5	84.7	90.6	94.2	78.2	90.4	96.2
CRD	BN⁵¹²	67.7	78.1	85.7	85.3	91.1	94.8	78.1	90.2	95.8
DarkRank	BN⁵¹²	66.7	76.5	84.8	84.0	90.0	93.8	75.7	88.3	95.3
PKT	BN⁵¹²	69.1	78.8	86.4	86.4	91.6	94.9	78.4	90.2	96.0
RKD	BN⁵¹²	70.9	80.8	87.5	88.9	93.5	96.4	78.5	90.2	96.0
Ours	BN⁵¹²	72.1	81.3	87.6	89.6	94.0	96.5	79.8	91.1	96.3

Dimensionality Reduction Setting

Transfer to the same architecture with a lower embedding dimension for efficient image retrieval.
Source Embedding Network (BN–Inception, 512 dim) 🠢 Target Embedding Network (BN–Inception, 64 dim)

CUB-200-2011

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model bn_inception \
--embedding-size 64 --batch-size 90 --IPC 2 --dataset cub --epochs 90 \
--source-ckpt ./pretrained_source/bn_inception/cub_bn_inception_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

Cars-196

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model bn_inception \
--embedding-size 64 --batch-size 90 --IPC 2 --dataset cars --epochs 90 \
--source-ckpt ./pretrained_source/bn_inception/cars_bn_inception_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

SOP

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model bn_inception \
--embedding-size 64 --batch-size 90 --IPC 2 --dataset SOP --epochs 150 \
--source-ckpt ./pretrained_source/bn_inception/SOP_bn_inception_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

		CUB-200-2011			Cars-196			SOP
Method	Backbone	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]
Source: PA	BN⁵¹²	69.1	78.9	86.1	86.4	91.9	95.0	79.2	90.7	96.2
FitNet	BN⁶⁴	62.3	73.8	83.0	81.2	87.7	92.5	76.6	89.3	95.4
Attention	BN⁶⁴	58.3	69.4	79.1	79.2	86.7	91.8	76.3	89.2	95.4
CRD	BN⁶⁴	60.9	72.7	81.7	79.2	87.2	92.1	75.5	88.3	95.3
DarkRank	BN⁶⁴	63.5	74.3	83.1	78.1	85.9	91.1	73.9	87.5	94.8
PKT	BN⁶⁴	63.6	75.8	84.0	82.2	88.7	93.5	74.6	87.3	94.2
RKD	BN⁶⁴	65.8	76.7	85.0	83.7	89.9	94.1	70.2	83.8	92.1
Ours	BN⁶⁴	67.4	78.0	85.9	86.5	92.3	95.3	76.3	88.6	94.8

Model Compression Setting

Transfer to a smaller network with a lower embedding dimension for usage in low-power and resource limited devices.
Source Embedding Network (ResNet50, 512 dim) 🠢 Target Embedding Network (ResNet18, 128 dim)

CUB-200-2011

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model resnet18 \
--embedding-size 128 --batch-size 90 --IPC 2 --dataset cub --epochs 90 \
--source-ckpt ./pretrained_source/resnet50/cub_resnet50_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

Cars-196

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model resnet18 \
--embedding-size 128 --batch-size 90 --IPC 2 --dataset cars --epochs 90 \
--source-ckpt ./pretrained_source/resnet50/cars_resnet50_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

SOP

python code/train_target.py --gpu-id 0 --loss Relaxed_Contra --model resnet18 \
--embedding-size 128 --batch-size 90 --IPC 2 --dataset SOP --epochs 150 \
--source-ckpt ./pretrained_source/resnet50/SOP_resnet50_512dim_Proxy_Anchor_ckpt.pth \
--view 2 --sigma 1 --delta 1 --save 1

		CUB-200-2011			Cars-196			SOP
Method	Backbone	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]
Source: PA	R50⁵¹²	69.9	79.6	88.6	87.7	92.7	95.5	80.5	91.8	98.8
FitNet	R18¹²⁸	61.0	72.2	81.1	78.5	86.0	91.4	76.7	89.4	95.5
Attention	R18¹²⁸	61.0	71.7	81.5	78.6	85.9	91.0	76.4	89.3	95.5
CRD	R18¹²⁸	62.8	73.8	83.2	80.6	87.9	92.5	76.2	88.9	95.3
DarkRank	R18¹²⁸	61.2	72.5	82.0	75.3	83.6	89.4	72.7	86.7	94.5
PKT	R18¹²⁸	65.0	75.6	84.8	81.6	88.8	93.4	76.9	89.2	95.5
RKD	R18¹²⁸	65.8	76.3	84.8	84.2	90.4	94.3	75.7	88.4	95.1
Ours	R18¹²⁸	66.6	78.1	85.9	86.0	91.6	95.3	78.4	90.4	96.1

Train Source Embedding Network

This repository also provides code for training source embedding network with several losses as well as proxy-anchor loss. For details on how to train the source embedding network, please see the Proxy-Anchor Loss repository.

For example, training source embedding network (BN–Inception, 512 dim) with Proxy-Anchor Loss on the CUB-200-2011 as

python code/train_source.py --gpu-id 0 --loss Proxy_Anchor --model bn_inception \
--embedding-size 512 --batch-size 180 --lr 1e-4 --dataset cub \
--warm 1 --bn-freeze 1 --lr-decay-step 10

Evaluating Image Retrieval

Follow the below steps to evaluate the trained model.
Trained best model will be saved in the ./logs/folder_name.

# The parameters should be changed according to the model to be evaluated.
python code/evaluate.py --gpu-id 0 \
                   --batch-size 120 \
                   --model bn_inception \
                   --embedding-size 512 \
                   --dataset cub \
                   --ckpt /set/your/model/path/best_model.pth

Acknowledgements

Our source code is modified and adapted on these great repositories:

Citation

If you use this method or this code in your research, please cite as:

@inproceedings{kim2021embedding,
  title={Embedding Transfer with Label Relaxation for Improved Metric Learning},
  author={Kim, Sungyeon and Kim, Dongwon and Cho, Minsu and Kwak, Suha},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2021}
}

Official PyTorch Implementation of Embedding Transfer with Label Relaxation for Improved Metric Learning, CVPR 2021

Related tags

Overview

Embedding Transfer with Label Relaxation for Improved Metric Learning

Overview

Relaxed Contrastive Loss

Experimental Restuls

Requirements

Prepare Datasets

Prepare Pretrained Source models

Training Target Embedding Network with Relaxed Contrastive Loss

Self-transfer Setting

CUB-200-2011

Cars-196

SOP

Dimensionality Reduction Setting

CUB-200-2011

Cars-196

SOP

Model Compression Setting

CUB-200-2011

Cars-196

SOP

Train Source Embedding Network

Evaluating Image Retrieval

Acknowledgements

Citation

Owner

Sungyeon Kim

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Official implementation of deep Gaussian process (DGP)-based multi-speaker speech synthesis with PyTorch.

The second project in Python course on FCC

Parametric Contrastive Learning (ICCV2021)

Convnet transfer - Code for paper How transferable are features in deep neural networks?

Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch

Project page for the paper Semi-Supervised Raw-to-Raw Mapping 2021.

Official PyTorch implementation of "Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks" (AAAI 2022)

A tool to prepare websites grabbed with wget for local viewing.

Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

🔥3D-RecGAN in Tensorflow (ICCV Workshops 2017)

Deep Learning Models for Causal Inference

Code repository for "Free View Synthesis", ECCV 2020.

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

RoMa: A lightweight library to deal with 3D rotations in PyTorch.

Implementation of neural class expression synthesizers

CUP-DNN is a deep neural network model used to predict tissues of origin for cancers of unknown of primary.