[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Last update: Nov 21, 2022

Overview

Discriminative Region-based Multi-Label Zero-Shot Learning (ICCV 2021)

[arXiv][Project page >> coming soon]

Sanath Narayan^, Akshita Gupta^, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah

( 🌟 denotes equal contribution)

Installation

The codebase is built on PyTorch 1.1.0 and tested on Ubuntu 16.04 environment (Python3.6, CUDA9.0, cuDNN7.5).

For installing, follow these intructions

conda create -n mlzsl python=3.6
conda activate mlzsl
conda install pytorch=1.1 torchvision=0.3 cudatoolkit=9.0 -c pytorch
pip install matplotlib scikit-image scikit-learn opencv-python yacs joblib natsort h5py tqdm pandas

Install warmup scheduler

cd pytorch-gradual-warmup-lr; python setup.py install; cd ..

Attention Visualization

Results


Our approach on NUS-WIDE Dataset.	Our approach on OpenImages Dataset.

Training and Evaluation

NUS-WIDE

Step 1: Data preparation

Download pre-computed features from here and store them at features folder inside BiAM/datasets/NUS-WIDE directory.
[Optional] You can extract the features on your own by using the original NUS-WIDE dataset from here and run the below script:

python feature_extraction/extract_nus_wide.py

Step 2: Training from scratch

To train and evaluate multi-label zero-shot learning model on full NUS-WIDE dataset, please run:

sh scripts/train_nus.sh

Step 3: Evaluation using pretrained weights

To evaluate the multi-label zero-shot model on NUS-WIDE. You can download the pretrained weights from here and store them at NUS-WIDE folder inside pretrained_weights directory.

sh scripts/evaluate_nus.sh

OPEN-IMAGES

Step 1: Data preparation

Please download the annotations for training, validation, and testing into this folder.
Store the annotations inside BiAM/datasets/OpenImages.
To extract the features for OpenImages-v4 dataset run the below scripts for crawling the images and extracting features of them:

## Crawl the images from web
python ./datasets/OpenImages/download_imgs.py  #`data_set` == `train`: download images into `./image_data/train/`
python ./datasets/OpenImages/download_imgs.py  #`data_set` == `validation`: download images into `./image_data/validation/`
python ./datasets/OpenImages/download_imgs.py  #`data_set` == `test`: download images into `./image_data/test/`

## Run feature extraction codes for all the 3 splits
python feature_extraction/extract_openimages_train.py
python feature_extraction/extract_openimages_test.py
python feature_extraction/extract_openimages_val.py

Step 2: Training from scratch

To train and evaluate multi-label zero-shot learning model on full OpenImages-v4 dataset, please run:

sh scripts/train_openimages.sh
sh scripts/evaluate_openimages.sh

Step 3: Evaluation using pretrained weights

To evaluate the multi-label zero-shot model on OpenImages. You can download the pretrained weights from here and store them at OPENIMAGES folder inside pretrained_weights directory.

sh scripts/evaluate_openimages.sh

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

Citation

If you find this repository useful, please consider giving a star ⭐ and citation 🎊 :

@article{narayan2021discriminative,
title={Discriminative Region-based Multi-Label Zero-Shot Learning},
author={Narayan, Sanath and Gupta, Akshita and Khan, Salman and  Khan, Fahad Shahbaz and Shao, Ling and Shah, Mubarak},
journal={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
publisher = {IEEE},
year={2021}
}

Contact

Should you have any question, please contact 📧 [email protected]

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Related tags

Overview

Discriminative Region-based Multi-Label Zero-Shot Learning (ICCV 2021)

Sanath Narayan*, Akshita Gupta*, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah

Installation

Attention Visualization

Results

Training and Evaluation

NUS-WIDE

Step 1: Data preparation

Step 2: Training from scratch

Step 3: Evaluation using pretrained weights

OPEN-IMAGES

Step 1: Data preparation

Step 2: Training from scratch

Step 3: Evaluation using pretrained weights

License

Citation

Contact

Owner

Akshita Gupta

PFFDTD is an open-source FDTD simulator for 3D room acoustics

GT4SD, an open-source library to accelerate hypothesis generation in the scientific discovery process.

Can we learn gradients by Hamiltonian Neural Networks?

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

A heterogeneous entity-augmented academic language model based on Open Academic Graph (OAG)

The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer

Python Classes: Medical Insurance Project using Object Oriented Programming Concepts

Face Synthetics dataset is a collection of diverse synthetic face images with ground truth labels.

SimulLR - PyTorch Implementation of SimulLR

A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations.

NeRF visualization library under construction

TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition

Non-Official Pytorch implementation of "Face Identity Disentanglement via Latent Space Mapping" https://arxiv.org/abs/2005.07728 Using StyleGAN2 instead of StyleGAN

Unsupervised clustering of high content screen samples

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral]

Numerical Methods with Python, Numpy and Matplotlib

This code uses generative adversarial networks to generate diverse task allocation plans for Multi-agent teams.

Computer vision - fun segmentation experience using classic and deep tools :)

A Pytorch Implementation of Domain adaptation of object detector using scissor-like networks

Graph Neural Networks with Keras and Tensorflow 2.

Sanath Narayan^, Akshita Gupta^, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah