Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"

Last update: Nov 21, 2022

Related tags

Deep Learning GPR1200

Overview

GPR1200 Dataset

GPR1200: A Benchmark for General-Purpose Content-Based Image Retrieval (ArXiv)

Konstantin Schall, Kai Uwe Barthel, Nico Hezel, Klaus Jung

Visual Computing Group HTW Berlin

Similar to most vision related tasks, deep learning models have taken over in the field of content-based image retrieval (CBIR) over the course of the last decade. However, most publications that aim to optimise neural networks for CBIR, train and test their models on domain specific datasets. It is therefore unclear, if those networks can be used as a general-purpose image feature extractor. After analyzing popular image retrieval test sets we decided to manually curate GPR1200, an easy to use and accessible but challenging benchmark dataset with 1200 categories and 10 class examples. Classes and images were manually selected from six publicly available datasets of different image areas, ensuring high class diversity and clean class boundaries.

Results:

Download Instructions:

The images are available under this link. Unziping the content will result in an "images" folder, which contains all 12000 images. Each filename consists of a combination of the GPR1200 category ID and the original name:
"{category ID}_{original name}.jpg

Evaluation Protocol:

Images are not devided into query and index sets for evaluation and the full mean average precision value is used as the metric. Instructions and evalution code can be found in this repository.

This notebook contains evaluation code for several models with Pytorch and the awesome timm library.

If you have precomputed embeddings for the dataset, you can run the eval script with the following command:

python ./eval/evaluate.py --evalfile-path '/path/to/embeddings' \
                            --mode 'embeddings' \
                            --dataset-path '/path/to/GPR1200/images'

In this case an evaluation file has to be provided that contains embeddings in the order created by the GPR1200 dataset object. This can be a npy file or a pickable python list.

GPR1200_dataset = GPR1200('/path/to/GPR1200/images')

If you work with local features, it is best to provide nearest neighbours indices. For this case run the evaluation script in the indices mode:

python ./eval/evaluate.py --evalfile-path='/path/to/indices' \
                            --mode='indices' \
                            --dataset-path='/path/to/GPR1200/images'

License Informations:

This dataset is available for for non-commercial research and educational purposes only and the copyright belongs to the original owners. If any of the images belongs to you and you would like it removed, please kindly inform us, we will remove it from our dataset immediately. Since all images were curated from other publicly available datasets, please visit the respective dataset websites for additional license informations.

Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"

Related tags

Overview

GPR1200 Dataset

Results:

Download Instructions:

Evaluation Protocol:

License Informations:

Owner

Visual Computing Group

A short code in python, Enchpyter, is able to encrypt and decrypt words as you determine, of course

Final project for machine learning (CSC 590). Detection of hepatitis C and progression through blood samples.

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

Geometry-Free View Synthesis: Transformers and no 3D Priors

MPViT:Multi-Path Vision Transformer for Dense Prediction

Automatic Image Background Subtraction

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Few-NERD: Not Only a Few-shot NER Dataset

COLMAP - Structure-from-Motion and Multi-View Stereo

CVPR2022 paper "Dense Learning based Semi-Supervised Object Detection"

Clean Machine Learning, a Coding Kata

Frigate - NVR With Realtime Object Detection for IP Cameras

Geometric Vector Perceptron --- a rotation-equivariant GNN for learning from biomolecular structure

This repository contains a PyTorch implementation of the paper Learning to Assimilate in Chaotic Dynamical Systems.

Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].

REBEL: Relation Extraction By End-to-end Language generation

Paddle pit - Rethinking Spatial Dimensions of Vision Transformers

DTCN IJCAI - Sequential prediction learning framework and algorithm

This repository contains the code for TACL2021 paper: SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization