Rotation Robust Descriptors

Last update: Nov 15, 2022

Overview

RoRD

Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching

Project Page | Paper link

Evaluation and Datasets

MMA : Training on PhotoTourism and testing on HPatches and proposed Rotated HPatches
Pose Estimation : Training on same PhotoTourism datasets as used for MMA and testing on proposed DiverseView
Visual Place Recognition : Oxford RobotCar training sequence and testing sequence

Pretrained Models

Download models from Google Drive (73.9 MB) in the base directory.

Evaluating RoRD

You can evaluate RoRD on demo images or replace it with your custom images.

Dependencies can be installed in a conda of virtualenv by running:
1. pip install -r requirements.txt
python extractMatch.py <rgb_image1> <rgb_image2> --model_file <path to the model file RoRD>
Example:
python extractMatch.py demo/rgb/rgb1_1.jpg demo/rgb/rgb1_2.jpg --model_file models/rord.pth
This should give you output like this:

RoRD

SIFT

DiverseView Dataset

Download dataset from Google Drive (97.8 MB) in the base directory (only needed if you want to evaluate on DiverseView Dataset).

Evaluation on DiverseView Dataset

The DiverseView Dataset is a custom dataset consisting of 4 scenes with images having high-angle camera rotations and viewpoint changes.

Pose estimation on single image pair of DiverseView dataset:
1. cd demo
2. python register.py --rgb1 <path to rgb image 1> --rgb2 <path to rgb image 2> --depth1 <path to depth image 1> --depth2 <path to depth image 2> --model_rord <path to the model file RoRD>
3. Example:
  python register.py --rgb1 rgb/rgb2_1.jpg --rgb2 rgb/rgb2_2.jpg --depth1 depth/depth2_1.png --depth2 depth/depth2_2.png --model_rord ../models/rord.pth
4. This should give you output like this:

RoRD matches in perspective view

RoRD matches in orthographic view

To visualize the registered point cloud, use --viz3d command:
1. python register.py --rgb1 rgb/rgb2_1.jpg --rgb2 rgb/rgb2_2.jpg --depth1 depth/depth2_1.png --depth2 depth/depth2_2.png --model_rord ../models/rord.pth --viz3d

PointCloud registration using correspondences

Pose estimation on a sequence of DiverseView dataset:
1. cd evaluation/DiverseView/
2. python evalRT.py --dataset <path to DiverseView dataset> --sequence <sequence name> --model_rord <path to RoRD model> --output_dir <name of output dir>
3. Example:
  1. python evalRT.py --dataset /path/to/preprocessed/ --sequence data1 --model_rord ../../models/rord.pth --output_dir out
4. This would generate out folder containing predicted transformations and matching results in out/vis folder, containing images like below:

RoRD

Training RoRD on PhotoTourism Images

Training using rotation homographies with initialization from D2Net weights (Download base models as mentioned in Pretrained Models).
Download branderburg_gate dataset that is used in the configs/train_scenes_small.txt from here(5.3 Gb) in phototourism folder.

Folder stucture should be:

phototourism/  
___ brandenburg_gate  
___ ___ dense  
___ ___	___ images  
___ ___	___ stereo  
___ ___	___ sparse

python trainPT_ipr.py --dataset_path <path_to_phototourism_folder> --init_model models/d2net.pth --plot

TO-DO

Provide VPR code
Provide combine training of RoRD + D2Net
Provide code for calculating error in Diverseview Dataset

Credits

Our base model is borrowed from D2-Net.

BibTex

If you use this code in your project, please cite the following paper:

@misc{rord2021,
      title={RoRD: Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching}, 
      author={Udit Singh Parihar and Aniket Gujarathi and Kinal Mehta and Satyajit Tourani and Sourav Garg and Michael Milford and K. Madhava Krishna},
      year={2021},
      eprint={2103.08573},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Rotation Robust Descriptors

Related tags

Overview

RoRD

Evaluation and Datasets

Pretrained Models

Evaluating RoRD

RoRD

SIFT

DiverseView Dataset

Evaluation on DiverseView Dataset

RoRD matches in perspective view

RoRD matches in orthographic view

PointCloud registration using correspondences

RoRD

Training RoRD on PhotoTourism Images

TO-DO

Credits

BibTex

Owner

Udit Singh Parihar

IndoNLI: A Natural Language Inference Dataset for Indonesian

This repository contains the exercises and its solution contained in the book "An Introduction to Statistical Learning" in python.

[CVPR 2022 Oral] MixFormer: End-to-End Tracking with Iterative Mixed Attention

We are More than Our JOints: Predicting How 3D Bodies Move

Implementation for "Exploiting Aliasing for Manga Restoration" (CVPR 2021)

TrTr: Visual Tracking with Transformer

Gauge equivariant mesh cnn

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes (AAAI2022)

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:

NLU Dataset Diagnostics

PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision

This project is for a Twitter bot that monitors a bird feeder in my backyard. Any detected birds are identified and posted to Twitter.

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.

Generalized Data Weighting via Class-level Gradient Manipulation

The code for our paper CrossFormer: A Versatile Vision Transformer Based on Cross-scale Attention.

Justmagic - Use a function as a method with this mystic script, like in Nim

Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"

Code in PyTorch for the convex combination linear IAF and the Householder Flow, J.M. Tomczak & M. Welling