Learning Camera Localization via Dense Scene Matching, CVPR 2021

Last update: Dec 01, 2022

Overview

This repository contains the code for our CVPR 2021 paper, "Learning Camera Localization via Dense Scene Matching", by Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu and Ping Tan.

The paper presents a new method for scene-agnostic camera localization using dense scene matching (DSM), in which a cost volume is constructed between a query image and a scene. The cost volume and the corresponding scene coordinates are processed by a CNN to predict dense coordinates for the query image. The camera pose can then be solved with a PnP algorithm.
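To make the cost-volume idea concrete, here is a minimal NumPy sketch. It is not the implementation in this repository: the function name `build_cost_volume`, the use of cosine similarity as the matching score, and the top-k selection of scene candidates are all assumptions made for illustration. It correlates every query pixel feature with every scene point feature and keeps the best-scoring scene coordinates per pixel, which is roughly the kind of input a network could then refine into dense coordinates.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-8):
    """Scale vectors along `axis` to unit length (eps avoids division by zero)."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def build_cost_volume(query_feat, scene_feat, scene_xyz, k=4):
    """Hypothetical helper, not part of this repository.

    query_feat: (H, W, C) feature map of the query image
    scene_feat: (N, C) features of N scene points
    scene_xyz:  (N, 3) 3D coordinates of the same scene points
    Returns:
      cost:   (H, W, k) top-k cosine similarities per pixel, in descending order
      coords: (H, W, k, 3) scene coordinates of those top-k candidates
    """
    h, w, c = query_feat.shape
    q = l2_normalize(query_feat.reshape(-1, c))   # (H*W, C)
    s = l2_normalize(scene_feat)                  # (N, C)
    corr = q @ s.T                                # (H*W, N) cosine similarity
    top_idx = np.argsort(-corr, axis=1)[:, :k]    # best k scene points per pixel
    cost = np.take_along_axis(corr, top_idx, axis=1)
    coords = scene_xyz[top_idx]                   # (H*W, k, 3)
    return cost.reshape(h, w, k), coords.reshape(h, w, k, 3)
```

In this sketch the returned `cost` and `coords` arrays would be stacked and passed to a CNN. The predicted per-pixel 3D coordinates, together with their 2D pixel positions, form the 2D-3D correspondences that a PnP solver consumes.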

If you find this project useful, please cite:

@inproceedings{Tang2021Learning,
  title={Learning Camera Localization via Dense Scene Matching},
  author={Shitao Tang and Chengzhou Tang and Rui Huang and Siyu Zhu and Ping Tan},
  booktitle={Computer Vision and Pattern Recognition (CVPR)},
  year={2021}
}

Usage

Environment

The code has been tested with:
- pytorch=1.4.0
- lmdb (optional)
- yaml
- skimage
- opencv
- numpy=1.17
- tensorboard
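Before building the extensions, it can help to confirm that the packages above are importable. The following is a small sketch using only the standard library. The import names are assumptions based on how these packages are commonly imported (for example, `opencv` as `cv2` and `pytorch` as `torch`), and `check_dependencies` is a hypothetical helper, not part of this repository. It does not check version numbers.

```python
import importlib.util

# Map the package names listed above to their usual import names (assumed).
MODULES = {
    "pytorch": "torch",
    "lmdb": "lmdb",          # optional
    "yaml": "yaml",
    "skimage": "skimage",
    "opencv": "cv2",
    "numpy": "numpy",
    "tensorboard": "tensorboard",
}


def check_dependencies(modules=MODULES):
    """Return {package name: True if its module can be found, else False}."""
    return {
        name: importlib.util.find_spec(mod) is not None
        for name, mod in modules.items()
    }


if __name__ == "__main__":
    for name, found in check_dependencies().items():
        print(f"{name:12s} {'found' if found else 'MISSING'}")
```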

Installation

Build PyTorch operations

  cd libs/model/ops
  python setup.py install

Build PnP algorithm

  cd libs/utils/lm_pnp
  mkdir build
  cd build
  cmake ..
  make all

Train and Test

Download

You can download the trained models and label files for 7scenes, Cambridge, and ScanNet.

For 7scenes, you can use the prepared data below:

Chess Fire Heads Office Pumpkin Kitchen Stairs

For Cambridge landmarks, you can download image files here, and depths here.

Test

Please refer to configs/7scenes.yaml for a detailed explanation of how to set the label file path and image file path.
- 7scenes
```
python tools/video_test.py --config configs/7scenes.yaml
```
- Cambridge
```
python tools/video_test.py --config configs/cambridge.yaml
```

Train

We use a pretrained ResNet-FPN model.
```
  python tools/train_net.py
```
