Code for AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network (ICCV 2021).

Last update: Dec 30, 2022

Overview

AA-RMVSNet

Code for AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network (ICCV 2021) in PyTorch.

Paper link: arXiv | CVF

Change Log

Jun 17, 2021: Initialize repo
Jun 27, 2021: Update code
Aug 10, 2021: Update paper link
Oct 14, 2021: Update bibtex

Data Preparation

Download the preprocessed DTU training data (also available at BaiduYun, PW: s2v2).
For other datasets, please follow the practice in Yao Yao's MVSNet repo.
The pretrained model is provided. Place it under ./checkpoints/.

How to run

Install required dependencies:

```
conda create -n drmvsnet python=3.6
conda activate drmvsnet
conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=10.0 -c pytorch
conda install -c conda-forge py-opencv plyfile tensorboardx
```

Set the dataset root directories as environment variables in env.sh.
Train AA-RMVSNet on the DTU dataset (note that training requires a large amount of GPU memory):
```
./scripts/train_dtu.sh
```
Predict depth maps and fuse them into point clouds for DTU:
```
./scripts/eval_dtu.sh
./scripts/fusion_dtu.sh
```
Predict depth maps and fuse them into point clouds for Tanks and Temples:
```
./scripts/eval_tnt.sh
./scripts/fusion_tnt.sh
```

Note: if you encounter permission errors, run `chmod +x <script_filename>` to make the script executable.
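
Since the scripts rely on the dataset roots set in env.sh, a small pre-flight check can catch a missing or mistyped path before a long run starts. The following is only a sketch: the function name `check_dataset_roots` is hypothetical and not part of this repository, and it assumes the roots are exported as environment variables in the current shell.

```shell
#!/bin/sh
# Pre-flight check (sketch). check_dataset_roots is a hypothetical helper,
# not part of this repository. Pass it the variable names that env.sh exports.

# check_dataset_roots NAME...
# Returns 0 only if every named variable is exported and points to a directory.
check_dataset_roots() {
    cdr_status=0
    for cdr_name in "$@"; do
        # printenv prints the value of an exported variable and fails if it is unset.
        if ! cdr_value=$(printenv "$cdr_name"); then
            echo "error: $cdr_name is not set" >&2
            cdr_status=1
        elif [ ! -d "$cdr_value" ]; then
            echo "error: $cdr_name='$cdr_value' is not a directory" >&2
            cdr_status=1
        fi
    done
    return "$cdr_status"
}
```

For example, with placeholder variable names (substitute the ones actually defined in env.sh), `check_dataset_roots DTU_TRAINING DTU_TESTING && ./scripts/train_dtu.sh` would start training only when both roots exist.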

Citation

@inproceedings{wei2021aa,
  title={AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network},
  author={Wei, Zizhuang and Zhu, Qingtian and Min, Chen and Chen, Yisong and Wang, Guoping},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={6187--6196},
  year={2021}
}

Acknowledgements

This repository is heavily based on Xiaoyang Guo's PyTorch implementation of MVSNet.

Owner

Qingtian Zhu
