3D-Reconstruction: Monocular Multi-View 3D Reconstruction Based on Deep Learning

Last update: Dec 26, 2022

Related tags

Deep Learning 3D-Reconstruction

Overview

Monocular Multi-View 3D Reconstruction Based on Deep Learning

Part I: 3D Reconstruction

Code: Part1

Technical documentation: [Markdown] [PDF]

Original images: Original Images

Point cloud results: Point Cloud Results-1

Result renderings:

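Part II: Point-Cloud-to-Point-Cloud Window Recognition Based on Computer Vision Methods

Code: Part2

Technical documentation: [Markdown] [PDF]

Point cloud results: Point Cloud Results-2

Algorithm flowchart: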

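Part III: Image-to-Point-Cloud Semantic Segmentation Based on ResNeSt

Code: Part3

Technical documentation: [Markdown] [PDF]

Semantic segmentation results: Semantic Segmentation Results

Point cloud results: Point Cloud Results-3

Result renderings: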

References

AA-RMVSNet [arXiv] [CVF] [PDF]

Wei Z, Zhu Q, Min C, et al. AA-RMVSNet: Adaptive aggregation recurrent multi-view stereo network[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 6187-6196.

Cascade-MVSNet [arXiv] [CVF] [PDF]

Gu X, Fan Z, Zhu S, et al. Cascade cost volume for high-resolution multi-view stereo and stereo matching[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 2495-2504.

TransMVSNet [arXiv] [PDF]

Ding Y, Yuan W, Zhu Q, et al. TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers[J]. arXiv preprint arXiv:2111.14600, 2021.

LoFTR [arXiv] [CVF] [PDF]

Sun J, Shen Z, Wang Y, et al. LoFTR: Detector-free local feature matching with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 8922-8931.

PatchmatchNet [arXiv] [CVF] [PDF]

Wang F, Galliani S, Vogel C, et al. PatchmatchNet: Learned Multi-View Patchmatch Stereo[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 14194-14203.

ResNeSt [arXiv] [PDF]

Zhang H, Wu C, Zhang Z, et al. ResNeSt: Split-attention networks[J]. arXiv preprint arXiv:2004.08955, 2020.

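Acknowledgements

The sparse reconstruction stage uses COLMAP to obtain the camera parameters.

The dense reconstruction code is mainly derived from AA-RMVSNet.

Point cloud cropping and visualization were done with CloudCompare and MeshLab.

Open3D is used for surface reconstruction.

The Cascade+Transformer code is mainly based on kwea123's pytorch-lightning implementation of Cascade-MVSNet and on LoFTR.

Some ideas in the window recognition algorithm draw on Color Space's rectangle detection algorithm; the image processing techniques are mainly based on Gonzalez's Digital Image Processing (3rd edition).

The semantic segmentation part uses PyTorch-Encoding.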

Releases(7)

7 (Feb 16, 2022): White mesh generated by Neus

dongbeiya_neus.ply (11.21 MB)
gym_north_neus.ply (21.28 MB)
gym_south_neus.ply (16.59 MB)

6 (Feb 16, 2022): White mesh generated by Colmap and Meshlab

dongbeiya.ply (19.11 MB)
dongbeiya.png (8.45 MB)
gym_north.ply (31.93 MB)
gym_north.png (8.73 MB)
gym_south.ply (26.97 MB)
gym_south.png (9.32 MB)

5 (Dec 29, 2021): Original images for reconstruction

PIC2.zip (755.68 MB)
PIC2.z01 (900.00 MB)
PIC2.z02 (900.00 MB)
dby.zip (735.16 MB)
dby.z01 (900.00 MB)
dby.z02 (900.00 MB)

4 (Dec 19, 2021): Semantic Segmentation Results of Problem 3

filtered_segmentation_result_dongbeiya.zip (661.17 MB)
filtered_segmentation_result_gym.zip (786.65 MB)
segmentation_result_dongbeiya.zip (64.31 MB)
segmentation_result_dongbeiya_block.zip (53.27 MB)
segmentation_result_gym.zip (4.72 MB)

3 (Dec 19, 2021): Point Cloud Results of Problem 3

2 (Dec 19, 2021): Point Cloud Results of Problem 2

gym_south_window.ply (627.30 MB)
gym_north_window.ply (808.62 MB)
dongbeiya_window.ply (1800.53 MB)
gym_window.ply (1603.31 MB)

1 (Dec 19, 2021): Point Cloud Results of Problem 1

dongbeiya.ply (731.13 MB)
gym_south.ply (696.19 MB)
gym_north.ply (707.89 MB)
gym.ply (1404.08 MB)

Owner

HMT_Curo
