Code repo for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

Last update: Aug 12, 2022

Related tags

Deep Learning InterpretableMDE

Overview

InterpretableMDE

A PyTorch implementation for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

arXiv link: https://arxiv.org/abs/2108.05312

Data and Model

For MFF models, we use the dataset they released here, and you can download their models as the baselines here. For BTS models, they use a different set of NYUv2 training images (24,231 instead of 50,688), and you download it here. We put all of our models here.

Evaluation

In this project we use yacs to manage the configurations. To evaluate the performance of a model, for example, the MFF model with SENet backbone using our assigning method, simply run

python eval.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn]

from the root directory.

To evaluate the depth selectivity, run

python dissect.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn] LAYERS D_MFF ON_TRAINING_DATA True

then get the depth selectivity and the dissection result of each unit. Layers' names are seperated by _.

Training

To train a model from scratch, run

python train.py MODEL_NAME MFF_resnet

We currently provide four options for MODEL_NAME, and the training scheme will automatically be switched to align with the original ones when using BTS models.

Acknowledgement

The model part of our code is adapted from Revisiting_Single_Depth_Estimation and bts. Some snippets are adapted from monodepth2.

Bibtex

@inproceedings{you2021iccv,
 title = {Towards Interpretable Deep Networks for Monocular Depth Estimation},
 author = {Zunzhi You and Yi-Hsuan Tsai and Wei-Chen Chiu and Guanbin Li},
 booktitle = {International Conference on Computer Vision (ICCV)},
 year = {2021}
}

Code repo for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

Related tags

Overview

InterpretableMDE

Data and Model

Evaluation

Training

Acknowledgement

Bibtex

Owner

Zunzhi You

3rd Place Solution for ICCV 2021 Workshop SSLAD Track 3A - Continual Learning Classification Challenge

AttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

Fast SHAP value computation for interpreting tree-based models

A GridMixup augmentation, inspired by GridMask and CutMix

transfer attack; adversarial examples; black-box attack; unrestricted Adversarial Attacks on ImageNet; CVPR2021 天池黑盒竞赛

GULAG: GUessing LAnGuages with neural networks

Unified MultiWOZ evaluation scripts for the context-to-response task.

Simple Tensorflow implementation of "Adaptive Convolutions for Structure-Aware Style Transfer" (CVPR 2021)

TrTr: Visual Tracking with Transformer

Unofficial implementation (replicates paper results!) of MINER: Multiscale Implicit Neural Representations in pytorch-lightning

Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?

Unofficial PyTorch Implementation of AHDRNet (CVPR 2019)

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

Systematic generalisation with group invariant predictions

One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking

CNN Based Meta-Learning for Noisy Image Classification and Template Matching

Implementation of PersonaGPT Dialog Model

Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"

Repository providing a wide range of self-supervised pretrained models for computer vision tasks.