RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth, in ICCV 2021 (oral)

Last update: Dec 15, 2022

Overview

RINDNet

RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth
Mengyang Pu, Yaping Huang, Qingji Guan and Haibin Ling
ICCV 2021 (oral)

Please refer to the supplementary material (code: p86d, ~60 MB) for more results.

Benchmark --- 🔥 🔥 BSDS-RIND 🔥 🔥

BSDS-RIND is the first public benchmark dedicated to studying all four edge types simultaneously: Reflectance Edge (RE), Illumination Edge (IE), Normal Edge (NE) and Depth Edge (DE). It was created by carefully labeling images from BSDS500. The dataset can be downloaded from:

Original images: BSDS500
Our annotations: BSDS-RIND (BaiDuNetdisk, code: e7rg; GoogleDrive)

Abstract

As a fundamental building block in computer vision, edges can be categorised into four types according to the discontinuity in surface-Reflectance, Illumination, surface-Normal or Depth. While great progress has been made in detecting generic or individual types of edges, it remains under-explored to comprehensively study all four edge types together. In this paper, we propose a novel neural network solution, RINDNet, to jointly detect all four types of edges. Taking into consideration the distinct attributes of each type of edges and the relationship between them, RINDNet learns effective representations for each of them and works in three stages. In stage I, RINDNet uses a common backbone to extract features shared by all edges. Then in stage II it branches to prepare discriminative features for each edge type by the corresponding decoder. In stage III, an independent decision head for each type aggregates the features from previous stages to predict the initial results. Additionally, an attention module learns attention maps for all types to capture the underlying relations between them, and these maps are combined with initial results to generate the final edge detection results. For training and evaluation, we construct the first public benchmark, BSDS-RIND, with all four types of edges carefully annotated. In our experiments, RINDNet yields promising results in comparison with state-of-the-art methods.
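The three-stage data flow described in the abstract can be illustrated with a small shape-level sketch. This is not the released RINDNet code: the layers are random per-pixel linear maps standing in for learned convolutional blocks, and the names `rind_flow` and `make_layer`, the channel widths, the softmax over edge types, and the rule for combining attention maps with initial results are all assumptions made for illustration. The abstract only states that the maps "are combined with initial results".

```python
import numpy as np

EDGE_TYPES = ("reflectance", "illumination", "normal", "depth")


def make_layer(rng, in_ch, out_ch):
    """Hypothetical stand-in for a learned block: a random per-pixel linear map."""
    return rng.standard_normal((in_ch, out_ch)) * 0.1


def relu(x):
    return np.maximum(x, 0.0)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def softmax(x, axis=-1):
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def rind_flow(image, seed=0):
    """Return one edge-probability map per edge type for an (H, W, 3) image."""
    rng = np.random.default_rng(seed)

    # Stage I: a common backbone extracts features shared by all edge types.
    shared = relu(image @ make_layer(rng, 3, 16))

    # Attention module: one map per edge type.
    # Assumption: maps are normalised with a softmax across the four types.
    attention = softmax(shared @ make_layer(rng, 16, len(EDGE_TYPES)))

    final = {}
    for k, edge_type in enumerate(EDGE_TYPES):
        # Stage II: a separate decoder prepares features for this edge type.
        decoded = relu(shared @ make_layer(rng, 16, 8))

        # Stage III: an independent decision head aggregates features from
        # the previous stages and predicts the initial result.
        features = np.concatenate([shared, decoded], axis=-1)
        initial = (features @ make_layer(rng, 24, 1))[..., 0]

        # Assumption: the attention map reweights the initial result
        # element-wise before the final sigmoid.
        final[edge_type] = sigmoid(initial * (1.0 + attention[..., k]))
    return final


if __name__ == "__main__":
    maps = rind_flow(np.random.default_rng(1).random((4, 5, 3)))
    for name, prob in maps.items():
        print(name, prob.shape)
```

The sketch keeps the four branches in a dictionary keyed by edge type, so each output map has the same height and width as the input image and can be inspected independently.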

Code and Main results ----- Coming Soon...

Acknowledgments

This work was partially done while Mengyang was at Stony Brook University.
We thank the anonymous reviewers for their valuable and inspiring comments and suggestions.
