[ICCV 2021] FaPN: Feature-aligned Pyramid Network for Dense Image Prediction

Last update: Jul 22, 2022

Overview

FaPN: Feature-aligned Pyramid Network for Dense Image Prediction [arXiv] [Project Page]

@inproceedings{
  huang2021fapn,
  title={{FaPN}: Feature-aligned Pyramid Network for Dense Image Prediction},
  author={Shihua Huang and Zhichao Lu and Ran Cheng and Cheng He},
  booktitle={International Conference on Computer Vision (ICCV)},
  year={2021}
}

Overview

FaPN vs. FPN	Before vs. After Alignment

This project provides the whole official implementation for our ICCV2021 paper "FaPN: Feature-aligned Pyramid Network for Dense Image Prediction" based on Detectron2, PanoticFCN, and MaskFormer. FaPN is a simple yet effective top-down pyramidal architecture to generate multi-scale features for dense image prediction. Comprised of a feature alignment module (FAM) and a feature selection module (FSM), FaPN addresses the issue of feature alignment in the original FPN, leading to substaintial improvements on various dense prediction tasks, such as object detection, semantic, instance, panoptic segmentation, etc.

Installation

This project is based on Detectron2, which can be constructed as follows.

Install Detectron2 following the instructions.
Setup the dataset following the structure.
Install DCNv2 following Install DCNv2.md.

Results

COCO Object Detection

Faster R-CNN + FaPN:

Name	lr sched	box AP	box APs	box APm	box APl	download
R50	1x	39.2	24.5	43.3	49.1	model \| log
R101	3x	42.8	27.0	46.2	54.9	model \| log

Cityscapes Semantic Segmentation

PointRend + FaPN:

Name	lr sched	mask mIoU	mask i_IoU	mask IoU_sup	mask iIoU_sup	download
R50	1x	80.0	61.3	90.6	78.5	model \| log
R101	1x	80.1	62.2	90.8	78.6	model \| log

ADE20K-150 Semantic Segmentation

MaskFormer + FaPN:

Name	mIoU Single-Scale	mIoU Multi-Scale	download
Swin+Large+IN21K	55.2	56.7	model \| log

COCOStuff-10K Semantic Segmentation

MaskFormer + FaPN:

Name	mIoU Single-Scale	mIoU Multi-Scale	download
R101	39.6	40.6	model \| log

COCO Instance Segmentation

Mask R-CNN + FaPN:

Name	lr sched	mask AP	mask APs	box AP	box APs	download
R50	1x	36.4	18.1	39.8	24.3	model \| log
R101	3x	39.4	20.9	43.8	27.4	model \| log

PointRend + FaPN:

Name	lr sched	mask AP	mask APs	box AP	box APs	download
R50	1x	37.6	18.6	39.4	24.2	model \| log

COCO Panoptic Segmentation

PanopticFPN + FaPN:

Name	lr sched	PQ	mask mIoU	St PQ	box AP	Th PQ	download
R50	1x	41.1	43.4	32.5	38.7	46.9	model \| log
R101	3x	44.2	45.7	35.0	43.0	53.3	model \| log

PanopticFCN + FaPN:

Name	lr sched	PQ	mask mIoU	St PQ	box AP	Th PQ	download
R50	1x	41.8	42.0	33.1	32.3	47.6	model \| log
R50-600	3x	43.5	43.5	35.1	34.5	49.0	model \| log

[ICCV 2021] FaPN: Feature-aligned Pyramid Network for Dense Image Prediction

Related tags

Overview

FaPN: Feature-aligned Pyramid Network for Dense Image Prediction [arXiv] [Project Page]

Overview

Installation

Results

COCO Object Detection

Faster R-CNN + FaPN:

Cityscapes Semantic Segmentation

PointRend + FaPN:

ADE20K-150 Semantic Segmentation

MaskFormer + FaPN:

COCOStuff-10K Semantic Segmentation

MaskFormer + FaPN:

COCO Instance Segmentation

Mask R-CNN + FaPN:

PointRend + FaPN:

COCO Panoptic Segmentation

PanopticFPN + FaPN:

PanopticFCN + FaPN:

Owner

Shihua Huang

This is the official source code of "BiCAT: Bi-Chronological Augmentation of Transformer for Sequential Recommendation".

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

HyperaPy: An automatic hyperparameter optimization framework ⚡🚀

This is a beginner-friendly repo to make a collection of some unique and awesome projects. Everyone in the community can benefit & get inspired by the amazing projects present over here.

Pytorch implementation of FlowNet by Dosovitskiy et al.

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

coldcuts is an R package to automatically generate and plot segmentation drawings in R

Tensorflow Implementation of SMU: SMOOTH ACTIVATION FUNCTION FOR DEEP NETWORKS USING SMOOTHING MAXIMUM TECHNIQUE

Code for the paper titled "Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks" (NeurIPS 2021 Spotlight).

Unoffical implementation about Image Super-Resolution via Iterative Refinement by Pytorch

Model parallel transformers in Jax and Haiku

Disease Informed Neural Networks (DINNs) — neural networks capable of learning how diseases spread, forecasting their progression, and finding their unique parameters (e.g. death rate).

pyhsmm - library for approximate unsupervised inference in Bayesian Hidden Markov Models (HMMs) and explicit-duration Hidden semi-Markov Models (HSMMs), focusing on the Bayesian Nonparametric extensions, the HDP-HMM and HDP-HSMM, mostly with weak-limit approximations.

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

Lightweight Python library for adding real-time object tracking to any detector.

Learning Optical Flow from a Few Matches (CVPR 2021)

Out-of-Distribution Generalization of Chest X-ray Using Risk Extrapolation

Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness

Codebase for ECCV18 "The Sound of Pixels"

🤗 Push your spaCy pipelines to the Hugging Face Hub