Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"

Last update: Nov 11, 2022

Overview

NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination

The official implementation of "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination", published at ACM MM 2020.

We propose the Nearby Objects Hallucinator (NOH), which pinpoints the objects near each proposal with a Gaussian distribution, together with NOH-NMS, which dynamically eases suppression in regions that are likely to contain other objects.
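As a rough illustration of the idea, the sketch below shows a greedy NMS loop in which a highly overlapping box is spared when its center falls where the kept box's Gaussian predicts a neighbour. It is a simplified, hard-threshold sketch and not the implementation in this repo: the function names, the isotropic Gaussian, the per-box `nearby_means` / `nearby_sigmas` inputs, and the `relax_thresh` parameter are all assumptions made for illustration.

```python
import math


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def center(box):
    """Center point (cx, cy) of a box."""
    return ((box[0] + box[2]) / 2.0, (box[1] + box[3]) / 2.0)


def gaussian_score(point, mean, sigma):
    """Unnormalised isotropic Gaussian score; equals 1.0 when point == mean."""
    d2 = (point[0] - mean[0]) ** 2 + (point[1] - mean[1]) ** 2
    return math.exp(-d2 / (2.0 * sigma ** 2))


def relaxed_nms(boxes, scores, nearby_means, nearby_sigmas,
                iou_thresh=0.5, relax_thresh=0.5):
    """Greedy NMS that spares overlapping boxes near a predicted neighbour.

    nearby_means[k] / nearby_sigmas[k] are hypothetical per-box outputs that
    describe where box k expects a nearby object (assumed inputs).
    Returns the indices of the kept boxes, highest score first.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        suppressed = False
        for k in keep:
            if iou(boxes[i], boxes[k]) <= iou_thresh:
                continue  # low overlap: box k does not suppress box i
            # High overlap: plain NMS would drop box i here. Spare it only
            # if it sits where kept box k expects a neighbouring object.
            likelihood = gaussian_score(center(boxes[i]),
                                        nearby_means[k], nearby_sigmas[k])
            if likelihood < relax_thresh:
                suppressed = True
                break
        if not suppressed:
            keep.append(i)
    return keep


# Box 1 overlaps box 0 but lies exactly where box 0 expects a neighbour;
# box 2 is a near-duplicate of box 0 and lies far from that location.
boxes = [(0, 0, 10, 20), (3, 0, 13, 20), (0.5, 0, 10.5, 20)]
scores = [0.9, 0.8, 0.7]
means = [(8.0, 10.0), (100.0, 100.0), (100.0, 100.0)]
sigmas = [1.0, 1.0, 1.0]
kept = relaxed_nms(boxes, scores, means, sigmas)
```

With these toy inputs the near-duplicate is removed while the overlapping neighbour survives, which is the behaviour a fixed IoU threshold cannot provide in crowded scenes.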

This work won first place in the CrowdHuman Challenge 2020.

This repo is implemented based on detectron2.

Performance

Model	Backbone	AP (%)	Recall (%)	MR (%, lower is better)	Weights
Faster RCNN	ResNet-50	85.0	87.5	44.5	faster_rcnn_model_final.pth
NOH-NMS	ResNet-50	88.8	92.6	43.7	noh_nms_model_final.pth

Prepare Datasets

Download the CrowdHuman dataset from http://www.crowdhuman.org/ and arrange the files in the following directory layout:

./data/crowdhuman
├── annotations
│   ├── annotation_train.odgt
│   └── annotation_val.odgt
└── images
    ├── train
    └── val
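Before training, it can help to confirm that the layout above is in place and that the annotation files are readable. The sketch below is one way to do that, assuming each `.odgt` file stores one JSON object per line; the helper names `missing_entries` and `load_odgt` are hypothetical and are not part of this repo.

```python
import json
from pathlib import Path

# Paths expected under the dataset root, taken from the layout above.
EXPECTED_ENTRIES = [
    "annotations/annotation_train.odgt",
    "annotations/annotation_val.odgt",
    "images/train",
    "images/val",
]


def missing_entries(root):
    """Return the expected files or directories that do not exist under root."""
    root = Path(root)
    return [rel for rel in EXPECTED_ENTRIES if not (root / rel).exists()]


def load_odgt(path):
    """Read an .odgt file, assuming one JSON object per non-empty line."""
    records = []
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records
```

For example, `missing_entries("./data/crowdhuman")` returns an empty list when the dataset is arranged as shown, and `load_odgt` returns one dictionary per annotated image.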

Installation

  cd detectron2
  pip install -e .
  # or rebuild
  sh build.sh

Quick Start

See GETTING_STARTED.md in the detectron2 directory.

Acknowledgement

detectron2

Citation

If you find this project useful for your research, please cite:

@inproceedings{zhou2020noh,
  title={NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination},
  author={Zhou, Penghao and Zhou, Chong and Peng, Pai and Du, Junlong and Sun, Xing and Guo, Xiaowei and Huang, Feiyue},
  booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
  pages={1967--1975},
  year={2020}
}

Owner

Tencent YouTu Research
