Weakly-supervised object detection.

Last update: Jan 05, 2023

Overview

Wetectron

Wetectron is a software system that implements state-of-the-art weakly-supervised object detection algorithms.

Project CVPR'20, ECCV'20 | Paper CVPR'20, ECCV'20

Installation

Check INSTALL.md for installation instructions.

Partial labels

The simulated partial labels (points and scribbles) of COCO can be found at Google-drive or Dropbox.

Please check tools/vis_partial_labels.ipynb for a visualization example.

Model zoo

Check MODEL_ZOO.md for detailed instructions.

Getting started

Check GETTING_STARTED for detailed instrunctions.

New dataset

If you want to run on your own dataset or use other pre-computed proposals (e.g., Edge Boxes), please check USE_YOUR_OWN_DATA for some tips.

Misc

Please also check the documentation of maskrcnn-benchmark for things like abstractions and troubleshooting. If your issues are not present there, feel free to open a new issue.

Todo:

Sequential back-prop and ResNet models.

Citations

Please consider citing following papers in your publications if they help your research.

@inproceedings{ren-cvpr020,
  title = {Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection},
  author = {Zhongzheng Ren and Zhiding Yu and Xiaodong Yang and Ming-Yu Liu and Yong Jae Lee and Alexander G. Schwing and Jan Kautz},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2020}
}

@inproceedings{ren-eccv2020,
  title = {UFO$^2$: A Unified Framework towards Omni-supervised Object Detection},
  author = {Zhongzheng Ren and Zhiding Yu and Xiaodong Yang and Ming-Yu Liu and Alexander G. Schwing and Jan Kautz},
  booktitle = {European Conference on Computer Vision (ECCV)},
  year = {2020}
}

License

This code is released under the Nvidia Source Code License.

This project is built upon maskrcnn-benchmark, which is released under MIT License.

Weakly-supervised object detection.

Related tags

Overview

Wetectron

Project CVPR'20, ECCV'20 | Paper CVPR'20, ECCV'20

Installation

Partial labels

Model zoo

Getting started

New dataset

Misc

Todo:

Citations

License

Owner

NVIDIA Research Projects

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification

Language-Agnostic Website Embedding and Classification

High-resolution networks and Segmentation Transformer for Semantic Segmentation

PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)

LAnguage Model Analysis

A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite Imagery

An Intelligent Self-driving Truck System For Highway Transportation

PyTorch module to use OpenFace's nn4.small2.v1.t7 model

DTCN SMP Challenge - Sequential prediction learning framework and algorithm

Code for "3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop"

Keras like implementation of Deep Learning architectures from scratch using numpy.

CAMoE + Dual SoftMax Loss (DSL): Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss

AIR^2 for Interaction Prediction

Tensorflow implementation of Semi-supervised Sequence Learning (https://arxiv.org/abs/1511.01432)

Python3 / PyTorch implementation of the following paper: Fine-grained Semantics-aware Representation Enhancement for Self-supervisedMonocular Depth Estimation. ICCV 2021 (oral)

PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

Fit Fast, Explain Fast

LETR: Line Segment Detection Using Transformers without Edges

CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer

Pytorch implementation for "Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion" (NeurIPS 2021)