MMSceneGraph

Introduction

MMSceneneGraph is an open source code hub for scene graph generation as well as supporting downstream tasks based on the scene graph on PyTorch. The frontend object detector is supported by open-mmlab/mmdetection.

Major features

Modular design

We decompose the framework into different components and one can easily construct a customized scene graph generation framework by combining different modules.
Support of multiple frameworks out of box

The toolbox directly supports popular and contemporary detection frameworks, e.g. Faster RCNN, Mask RCNN, etc.
Visualization support

The visualization of the groundtruth/predicted scene graph is integrated into the toolbox.

License

This project is released under the MIT license.

Changelog

Please refer to CHANGELOG.md for details.

Benchmark and model zoo

The original object detection results and models provided by mmdetection are available in the model zoo. The models for the scene graph generation are temporarily unavailable yet.

Supported methods and Datasets

Supported SGG (VRD) methods:

Supported saliency object detection methods:

R3Net (IJCAI'2018)
SCRN (ICCV'2019)

Supported image captioning methods:

bottom-up (CVPR'2018)
XLAN (CVPR'2020)

Supported datasets:

Visual Genome: VG150 (CVPR'2017)
VRD (ECCV'2016)
Visual Genome: VG200/VG-KR (ours)
MSCOCO (for object detection, image caption)
RelCap (from VG and COCO, ours)

Installation

As our project is built on mmdetection 1.x (which is a bit different from their current master version 2.x), please refer to INSTALL.md. If you want to use mmdetection 2.x, please refer to mmdetection/get_start.md.

Getting Started

Please refer to GETTING_STARTED.md for using the projects. We will update it constantly.

Acknowledgement

We appreciate the contributors of the mmdetection project and Scene-Graph-Benchmark.pytorch which inspires our design.

Citation

If you find this code hub or our works useful in your research works, please consider citing:

@inproceedings{wang2021topic,
  title={Topic Scene Graph Generation by Attention Distillation from Caption},
  author={Wang, Wenbin and Wang, Ruiping and Chen, Xilin},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  pages={15900--15910},
  month = {October},
  year={2021}
}


@inproceedings{wang2020sketching,
  title={Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation},
  author={Wang, Wenbin and Wang, Ruiping and Shan, Shiguang and Chen, Xilin},
  booktitle={Proceedings of European Conference on Computer Vision (ECCV)},
  pages={222--239},
  year={2020},
  volume={12358},
  doi={10.1007/978-3-030-58601-0_14},
  publisher={Springer}
}

@InProceedings{Wang_2019_CVPR,
author = {Wang, Wenbin and Wang, Ruiping and Shan, Shiguang and Chen, Xilin},
title = {Exploring Context and Visual Pattern of Relationship for Scene Graph Generation},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
pages = {8188-8197},
month = {June},
address = {Long Beach, California, USA},
doi = {10.1109/CVPR.2019.00838},
year = {2019}
}

A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph generation to downstream tasks (e.g., image cpationing) is supported. Pytorch version implementation of HetH (ECCV 2020) and TopicSG (ICCV 2021) is included.

Related tags

Overview

MMSceneGraph

Introduction

Major features

License

Changelog

Benchmark and model zoo

Supported methods and Datasets

Installation

Getting Started

Acknowledgement

Citation

Owner

Kenneth-Wong

This code is for our paper "VTGAN: Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers"

A simple python module to generate anchor (aka default/prior) boxes for object detection tasks.

TGS Salt Identification Challenge

This repo is developed for Strong Baseline For Vehicle Re-Identification in Track 2 Ai-City-2021 Challenges

Hitters Linear Regression - Hitters Linear Regression With Python

Official implementation of the paper Chunked Autoregressive GAN for Conditional Waveform Synthesis

[ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Chenyu You, Xiaohui Xie, Zhangyang Wang

All the essential resources and template code needed to understand and practice data structures and algorithms in python with few small projects to demonstrate their practical application.

A Python library for differentiable optimal control on accelerators.

Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.

Patch Rotation: A Self-Supervised Auxiliary Task for Robustness and Accuracy of Supervised Models

Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper

TLDR; Train custom adaptive filter optimizers without hand tuning or extra labels.

Diffusion Normalizing Flow (DiffFlow) Neurips2021

FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification

Block-wisely Supervised Neural Architecture Search with Knowledge Distillation (CVPR 2020)

When BERT Plays the Lottery, All Tickets Are Winning

《Towards High Fidelity Face Relighting with Realistic Shadows》(CVPR 2021)

Learning multiple gaits of quadruped robot using hierarchical reinforcement learning

Tensorflow 2 implementation of the paper: Learning and Evaluating Representations for Deep One-class Classification published at ICLR 2021