This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

Last update: Dec 20, 2022

Related tags

Deep Learning SG2HOI

Overview

SG2HOI

This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

Installation

Pytorch 1.7.1

$ conda install --yes -c pytorch pytorch=1.7.1 torchvision cudatoolkit=11.0
$ pip install tdqm sklearn panda Pillow

maskrcnn

Check INSTALL.md to install maskrcnn. Then, adding the maskrcnn lib to your $PYTHONPATH, because our code uses the ROIAlign layer to extract the roi features.

Apex

If you want to use multiple gpus to train the model, you have to follow the instructions to install apex.

Datasets

HOI datasets

We use the off-the-shell object detection results of V-COCO and HICO from VSGnet, which can be downloaded from here.

Scene graph datasets

The scene graph prediction results are generated by TDE. Note that we use all the training and testing images of Visual Genome to train the SG model. Our pre-trained TDE model can be downloaded from here.

Training and testing

$ python main.py --gpu_id 0 --learning_rate 0.01 --batch_size 5 --num_epochs 50

Citations

If you find this project helps your research, please kindly consider citing our papers in your publications.

@InProceedings{he2021exploiting,
    author    = {He, Tao and Gao, Lianli and Song, Jingkuan and Li, Yuan-Fang},
    title     = {Exploiting Scene Graphs for Human-Object Interaction Detection},
    booktitle = {International Conference on Computer Vision(ICCV)},
    year      = {2021},
    url       = {https://arxiv.org/pdf/2108.08584}
}

Acknowledgement

This repository is developed on top of the other two projects: TDE by KaihuaTang and VSGnet by ASMIftekhar.

This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

Related tags

Overview

SG2HOI

Installation

Pytorch 1.7.1

maskrcnn

Apex

Datasets

HOI datasets

Scene graph datasets

Training and testing

Citations

Acknowledgement

Owner

HT

DL course co-developed by YSDA, HSE and Skoltech

Simple Dynamic Batching Inference

https://arxiv.org/abs/2102.11005

Dynamic hair modeling from monocular videos using deep neural networks

[ICCV' 21] "Unsupervised Point Cloud Pre-training via Occlusion Completion"

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

Starter Code for VALUE benchmark

A pytorch implementation of faster RCNN detection framework (Use detectron2, it's a masterpiece)

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

Sharing of contents on mitochondrial encounter networks

Tensorflow implementation of soft-attention mechanism for video caption generation.

KIND: an Italian Multi-Domain Dataset for Named Entity Recognition

Multi-tool reverse engineering collaboration solution.

(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

Leveraging OpenAI's Codex to solve cornerstone problems in Music

DeepFaceEditing: Deep Face Generation and Editing with Disentangled Geometry and Appearance Control

This YoloV5 based model is fit to detect people and different types of land vehicles, and displaying their density on a fitted map, according to their coordinates and detected labels.

Utility code for use with PyXLL

Python Multi-Agent Reinforcement Learning framework

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".