Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Last update: Dec 30, 2022

Related tags

Deep Learning yolo_slowfast

Overview

Yolov5+SlowFast: Realtime Action Detection

A realtime action detection frame work based on PytorchVideo.

Here are some details about our modification:

we choose yolov5 as an object detector instead of detectron2, it is faster and more convenient
we use a tracker(deepsort) to allocate action labels to all objects(with same ids) in different frames
our processing speed reached 24.2 FPS at 30 inference barch size (on a single RTX 2080Ti GPU)

Relevant infomation: FAIR/PytorchVideo; Ultralytics/Yolov5

Demo comparison betwween original(<-left) and ours(->right).

Installation

create a new python environment:
```
conda create -n env_name python=3.7.11
```
install requiments:
```
pip install -r requirements.txt
```
download weights file(ckpt.t7) from [deepsort] to this folder:
```
./deep_sort/deep_sort/deep/checkpoint/
```
test on your video:
```
python yolo_slowfast.py --input {path to your video}
```
The first time to execute this command may take some times to download the yolov5 code and it's weights file from torch.hub, keep your network connected.

References

Thanks for these great works:

[1] Ultralytics/Yolov5

[2] ZQPei/deepsort

[3] FAIR/PytorchVideo

[2] AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions. paper

[3] SlowFast Networks for Video Recognition. paper

Citation

If you find our work useful, please cite as follow:

{   yolo_slowfast,
    author = {Wu Fan},
    title = { A realtime action detection frame work based on PytorchVideo},
    year = {2021},
    url = {\url{https://github.com/wufan-tb/gmm_dae}}
}

Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Related tags

Overview

Yolov5+SlowFast: Realtime Action Detection

A realtime action detection frame work based on PytorchVideo.

Here are some details about our modification:

Demo comparison betwween original(<-left) and ours(->right).

Installation

References

Citation

Owner

WuFan

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging

basic tutorial on pytorch

CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation (ACMMM'21 Oral Paper)

Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

This repo in the implementation of EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina, Anton Osokin

PyTorch implementation of hand mesh reconstruction described in CMR and MobRecon.

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR2021)

Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"

Context-Aware Image Matting for Simultaneous Foreground and Alpha Estimation

The ARCA23K baseline system

Sandbox for training deep learning networks

Stacs-ci - A set of modules to enable integration of STACS with commonly used CI / CD systems

Torch-based tool for quantizing high-dimensional vectors using additive codebooks

High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.

PyTorch implementation of the Crafting Better Contrastive Views for Siamese Representation Learning

Tracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)

A PyTorch implementation of "Semi-Supervised Graph Classification: A Hierarchical Graph Perspective" (WWW 2019)

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

CodeContests is a competitive programming dataset for machine-learning

Offline Reinforcement Learning with Implicit Q-Learning