TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

Last update: Dec 26, 2022

Related tags

Overview

TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

Timely handgun detection is a crucial problem to improve public safety; nevertheless, the effectiveness of many surveillance systems still depends of finite human attention. Much of the previous research on handgun detection is based on static image detectors, leaving aside valuable temporal information that could be used to improve object detection in videos. To improve the performance of surveillance systems, a real-time temporal handgun detection system should be built. Using Temporal Yolov5, an architecture based on Quasi-Recurrent Neural Networks, temporal information is extracted from video to improve the results of handgun detection. Moreover, two publicly available datasets are proposed, labeled with hands, guns, and phones. One containing 2199 static images to train static detectors, and another with 5960 frames of videos to train temporal modules. Additionally, we explore two temporal data augmentation techniques based on Mosaic and Mixup. The resulting systems are three temporal architectures: one focused in reducing inference with a mAP50:95 of 55.9, another in having a good balance between inference and accuracy with a mAP50:95 of 59, and a last one specialized in accuracy with a mAP50:95 of 60.2. Temporal Yolov5 achieves real-time detection in the small and medium architectures. Moreover, it takes advantage of temporal features contained in videos to perform better than Yolov5 in our temporal dataset, making TYolov5 suitable for real-world applications.

If you use this code for your research, please consider citing:

Mario Alberto Duran-Vega, Miguel Gonzalez-Mendoza, Leonardo Chang, Cuauhtemoc Daniel Suarez-Ramirez https://arxiv.org/abs/2111.08867

TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

Related tags

Overview

TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

If you use this code for your research, please consider citing:

Owner

Mario Duran-Vega

Laplace Redux -- Effortless Bayesian Deep Learning

Official implementation of "Motif-based Graph Self-Supervised Learning forMolecular Property Prediction"

Accelerate Neural Net Training by Progressively Freezing Layers

The official repository for "Revealing unforeseen diagnostic image features with deep learning by detecting cardiovascular diseases from apical four-chamber ultrasounds"

A deep learning network built with TensorFlow and Keras to classify gender and estimate age.

Tensorflow Tutorials using Jupyter Notebook

Group project for MFIN7036. Our goal is to predict firm profitability with text-based competition measures.

[ICLR 2021] Is Attention Better Than Matrix Decomposition?

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences forImage-Text Retrieval

This is a classifier which basically predicts whether there is a gun law in a state or not, depending on various things like murder rates etc.

Pytorch implementation of face attention network

PyTorch implementation for Graph Contrastive Learning with Augmentations

Python library containing BART query generation and BERT-based Siamese models for neural retrieval.

Collective Multi-type Entity Alignment Between Knowledge Graphs (WWW'20)

Machine Learning University: Accelerated Computer Vision Class

SPCL: A New Framework for Domain Adaptive Semantic Segmentation via Semantic Prototype-based Contrastive Learning

gym-anm is a framework for designing reinforcement learning (RL) environments that model Active Network Management (ANM) tasks in electricity distribution networks.

A list of all papers and resoureces on Semantic Segmentation

Learning to Adapt Structured Output Space for Semantic Segmentation, CVPR 2018 (spotlight)

Towards Boosting the Accuracy of Non-Latin Scene Text Recognition