Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Last update: Jun 05, 2022

Related tags

Overview

👁️ Hindsight AI: Crime Classification With Clip

About

For Educational Purposes Only This is a recursive neural net trained to classify specific crime classes based on the UCF-Crime dataset UCF-CRIME or to perform general anomaly detection. The model uses images that have been encoded into the CLIP image embedding space.

Introducing CLIP

The model we are utilizing in our application, CLIP (developed by OpenAI), is a generalized image classification model which can take any image and produce word embeddings for the purpose of matching raw text strings to the contents of the image. The design and training of the model allows for high zero-shot performance in classifying images (i.e. image classification problems outside of the training set). The following image provides a summary of the model (taken from A. Radford et al.):

While typical image classification models train an image feature extractor and a linear classifier to predict a label, CLIP trains an image encoder and text encoder to predict the correct pairings of a batch of (image, text) training examples. At test time the learned text encoder synthesizes a zero-shot linear classifier by embedding the names or descriptions of the target dataset’s classes.

Installation

Clone the repo and the required packages can be found in the required.txt file. Running classifier.py will start an interactive application that will attempt to perform anomaly detection or multi-class classification on videos found in the 'Videos' directory.

The scripts that were used to create the image sequence database from the video files of the UCF-Crime dataset as well as the training scripts and models can be found in the src directory.

Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Related tags

Overview

👁️ Hindsight AI: Crime Classification With Clip

About

Introducing CLIP

Installation

Owner

Miles Tweed

Python Jupyter kernel using Poetry for reproducible notebooks

Official implementation of the paper Chunked Autoregressive GAN for Conditional Waveform Synthesis

This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

auto-tuning momentum SGD optimizer

A method to perform unsupervised cross-region adaptation of crop classifiers trained with satellite image time series.

Benchmark for Answering Existential First Order Queries with Single Free Variable

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

This is an example of object detection on Micro bacterium tuberculosis using Mask-RCNN

Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference

Official implementation of NeurIPS 2021 paper "Contextual Similarity Aggregation with Self-attention for Visual Re-ranking"

Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"

PyTorch implementation of UPFlow (unsupervised optical flow learning)

Sample Prior Guided Robust Model Learning to Suppress Noisy Labels

using yolox+deepsort for object-tracker

Unofficial implementation of the Involution operation from CVPR 2021

Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch

PyTorch Implementation of PIXOR: Real-time 3D Object Detection from Point Clouds

Avalanche RL: an End-to-End Library for Continual Reinforcement Learning

Knowledge Management for Humans using Machine Learning & Tags

Complete system for facial identity system. Include one-shot model, database operation, features visualization, monitoring