Single Shot Text Detector with Regional Attention

Introduction

SSTD is initially described in our ICCV 2017 spotlight paper.

A third-party implementation of SSTD + Focal Loss. Thanks, Ho taek Han

If you find it useful in your research, please consider citing:

@inproceedings{panhe17singleshot,
      Title   = {Single Shot Text Detector with Regional Attention},
      Author  = {He, Pan and Huang, Weilin and He, Tong and Zhu, Qile and Qiao, Yu and Li, Xiaolin},
      Note    = {Proceedings of Internatioanl Conference on Computer Vision (ICCV)},
      Year    = {2017}
      }
@inproceedings{panhe16readText,
      Title   = {Reading Scene Text in Deep Convolutional Sequences},
      Author  = {He, Pan and Huang, Weilin and Qiao, Yu and Loy, Chen Change and Tang, Xiaoou},
      Note    = {Proceedings of AAAI Conference on Artificial Intelligence, (AAAI)},
      Year    = {2016}
      }
@inproceedings{liu16ssd,
      Title   = {{SSD}: Single Shot MultiBox Detector},
      Author  = {Liu, Wei and Anguelov, Dragomir and Erhan, Dumitru and Szegedy, Christian and Reed, Scott and Fu, Cheng-Yang and Berg, Alexander C.},
      Note    = {Proceedings of European Conference on Computer Vision (ECCV)},
      Year    = {2016}
      }

Installation

Get the code. We will call the directory that you cloned Caffe into $CAFFE_ROOT

git clone https://github.com/BestSonny/SSTD.git
cd SSTD

Build the code. Please follow Caffe instruction to install all necessary packages and build it.

# Modify Makefile.config according to your Caffe installation.
cp Makefile.config.example Makefile.config
make -j8
# Make sure to include $CAFFE_ROOT/python to your PYTHONPATH.
make py
make test -j8
# (Optional)
make runtest -j8
# build nms
cd examples/text
make
cd ..

Run the demo code. Download Model google drive, baiduyun and put it in text/model folder

cd examples
sh text/download.sh
mkdir text/result
python text/demo_test.py

Single Shot Text Detector with Regional Attention

Related tags

Overview

Single Shot Text Detector with Regional Attention

Introduction

Installation

Owner

Pan He

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

The first open-source library that detects the font of a text in a image.

An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".

Rest API Written In Python To Classify NSFW Images.

Brief idea about our project is mentioned in project presentation file.

A bot that extract text from images using the Tesseract OCR.

Fatigue Driving Detection Based on Dlib

Deskew is a command line tool for deskewing scanned text documents. It uses Hough transform to detect "text lines" in the image. As an output, you get an image rotated so that the lines are horizontal.

Image Recognition Model Generator

This is the code for our paper DAAIN: Detection of Anomalous and AdversarialInput using Normalizing Flows

Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

✌️Using this you can control your PC/Laptop volume by Hand Gestures created with Python.

A curated list of resources dedicated to scene text localization and recognition

Neural search engine for AI papers

color detection using python

7th place solution

Computer vision applications project (Flask and OpenCV)

SemTorch

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

A curated list of promising OCR resources