End-to-end pipeline for real-time scene text detection and recognition.

Last update: Aug 04, 2022

Overview

Real-time-Scene-Text-Detection-and-Recognition-System

End-to-end pipeline for real-time scene text detection and recognition.

The detection model use the EAST, the recognition model use the crnn.

Download trained EAST model from https://drive.google.com/file/d/0B3APw5BZJ67ETHNPaU9xUkVoV0U/view and put it in EAST/result.

Download trained crnn model from https://www.dropbox.com/s/dboqjk20qjkpta3/crnn.pth?dl=0 and put it in crnn/samples.

Screenshot

Video

Here is the full video of the real time system: https://youtu.be/1zvteqaU4h4

Owner

Fangneng Zhan

Computer Vision, Deep Learning.

GitHub Repository

OCR, Object Detection, Number Plate, Real Time

README.md PrePareded anaconda env requirements.txt clova AI → deep text recognition → trained weights (ex, .pth) wpod-net weights (ex, .h5 , .json) ht

7 Dec 06, 2022

Solution for Problem 1 by team codesquad for AIDL 2020. Uses ML Kit for OCR and OpenCV for image processing

CodeSquad PS1 Solution for Problem Statement 1 for AIDL 2020 conducted by @unifynd technologies. Problem Given images of bills/invoices, the task was

111 Nov 27, 2022

Captcha Recognition

The objective of this project is to recognize the target numbers in the captcha images correctly which would tell us how good or bad a captcha system has been built.

5 Feb 20, 2022

Characterizing possible failure modes in physics-informed neural networks.

Characterizing possible failure modes in physics-informed neural networks This repository contains the PyTorch source code for the experiments in the

55 Jan 02, 2023

computer vision, image processing and machine learning on the web browser or node.

Image processing and Machine learning labs computer vision, image processing and machine learning on the web browser or node note Fast Fourier Trans

487 Nov 11, 2022

This repository provides train＆test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking.

SCUT-CTW1500 Datasets We have updated annotations for both train and test set. Train: 1000 images [images][annos] Additional point annotation for each

600 Dec 18, 2022

Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

GTA-5-Lane-detection Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and

4 Aug 01, 2021

The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

3D Human Pose Estimation with Spatial and Temporal Transformers This repo is the official implementation for 3D Human Pose Estimation with Spatial and

363 Dec 28, 2022

Automatically resolve RidderMaster based on TensorFlow & OpenCV

AutoRiddleMaster Automatically resolve RidderMaster based on TensorFlow & OpenCV 基于 TensorFlow 和 OpenCV 实现的全自动化解御迷士小马谜题 Demo How to use Deploy the ser

5 Nov 19, 2021

A document scanner application for laptops/desktops developed using python, Tkinter and OpenCV.

DcoumentScanner A document scanner application for laptops/desktops developed using python, Tkinter and OpenCV. Directly install the .exe file to inst

1 Oct 29, 2021

Code release for our paper, "SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo"

SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo Thomas Kollar, Michael Laskey, Kevin Stone, Brijen Thananjeyan

68 Dec 14, 2022

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

Overview This collection demonstrates how to construct and train a deep, bidirectional stacked LSTM using CNN features as input with CTC loss to perfo

489 Dec 21, 2022

Shape Detection - It's a shape detection project with OpenCV and Python.

Shape Detection It's a shape detection project with OpenCV and Python. Setup pip install opencv-python for doing AI things. pip install simpleaudio fo

1 Nov 26, 2022

OCR powered screen-capture tool to capture information instead of images

NormCap OCR powered screen-capture tool to capture information instead of images. Links: Repo | PyPi | Releases | Changelog | FAQs Content: Quickstart

575 Dec 31, 2022

A Vietnamese personal card OCR website built with Django.

Django VietCardOCR Installation Creation of virtual environments is done by executing the command venv: python -m venv venv That will create a new fol

4 Sep 04, 2021

🖺 OCR using tensorflow with attention

tensorflow-ocr 🖺 OCR using tensorflow with attention, batteries included Installation git clone --recursive http://github.com/pannous/tensorflow-ocr

646 Nov 11, 2022

Document blur detection based on Laplacian operator and text detection.

Document Blur Detection For general blurred image, using the variance of Laplacian operator is a good solution. But as for the blur detection of docum

5 Oct 20, 2022

Comparison-of-OCR (KerasOCR, PyTesseract,EasyOCR)

Optical Character Recognition OCR (Optical Character Recognition) is a technology that enables the conversion of document types such as scanned paper

21 Dec 25, 2022

A simple document layout analysis using Python-OpenCV

Run the application: python main.py *Note: For first time running the application, create a folder named "output". The application is a simple documen

109 Dec 12, 2022

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

hocr-tools About About the code Installation System-wide with pip System-wide from source virtualenv Available Programs hocr-check -- check the hOCR f

285 Dec 08, 2022

End-to-end pipeline for real-time scene text detection and recognition.

Related tags

Overview

Real-time-Scene-Text-Detection-and-Recognition-System

Screenshot

Video

Owner

Fangneng Zhan

OCR, Object Detection, Number Plate, Real Time

Solution for Problem 1 by team codesquad for AIDL 2020. Uses ML Kit for OCR and OpenCV for image processing

Captcha Recognition

Characterizing possible failure modes in physics-informed neural networks.

computer vision, image processing and machine learning on the web browser or node.

This repository provides train＆test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking.

Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

Automatically resolve RidderMaster based on TensorFlow & OpenCV

A document scanner application for laptops/desktops developed using python, Tkinter and OpenCV.

Code release for our paper, "SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo"

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

Shape Detection - It's a shape detection project with OpenCV and Python.

OCR powered screen-capture tool to capture information instead of images

A Vietnamese personal card OCR website built with Django.

🖺 OCR using tensorflow with attention

Document blur detection based on Laplacian operator and text detection.

Comparison-of-OCR (KerasOCR, PyTesseract,EasyOCR)

A simple document layout analysis using Python-OpenCV

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.