PyTorch implementation of PSEnet with Pyramid Attention Network as feature extractor

Last update: Oct 10, 2022

Overview

Scene Text-Spotting based on PSEnet+CRNN

PyTorch implementation of an end-to-end text spotter with a PSEnet text detector and a CRNN text recognizer. We plan to grow this repository into an open research platform for multilingual text detection and recognition in natural scene images, targeted at low-resource languages.
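The detector-then-recognizer flow described above can be sketched roughly as follows. This is an illustrative outline only, not the code in this repository: the function name `spot_text` and the `detect` and `recognize` callables are hypothetical stand-ins for the PSEnet and CRNN models, and axis-aligned boxes are assumed for simplicity even though PSEnet-style detectors generally produce arbitrarily shaped regions.

```python
def spot_text(image, detect, recognize):
    """Run detection, crop each detected region, and recognize its text.

    image:     2-D list of pixel rows (a stand-in for a real image array)
    detect:    callable mapping an image to a list of (x0, y0, x1, y1) boxes
    recognize: callable mapping a cropped image to a string
    """
    results = []
    for (x0, y0, x1, y1) in detect(image):
        # Crop rows y0..y1-1 and columns x0..x1-1 from the image.
        crop = [row[x0:x1] for row in image[y0:y1]]
        results.append(((x0, y0, x1, y1), recognize(crop)))
    return results
```

Keeping the two stages behind plain callables makes it possible to swap either model without touching the other, which fits the stated goal of using the repository as a research platform.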

Requirements

Python 3.6.5
PyTorch 1.2
pyclipper
Polygon 3.0.8
OpenCV 3.4.1
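To check whether a local environment matches the versions listed above, a small script along these lines may help. It is a minimal sketch and not part of the repository: the helper name `installed_versions` is made up for illustration, and the import names (`torch`, `pyclipper`, `Polygon`, `cv2`) are assumed to be the usual module names for these packages.

```python
import importlib
import sys

# Import name of each required package mapped to the version listed above.
REQUIRED = {
    "torch": "1.2",
    "pyclipper": None,  # no version is pinned in the list above
    "Polygon": "3.0.8",
    "cv2": "3.4.1",
}


def installed_versions(modules=None):
    """Return {module name: version string, or None if the import fails}."""
    if modules is None:
        modules = REQUIRED
    found = {}
    for name in modules:
        try:
            module = importlib.import_module(name)
        except ImportError:
            found[name] = None
            continue
        # Not every package exposes __version__; fall back to "unknown".
        found[name] = str(getattr(module, "__version__", "unknown"))
    return found


if __name__ == "__main__":
    print("Python", sys.version.split()[0], "(listed: 3.6.5)")
    for name, version in installed_versions().items():
        listed = REQUIRED[name] or "any"
        print(f"{name}: installed={version or 'missing'}, listed={listed}")
```

The script only reports what it finds; it does not enforce the listed versions, since newer releases may or may not work with this code.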

Demo

Download the trained CRNN and PSEnet models from the links provided below.
Copy the paths of the models and paste them into params.py.
Run end-end.py:

python end-end.py --img [path to image] --e2e_config_name [end to end config name]
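If the demo needs to be launched from another Python script, the command above can be wrapped as in the sketch below. Only the script name and the two flags come from the command shown here; the helper names `build_demo_command` and `run_demo` are hypothetical, and the script is assumed to be run from the repository root.

```python
import subprocess
import sys


def build_demo_command(img_path, config_name, script="end-end.py"):
    """Build the argument list for the demo command shown above."""
    return [
        sys.executable, script,
        "--img", str(img_path),
        "--e2e_config_name", str(config_name),
    ]


def run_demo(img_path, config_name):
    """Run the demo script; raises CalledProcessError on a non-zero exit."""
    subprocess.run(build_demo_command(img_path, config_name), check=True)
```

For example, `run_demo("path/to/image.jpg", "my_e2e_config")` would invoke the demo with the current interpreter; both argument values are placeholders, not names shipped with the repository.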

Pre-trained Models

Both PSEnet and CRNN pre-trained models can be found here: gdrive

The PSEnet model is a multilingual text detector trained on MLT 2019; it works quite well.
The CRNN model recognizes Hindi, Bangla, Malayalam, Kannada, Tamil, Telugu, Odia, Sanskrit, and Marathi.

Download the models into the models/ directory and update the paths in params.py if required.
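Before running the demo, it can be useful to confirm that the downloaded weights are where they are expected. The sketch below makes several assumptions: the file names `psenet.pth` and `crnn.pth` are placeholders, and the helper `missing_models` is made up for illustration, since the actual file names and the variable names used in params.py are not given here.

```python
from pathlib import Path


def missing_models(paths):
    """Return the subset of paths that do not point to an existing file."""
    return [str(p) for p in map(Path, paths) if not p.is_file()]


# Placeholder file names; replace them with the files you actually downloaded.
EXPECTED = ["models/psenet.pth", "models/crnn.pth"]

if __name__ == "__main__":
    missing = missing_models(EXPECTED)
    if missing:
        print("Missing model files:", ", ".join(missing))
    else:
        print("All model files found.")
```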

Training instructions

To train your own detection model, refer to this file.
To train your own recognition model, refer to this file.

Samples

Contributors

Azhar Shaikh, PES University LinkedIn
Nishant Sinha, OffNote Labs

Work done as part of an internship with OffNote Labs.

References

If this repository helps you, please star it. Thank you!
