A curated list of papers and resources for scene text detection and recognition

Awesome Scene Text

Papers are listed under the year they were first published, including ArXiv preprints. As a result, a paper accepted to, for example, CVPR 2019 may be listed under 2018 because it first appeared on ArXiv in 2018.

Table of contents
1. Scene Text Detection
2. Weakly Supervised Scene Text Detection & Recognition
3. Scene Text Recognition
4. Other scene text related papers
5. Scene text survey

Scene Text Detection (including methods for end-to-end detection and recognition)

2010

  • Detecting text in natural scenes with stroke width transform [CVPR 2010] [paper]
  • A Method for Text Localization and Recognition in Real-World Images [ACCV 2010] [paper]

2011

2012

  • Real-time scene text localization and recognition [CVPR 2012] [paper]

2013

2014

  • Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees [ECCV 2014] [paper]

2015

  • Symmetry-based text line detection in natural scenes [CVPR 2015] [paper]
  • Object proposals for text extraction in the wild [ICDAR 2015] [paper]
  • Text-Attentional Convolutional Neural Network for Scene Text Detection [TIP 2016] [paper]
  • Text Flow: A Unified Text Detection System in Natural Scene Images [ICCV 2015] [paper]

2016

2017

2018

2019

  • MSR: Multi-Scale Shape Regression for Scene Text Detection [IJCAI 2019] [paper]
  • Scene Text Detection with Inception Text Proposal Generation Module [ICMLC 2019] [paper]
  • Towards Robust Curve Text Detection with Conditional Spatial Expansion [CVPR 2019] [paper]
  • Curve Text Detection with Local Segmentation Network and Curve Connection [ArXiv] [paper]
  • Pyramid Mask Text Detector [ArXiv] [paper]
  • Tightness-aware Evaluation Protocol for Scene Text Detection [CVPR 2019] [paper]
  • Character Region Awareness for Text Detection [CVPR 2019] [paper]
  • Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes [CVPR 2019] [paper]
  • TextCohesion: Detecting Text for Arbitrary Shapes [ArXiv] [paper]
  • Arbitrary Shape Scene Text Detection With Adaptive Text Region Representation [CVPR 2019] [paper]
  • Learning Shape-Aware Embedding for Scene Text Detection [CVPR 2019] [paper]
  • A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning [ACMMM 2019] [paper]
  • Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network [ICCV 2019] [paper]
  • Towards Unconstrained End-to-End Text Spotting [ICCV 2019] [paper]
  • TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting [ICCV 2019] [paper]
  • Convolutional Character Networks [ICCV 2019] [paper]

Weakly Supervised Scene Text Detection & Recognition

2017

  • Attention-Based Extraction of Structured Information from Street View Imagery [ICDAR 2017] [paper]
  • WeText: Scene Text Detection under Weak Supervision [ICCV 2017] [paper]
  • SEE: Towards Semi-Supervised End-to-End Scene Text Recognition [AAAI 2018] [paper]

Scene Text Recognition

2014

2015

2016

2017

2018

  • Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition [AAAI 2018] [paper]
  • SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network [AAAI 2018] [paper]
  • Edit Probability for Scene Text Recognition [CVPR 2018] [paper]
  • ASTER: An Attentional Scene Text Recognizer with Flexible Rectification [TPAMI 2018] [paper]
  • Synthetically Supervised Feature Learning for Scene Text Recognition [ECCV 2018] [paper]
  • Scene Text Recognition from Two-Dimensional Perspective [AAAI 2019] [paper]
  • ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification [CVPR 2019] [paper]

2019

Script Identification

Other scene text related papers

2016

2019

  • Scene Text Synthesis for Efficient and Effective Deep Network Training [ArXiv] [paper]

Scene text survey

2018

  • Scene Text Detection and Recognition: The Deep Learning Era [ArXiv] [paper]

2019

  • Scene text detection and recognition with advances in deep learning: a survey [IJDAR 2019] [paper]
Owner
Jan Zdenek